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Abstract.  Using  intensity  autocorrelation  of  multiply  scattered  light,  we  show  that  the  increase 
in  interparticle  interaction  in  dense,  binary  colloidal  fluid  mixtures  of  particle  diameters  0-115  /^m 
and  0-089 /im  results  in  freezing  into  a  crystalline  phase  at  volume  fraction  <j)  of  0-1  and  into 
a  glassy  state  at  (j)  =  0-2.  The  functional  form  of  the  field  autocorrelation  function  g(1)(i)  for  the 
binary  fluid  phase  is  fitted  to  exp[—  y(6k^De{ft)112']  where  fe0  is  the  magnitude  of  the  incident 
light  wavevector  and  y  is  a  parameter  inversely  proportional  to  the  photon  transport  mean  free 
path  /*.  The  Deff  is  the  /*  weighted  average  of  the  individual  diffusion  coefficients  of  the  pure 
species.  The  /*  used  in  calculating  Deff  was  computed  using  the  Mie  theory.  In  the  solid  (crystal  or 
glass)  phase,  the  g(i}(t)  is  fitted  (only  with  a  moderate  success)  to  exp  [  —  y(6k20  W(t))l/2  ]  where  the 
mean-squared  displacement  W(t)  is  evaluated  for  a  harmonically  bound  overdamped  Brownian 
oscillator.  It  is  found  that  the  fitted  parameter  y  for  both  the  binary  and  monodisperse 
suspensions  decreases  significantly  with  the  increase  of  interparticle  interactions.  This  has  been 
justified  by  showing  that  the  calculated  values  of  /*  in  a  monodisperse  suspension  using  Mie 
theory  increase  very  significantly  with  the  interactions  incorporated  in  /*  via  the  static  structure 
factor. 

Keywords.    Colloids;  dynamic  light  scattering;  crystal  and  glass  transitions. 
PACS  Nos    82-70;  64-70;  42-20;  05-40 

1.  Introduction 

Charged  stabilized  aqueous  colloidal  suspensions  of  polystyrene  particles  (polyballs) 
and  sterically  stabilized  nearly  hard  sphere  colloids  of  polymethyl  methacrylate 
particles  have  proved  to  be  model  condensed  matter  systems  to  study  a  rich  variety  of 
cooperative  behaviour  in  equilibrium  and  non-equilibrium  conditions  [1],  These 
systems  show  liquid,  crystal  and  even  glassy  states  under  suitable  experimental 
conditions.  Monodisperse  polyball  suspensions  show  either  a  body-centered  cubic 
(BCC)  phase  at  low  volume  fraction  0  or  a  face-centered  cubic  (FCC)  phase  at  high  0. 
Small-angle  neutron  scattering  measurements  [2]  of  the  structure  factors  reveal 
a  glassy  phase  in  these  systems  for  $  >  0-2.  Monodisperse  suspensions  of  polymethyl 
methacrylate  particles  have  been  studied  extensively  from  the  point  of  view  of  freezing 
of  a  liquid  to  an  equilibrium  crystalline  phase  or  a  metastable  glassy  state  as  a  function 
of  volume  fraction  0  [3-5]. 
In  comparison  with  monodisperse  suspensions,  much  less  attention  has  been  paid 
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and  complex  with  the  introduction  of  two  more  parameters,  namely,  the  diameter  ratio 
(7[=crA/o-B,<r<  1]  and  the  relative  number  density  of  the  small  particles  x[=nA/ 
(nA  +  MB)],  to  characterize  binary  mixtures.  Several  compound  crystalline  structures 
have  been  identified  [6, 7]  in  the  optical  microscopy  of  concentrated  (</>  ~  0-3)  aqueous 
suspensions  of  binary  charged  polystyrene  spheres  with  diameters  between  0-2  and 
0-8  /am.  The  observed  structures  were  similar  to  those  found  in  the  intermetallic 
compounds  MgCu2,  A1B2,  CaCu5,  NaZn13  and  also  of  the  type  AB4  with  J°63/mmc 
symmetry,  which  has  no  counterpart  in  the  conventional  atomic  systems.  Bartlett  et  al 
[8]  have  studied  the  phase  diagram  of  binary  mixtures  of  nearly  hard-sphere  polyballs 
for  some  values  of  a.  These  studies  confirm  the  existence  of  AB2  and  ABj  3  superlattices 
in  the  (0A,  </>B)  phase-diagram.  In  addition  to  the  crystalline  phase,  glassy  state  has  also 
been  observed  in  polyball  binary  mixtures  in  measurements  of  shear  modulus  [9], 
static  structure  factor  [10],  diffusing  wave  spectroscopy  [11]  and  Brownian  dynamics 
simulations  [12,13].  The  ease  in  the  formation  of  glassy  state  in  binary  colloidal 
mixtures  or  charge-polydisperse  colloids  [12, 14]  is  similar  to  the  fact  thatmulticompo- 
nent  solid  solutions  in  conventional  atomic  systems  are  good  glass  formers  [15]. 

Dynamics  of  the  colloidal  systems  have  been  extensively  studied  by  dynamic  light 
scattering  (DLS)  experiments  where  the  temporal  fluctuations  of  the  scattered  light  are 
analyzed  in  real  time  in  terms  of  the  intensity  autocorrelation  functions  [16, 17].  In 
using  the  DLS  technique,  a  stringent  requirement  is  that  the  incident  radiation  should 
by  scattered  only  once  in  the  medium.  Multiple  scattering  is  avoided  either  by  matching 
the  refractive  indices  of  the  particles  and  the  solvent  [5]  or  by  taking  a  low  concentra- 
tion of  particles  or  thin  sample  cells.  DLS  measurements  of  the  intermediate  scattering 
functions  have  yielded  valuable  information  on  the  non-ergodicity  parameters  across 
the  kinetic  glass  transitions  [4]  allowing  a  direct  comparison  with  the  mode-coupling 
theory  predictions  [18]. 

The  recently  developed  technique  of  the  diffusing- wave  spectroscopy  (DWS)  [19- 
22]  has  made  it  possible  to  study  the  particle  dynamics  from  the  intensity  correlation 
functions  in  concentrated  interacting  suspensions.  Subsequently,  useful  information 
'about  the  polarization  dependence  in  the  freezing  of  monodisperse  polyball  suspen- 
sions [23]  and  also  about  the  particle  diffusion  in  the  binary  polyball  mixtures  are 
obtained  by  using  DWS  [24].  During  the  course  of  our  studies,  using  DWS  experi- 
ments on  binary  mixtures  of  strongly  interacting  polyballs,  Meller  and  Stavans  [11] 
have  reported  comprehensive  phase  diagrams  (0  versus  x)  consisting  of  liquid,  crystal- 
line and  glassy  states  for  three  values  of  a,  namely,  0-54  +  0-02,  0-78  +  0-04  and 
0-87  +  0-03.  The  different  states  of  the  system  were  confirmed  from  the  presence  or 
absence  of  Bragg-iridescence  from  the  samples  and  the  long-time  temporal  decay  or 
saturation  of  the  intensity  autocorrelation  functions.  Our  Brownian  dynamics  (BD) 
simulations  [13]  on  binary  mixtures,  with  cr  =  0-495  and  x  =  0-5,  reveal  a  liquid  to 
crystal  transition  at  0  ~  0-2  and  a  liquid  to  glass  transition  at  $  ~  0-3.  This  is  quite 
different  from  the  results  read  from  Meller  and  Stavans's  [11]  phase  diagram  with  the 
parameters  closest  to  our  simulation,  namely,  a  =  0-54  ±  0-02  and  x  =  0-5.  In  their 
experiments  [11],  the  mother  samples  of  10%  volume  fraction  were  treated  with 
ion-exchange  resins  to  reduce  the  impurity  ion  concentration  nt  till  these  showed  strong 
crystalline  iridescence.  The  samples  with  different  </>  and  x,  on  which  the  measurements 
were  performed,  were  then  made  out  of  these  mother  samples.  In  the  simulations,  on  the 
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preparations  in  the  two  cases,  one  must  carry  out  DWS  experiments  and  look  for  the 
liquid  to  crystal  and  the  liquid  to  glass  transitions  on  concentrated  binary  polyball 
mixtures,  prepared  similarly  as  in  the  simulation.  This  is  the  motivation  of  the  present 
work.  We  indeed  find,  in  accordance  with  our  simulation  [13],  that  a  binary  liquid 
mixture  freezes  into  a  crystalline  phase  at  a  lower  volume  fraction  (</>  =  0-1),  while  it 
forms  a  glassy  state  at  </>  =  0-2.  Along  the  way,  the  DWS  intensity  autocorrelation 
functions  of  the  binary  liquid  phase  were  analyzed  in  terms  of  effective  diffusion 
constants  [22],  wherein  the  transport  means  free  path,  I*  [25],  has  been  incorporated 
using  the  Mie  scattering  theory  [26].  The  slope  parameter  y,  to  be  explained  in  the 
following  subsection  [eq.  (9)],  shows  a  systematic  dependence  on  the  interaction 
between  the  particle.  This  has  been  understood  by  realizing  that  y  is  inversely 
proportional  to  /*,  which  increases  with  the  interaction  strength. 

2.  Theoretical  background 

DWS  extends  DLS  to  the  optically  turbid  media  and  relates  the  temporal  fluctuations 
of  the  multiply  scattered  light  to  the  motion  of  the  scatterers.  The  transport  of  photon  is 
assumed  to  be  diffusive  with  a  diffusion  constant  D,  =  c/*/3,  where  c  is  the  velocity  of  light 
in  the  medium.  The  transport  mean  free  path  /*  is  related  to  the  scattering  mean  free  path 
/  (the  mean  distance  between  successive  scattering  events)  by  /*//  =  2k^/(q2  ),  where  fc0  is 
the  photon  wave  vector  in  the  medium  and  <g2  >  is  the  mean  squared  scattering  wave 
vector.  The  normalized  temporal  field  autocorrelation  function  is  defined  by 


^.  (1) 

Within  DWS,  for  the  noninteracting  monodisperse  Brownian  particles,  it  is  given  by  [22] 

(2) 


where  P(s)  is  the  distribution  of  path-lengths  5  for  the  random  walk  of  photons  and 
1 


T  " 


(3) 


Here  D  is  the  free  particle  diffusion  coefficient. 

For  the  "backscattering"  geometry  and  the  source  at  a  fixed  distance,  z0  ~  I*,  inside 
from  the  illuminated  face  at  z  =  0  of  a  slab  of  thickness  L,  one  obtains  [22] 


G<»(0= 


2     6t      ,f     6tfL     z0 
+  -     —coshl     —        - 


(4) 
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For  a  sample  of  infinite  thickness  (L-*  oo),  (4)  simplifies  to 

-(zo/O(6r/To)"2 


(5) 


A  distribution  in  the  position  z0  of  the  source  results  in  the  replacement  of  z0/l*  by 
<z0  >//*.  It  has  been  found  experimentally  that  the  g(1)(t)  can  be  simply  described  using 
[19,20] 

0(1)W  =  exp[-y(6t/T0)1/2],  (6) 

where  y  =  <z0  >//*  +  2/3,  i.e.,  y  oc  I//*  [22].  When  /*//  =  10,  then  y  ^  2  for  both  polarized 
and  depolarized  scattering  [27]. 

For  the  case  of  interacting  particles,  Mackintosh  and  John  [27]  have  shown  that 
g(l)(t)  is  related  to  the  dynamic  structure  factor  S(q,  t)  such  that  2t/x0  in  (2)  is  replaced  by 
(/*//)[!  -  S(q,  f)/S{q,  0)]  where  S(q,  t)  is  defined  by 

(7) 

For  short  times,  S(q,  t)  can  be  expanded  to  linear  power  in  t  and  hence  it  amounts  to 
replacing  t/xQ  by  t/t'0  where  TQ/TO  =  (q2S(q)y/(q2y  [22].  Thus  the  time  scale  of  the 
decay  of  g(1}(t)  is  modified  and  its  functional  form  (i.e.  square  root  singularity)  for  short 
times  is  not  affected  by  interactions. 

The  form  of  g(l}(t)  for  all  times  has  been  obtained  [27]  only  under  the  simplifying 
assumption  that  the  interparticle  spacing  is  larger  than  the  wavelength  and  the  particle 
size,  so  that  the  particle  vibrations  in  a  colloidal  crystal  or  a  glass,  around  their  mean 
positions  may  be  considered  independent  of  each  other.  In  addition,  due  to  the  viscosity 
of  the  fluid,  phonon-like  motions  are  strongly  overdamped.  They  have  also  assumed 
large  wavevector  transfers  q  which  correspond  to  distances  smaller  than  the  mean 
interparticle  spacing.  With  these  simplifying  assumptions,  for  the  long  paths,  for  which 
the  scattered  light  is  still  diffusive,  the  dynamic  structure  factor  is  replaced  by  its 
self-part  Ss(q,  t)  given  by 

S-fo'H  (-iZexp[fti-(ra(t)-ra(0))]\  (8) 

" 


where-  W(t)  is  the  mean  square  displacement  (MSD)  of  a  particle  in  time  t.  Then 

],  (9) 


Since  W(t)  shows  a  non-decaying  part  at  long-times  in  a  solid,  eq.  (9)  will  predict  a 
non-zero  g(i](t)  at  large  times  for  both  polarized  and  depolarized  scattered  light. 

The  MSD  can  be  evaluated  by  considering  a  model  for  the  interacting  Brownian 
particles.  For  a  one-dimensional  harmonically  bound  Brownian  particle  of  mass  M  and 
diameter  crp  =  2ap,  suspended  in  a  solvent  of  viscosity  77,  the  velocity  auto-correlation 
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function  in  the  strongly  overdamped  case  ((3/co  »  1)  is  given  by  [16] 

(10) 


where  a>  is  the  angular  velocity  associated  with  the  harmonic  potential,  /?  =  T  ~  1  with 
tv  =  M/6nrjap  as  the  Brownian  velocity  fluctuation  time.  This  model  applies  strictly  to 
the  motion  of  a  single  particle  in  a  colloidal  crystal  if  the  combined  interparticle  forces 
are  treated  in  a  harmonic  approximation.  It  has  been  suggested  [16]  that  the  model  can 
also  be  applied  to  the  liquid-like  ordering,  if  the  similar  form  for  the  velocity 
autocorrelation  function  is  retained,  i.e., 

(11) 


M 

where  one  expects  that  A«k3T/M  and  the  fluctuation  time  associated  with  the 
interaction  forces  T,  »  TU.  Within  this  model  for  times  t»rv,  the  MSD  is  given  by 

W(t)  =  Dt  —  At^t  +  T,exp{  —  t/Tj}  —  T,),  (12) 

where  the  parameter  A  satisfies  XT,  <  D. 

Let  us  now  discuss  DWS  from  a  binary  colloidal  systems  with  species  of  different 
sizes.  It  has  been  shown  that  for  noninteracting  particles,  the  g(i)(t)  is  given  by  (2),  where 
TO  [eq.  (3)]  is  replaced  by  reff  [22], 


f)1/2),  (13) 

where 

l/Teff  =  -^effN)' 


The  effective  diffusion  constant  Deff  is  the  weighted  average  of  the  individual  diffusion 
coefficients  of  the  pure  species,  the  values  of  which  are  calculated  via  Stokes'  law  [28], 
i.e.,  Di  ~  kB  T/(6nr)ai)  where  r\  is  the  viscosity  of  water  (=  0-01089  P)  and  a,  are  the  radii 
of  the  particles.  The  weighting  factors  in  (14),  i.e.,  the  inverse  transport  mean  free  paths 
l//f  for  the  individual  pure  species  are  summed  to  yield  the  optical  analogue  of  the 
resistivity  of  the  binary  liquid  alloy  [29] 

V£f  =  lW-  (15) 

i 

As  stated  above,  the  key  quantity  within  the  diffusion  approximation  of  the  photon 
transport  is  the  photon  random-walk  step  length  I*,  which  depends  on  the  number  of 
scattering  events  needed  to  randomize  the  direction  of  photon.  The  number  of 
scattering  events,  in  turn,  depends  on  the  scattering  properties  of  the  individual 
particles  as  well  as  the  interparticle  spatial  correlations  [21,  22].  A  general  expression 
for  /*  for  the  monosized  particles  calculated  [21,  30,  31]  within  the  Mie  theory  [26]  is 
summarized  in  the  Appendix. 
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In  the  laboratory,  one  measures  the  normalized  temporal  intensity  autocorrelation 
function  gz(i)  which  is  related  to  c/(1)(t)  by  the  Siegert's  relation,  i.e., 

02(0  =  0(2)(0-1=/I0(1)WI2]  (16) 

The  parameter/  is  fixed  by  the  system  optics  and  can  be  taken  as  a  free  parameter  in 
fitting  (6)  and  (16)  to  the  relevant  data. 

3.  Experiments 

We  use  aqueous  suspensions  of  charged  polystyrene  spheres  (M/s  Seradyn  Inc.,  USA) 
of  diameters  aK  =  0-115  /an  and  crA  =  0-089  /im.  The  quoted  standard  deviation  of  the 
diameters  is  less  than  1%.  Two  sets  of  samples  are  prepared  from  these  mother 
solutions.  In  one  set,  the  samples  of  the  small  and  the  large  spheres  are  obtained 
independently  from  their  respective  mother  solutions  by  adding  required  amounts  of 
triple-distilled,  deionized  water,  so  that  the  final  volume-fraction  of  each  of  them 
becomes  (/>A  =  4>E  =  0-05.  Equal  volumes  of  these  samples  are  then  mixed  to  get  total 
0  =  0-1  in  a  cylinderical  quartz  scattering  cell  of  8mm  inner-diameter  containing 
a  mixed-bed  of  ion-exchange  resins  [AG501-X8(D),  M/s  Bio-Rad  Inc.,  USA]  at  the 
bottom  of  the  cell  to  reduce  the  coion  concentration  n{.  The  cell  is  then  closed  air-tight 
with  properly  cleaned  teflon  cap  and  parafilm.  In  the  other  set,  the  mother  samples  of 
the  small  and  the  large  spheres  with  <£A  =  0B  =  0-1  are  mixed  in  1 : 1  volume-ratio  (total 
cj)  =  0-2)  in  a  similar  quartz  sample  cell  with  ion-exchange  resins.  The  volume-fractions 
of  the  constituent  solutions  were  confirmed  to  be  within  an  experimental  accuracy  of 
5%  by  drying  a  known  volume  of  the  suspension  and  weighing  the  solids.  After  an 
initial  vigorous  mixing  of  the  samples  with  the  resins,  the  cells  are  transferred  into  their 
respective  quartz  glycerin  baths  and  left  undisturbed  for  about  an  hour  before  the  data 
accumulation  begin.  This  is  necessary  to  assure  a  proper  gravitational  settling  down  of 
the  resins  in  the  cell  as  well  as  uniformity  of  the  glycerin  without  any  air-bubbles 
surrounding  the  cell.  A  linearly  polarized  beam  from  Kr+  laser  (/I  =  647-1  nm)  falls  on 
one  side  of  the  sample  and  the  scattered  light  of  desired  polarization  is  collected  from 
the  same  side  (scattering  angle  6  =  165°).  The  normalized  intensity  autocorrelation 
g2(t)  [eq.  (16)]  is  measured  using  the  Malvern  correlator  (model  7032CE).  The  laser 
light  is  vertically  (V)  polarized.  The  emergent  multiply  scattered  light  has,  in  general, 
a  polarized  (vertical)  component  with  intensity  7v(t)  and  a  depolarized  (horizontal) 
component  with  intensity  JH(t).  By  setting  the  orientation  of  the  analyzer,  g2(t)  was 
measured  in  either  polarized,  i.e.,  Cvv(t)  =  </v(f)/v(°)>/<7v>2  —  1  or  depolarized,  i.e., 
C^H(t)~<(IH(t)IH(Q)y/^IHy2 —  I  component.  We  note  that  although  in  normal  iso- 
tropic  systems  the  measured  correlation  functions  Cvv(£)  and  CVH(f)  behave  in 
qualitatively  the  same  way,  they  give  us  access  to  different  physical  properties.  While 
both  respond  to  translational  motion  [23,27, 32],  only  CVH  is  sensitive  to  changes  in 
the  local  dielectric  anisotropy  and  hence  contains  information  about  the  correlations  of 
orientational  fluctuations  as  well.  The  field  autocorrelation  g(1)(t)  in  the  backscattering 
geometry  is  extracted  using  Siegart's  relation  eq.  (16). 

We  note  that  since  the  system  is  in  a  frozen  but  disordered  (i.e.  non-ergodic)  state,  we 
are  obliged  to  average  over  space  as  well  as  time  to  sample  a  representative  fraction  of 
all  the  configurations.  To  this  end,  we  have  averaged  the  correlation  functions  obtained 
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from  ten  spatially  separated  regions  in  the  suspension.  Each  of  these  regions  is  achieved 
simply  by  rotating  the  sample  cell  in  its  position.  The  curves  so  obtained  are  labelled 
ensemble-averaged,  while  those  from  a  single  region  are  called  time-averaged. 

4.  Results  and  discussions 

Our  fluid-phase  data  for  the  total  0  =  0-1  mixtures  of  the  small  and  the  large  polyballs, 
before  putting  in  the  ion-exchange  resins,  are  shown  by  open  symbols  in  figure  1.  The 
measured  g2(t)  were  non-linear  least-square  fitted  by  a  simulated  annealing  Monte 
Carlo  fitting  method  [33]  using  (16)  and  (13)  with/  and  y  as  fitting  parameters.  The 
values  of/).,  If  (as  calculated  [34]  from  the  Mie  theory)  for  the  two  constituent  particle 
suspensions  and  also  the  values  of  Deff ,  are  presented  in  table  1.  The  tcff  was  calculated 
using  (14).  The  values  of  the  fitting  parameters  are  given  in  the  table  2.  Given  the  fitted  value 
of/,  the  data  have  been  presented  in  the  form  of  normalized  #(1)(t)  (experimental  g(i)(t)/f) 
in  figure  1.  The  values  of  the  parameter  y  for  the  polarized  and  depolarized  scattering 
are  comparable  with  those  in  the  monodisperse  colloids  [22, 23, 32].  The  polarization 
dependence  of  y  can  be  simply  understood  as  follows  [22].  The  autocorrelation 
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Figure  1.    The  fluid-phase  time-averaged  data  for  the  polarized  (VV)  and  the 
depolarized  (VH)  field  autocorrelation  functions  g(l](t)  versus  tl/2  in  the  backseat- 


Table  1.  The  parameters  relevant  to  the  Mie  the- 
ory fits  in  figure  1  and  the  inset  of  figure  3,  with 
k0  =  1297724cm" l.  The  parameters  at  and  /*  are  in 
jim,  while  the  Di  or  £>eff  are  in  l(T8cm2/s. 


0-089 

3-17 

232-77 

3-58 

0-115 

4-10 

295-82 

Table  2.  The  fitting  parameters  corresponding  to  figures  1  to  5,  with 
/c0  =  1-29  x  10s  cm"1.  Samples  I  and  II  are  1:1  binary  mixtures  of  0-1 15  jum  and 
0-089  fim  diameter  particles  with  total  </>  =  0-1  and  0-2,  respectively.  The  monodis- 
perse  sample  III  has  <£  =  0-03  and  diameter  0-115/xm.  The  De{(  is  in  units  of 
10~8cm2/s,  A  in  units  of  10~3,  T,  in  units  of  10~3s.  Liquid:  L,  Crystal:  C,  Freezing: 
F  and  Glass:  G,  Pol:  polarization. 
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State 


Pol 


D 


I 

L 

VV 

3-578 



— 

0-9 

2-0367 

I 

L 

VH 

3-578 

— 

— 

0-9 

3-1537 

I 

F 

VV 

3-578 

1-116 

3-204 

0-118 

0-0932 

I 

F 

VH 

3-578 

0-4456 

8-029 

0-317 

0-1072 

I 

C 

VV 

3-578 

0-6348 

5-526 

0-145 

0-0203 

I 

C 

VV 

3-578 

5-217 

6-822 

0-196 

0-0762 

II 

L 

VV 

3-217 

— 

— 

0-8 

2-332 

II 

L 

VH 

3-217 

— 

— 

0-8 

3-763 

II 

G 

VV 

3-217 

0-1046 

30-723 

0-226 

0-0018 

II 

G 

VH 

3-217 

1-2492 

2-349 

0-237 

0-0049 

III 

L 

VV 

3-473 

— 

— 

0-99 

1-89 

III 

L 

VH 

3-473 

— 

— 

1-06 

2-87 

III 

F 

VV 

3-473 

1-816 

1-912 

0-7 

0-988 

III 

C 

VV 

3-473 

1-732 

1-990 

0-6 

0-723 

III 

C 

VH 

3-473 

5-926 

0-5817 

0-6 

0-463 

function  for  the  perpendicular  polarization  (VH)  decays  more  rapidly  because  the 
analyzer  discriminates  against  the  short  photon  paths  which  retain  a  high  degree  of 
their  incident  polarization.  On  the  other  hand,  the  autocorrelation  function  for  the 
parallel  polarization  decays  more  slowly,  due  to  the  additional  contributions  of  the  low 
order  scattering  paths  which  have  a  longer  decay  time.  In  other  words,  for  very 
short-times  (shorter  than  that  shown  in  figure),  gw(t)  is  dominated  by  long  paths  for 
which  the  observed  photons  have  lost  their  memory  of  their  incident  polarization  and 
hence  Cvv  and  CVH  will  initially  decay  at  the  same  absolute  rate.  This  means 
(Cvv  -  CVH)  will  be  independent  of  time  to  first  order  in  ^ft.  Since  the  short  paths  tend 
not  to  alter  the  polarization  of  the  incident  light,  /vv  is  in  general  higher  than  7VH  [in 
our  experiments  IVV/IVH  ~  3]- 
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After  these  initial  measurements,  the  samples  cells  are  left  undisturbed.  Typically  about 
2  days  later,  the  0  =  0-1  mixture  started  showing  Bragg-iridescence  in  the  immediate 
vicinity  of  the  ion-exchange  resins-bed.  The  crystalline  region  started  growing  slowly 
till  the  whole  of  the  sample  in  the  cell  showed  the  iridescence.  Under  this  condition,  the 
correlation  functions  measured  for  different  polarization  channels  are  presented  in  figure  2, 
for  two  cases  namely,  (a)  in  the  region  of  1-5  mm  above  the  top  of  the  resins-bed  at  around 
174  h  after  the  addition  of  resins  (sample  I,  state  C  in  table  2)  and  (b)  at  the  height  of  4mm 
from  the  top  of  the  resins-bed  and  time  194  h  (sample  I,  state  F  in  table  2).  The  temporal 
correlation  functions  for  different  polarization  channels  follow  the  similar  trends  in  the 
crystallizing  samples.  The  form  of  the  correlation  function  gw(t)  is  same  as  predicted  by  (9). 
We  note  that  the  mean  interparticle  spacing  is  almost  one  third  of  wavelength  of  light  and 
hence  the  assumption  of  retaining  only  the  self  part  of  the  structure  factor  in  deriving  (9)  is 
not  fully  satisfied.  However,  since  (9)  is  the  only  closed  form  expression  available  for  the 
gw(t)  for  the  interacting  colloids,  we  have  chosen  to  use  it  to  fit  our  experimental  data  to  get 
some  estimates  of  the  parameters  y,f,  A  and  T,.  The  solid  lines  show  the  fit  to  (9)  and  (12)  by 
varying  f,*c\,y  and  A.  The  values  of  the  fitting  parameters  are  provided  in  the  respective 
figures  as  well  as  in  table  2.  Given  the  value  of  /,  here  too  the  data  are  presented  in  the 
normalized  form  g(1)(t).  We  note  that  the  fits,  though  excellent  at  long  times,  are  not 


t1'2  (secltt) 

Figure  2.  The  normalized  g(l\t)  versus  r1/2  for  the  different  stages  of  crystallization 
in  sample  I  ($  =  0-1)  table  2.  The  data  are  shown  in  (a)  the  frozen  state  (state  C, 
table  2)  and  (b)  while  freezing  (state  F,  table  2).  The  solid  lines  are  the  fits  to  eqs  (7) 
and  (10).  The  fitted  parameters  are  given  in  table  2. 
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than  1  at  short  times. 

The  situation  is  quite  different  with  the  total  <£  =  0-2  mixture.  The  sample  does  nol 
Bragg-iridescence  even  a  few  weeks  after  the  addition  of  resins.  The  autocorre 
functions  measured  after  a  week  from  the  addition  of  resins  (figure  3)  show  significan 
zero  values  at  long  time,  similar  to  that  shown  in  figure  2.  The  absence  of  Bragg-iridei 
and  non-decaying  intensity  autocorrelation  functions  show  that  the  particles  are  trap 
a  disordered  state,  as  in  a  conventional  glass.  For  comparison,  the  inset  shows  the 
state  data  for  the  sample  (total  <j)  =  0-2)  but  at  a  high  impurity  concentration.  Here 
the  solid  lines  are  the  best  fits  to  the  data  using  ( 1 3)  with  /  and  y  as  fitting  parameters  i 
liquid  state  data  in  the  inset  and  using  (9)  and  (12)  (by  varying  /,  T,,  y  and  A)  for  ttu 
data.  The  relevant  fitting  parameters  are  listed  in  table  2.  The  quality  of  fit  is  poor  at 
time  and  hence  the  extracted  normalized  g(1\t)  is  again  slightly  higher  than  unity  f 
glass  data.  Hence  our  experiments  clearly  show  that  the  binary  mixture  with  a  total  <j6 
freezes  into  a  crystalline  state  whereas  for  $  =  0-2,  the  interacting  state  of  the  mixtur* 
not  show  a  crystalline  phase  even  after  a  few  months  and  is  best  described  as  a  glass 
observation  is  similar  to  the  predictions  of  our  simulation  work  [13].  As  mentioned  e 
the  difference  between  our  results  and  those  of  Meller  and  Stavans  [11]  lies  in  th< 
the  interaction  is  allowed  to  build  up  in  the  samples. 
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Figure  3.  Same  as  in  figure  2,  but  for  the  glass  state  with  total  0  =  0-2  at  the  h 
of  4mm  from  the  top  of  the  resins-bed  and  171  h  after  the  addition  of  resins 
inset  shows  the  data  and  the  fits  for  the  corresponding  noninteracting  limit. 
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It  is  interesting  to  compare  the  values  of  the  parameter  y  in  table  2  obtained  in  the 
interacting  colloids  with  the  corresponding  values  in  the  noninteracting  limit  (figure  1 
and  the  inset  of  figure  3).  We  note  that  y  (for  a  given  scattering  geometry)  decreases  as 
the  interaction  between  the  particles  increases  towards  forming  a  crystal  or  a  glass.  To 
ascertain  that  this  is  a  property  of  the  freezing  suspensions,  we  show  the  ensemble- 
averaged  g(1](t)  for  a  monodisperse  suspension  (diameter  crp  =  0-115/im  and  volume 
fraction  0  =  0-03)  in  figure  4.  The  panel  (a)  shows  the  data  for  the  parallel  polarization 
channel  (i.e.  VV)  of  the  system  while  freezing  and  the  panel  (b)  shows  both  the  parallel 
(VV)  and  the  perpendicular  (VH)  channels  in  the  state  when  the  suspension  has 
completely  frozen  into  a  crystalline  order.  In  the  state  of  freezing,  as  shown  in 
figure  4(a),  the  sample  shows  the  polarization  dependence  [23]  of  the  correlation 
functions  i.e.,  CVH  shows  a  liquid-like  curve  and  hence  the  corresponding  g(l](t)  is  not 
plotted.  The  value  of  the  parameter  y  indeed  decreases  with  the  interaction  as  can  be 
seen  clearly  by  comparing  figure  4  with  the  fluid-phase  data  shown  by  open  circles  and 
diamonds  in  figure  5  (see  also  table  2).  In  figure  5,  we  have  also  compared  these  with  the 
initial  part  of  the  VV  data  in  the  crystallizing  state  (shown  by  + ).  We  see  clearly  that  as 
the  interparticle  interaction  becomes  important,  the  decay  of  g(l](t)  is  different  from 
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Figure  4.  The  semilog  plots  of  the  ensemble-averaged  g(  l ' (t)  versus  t l /2  (in  sec * /2 ) 
of  0-1 15  fj.m  polyball  suspension  with  </>  =  003  (a)  while  freezing  and  (b)  in  the 
crystalline  state  alongwith  the  fits  (solid  lines)  to  a  harmonic  model  as  explained  in 
the  text,  with  parameters  in  table  2. 
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Figures.    The  fluid  phase  VV  and  VH  time-averaged  g(1}(t)  versus  t1/2  (in  sec1/2)  for 
0  =  0-03  is  compared  with  the  initial  times  data  in  the  crystallizing  state. 


that  without  interaction,  even  at  very  early  times.  This  is  demonstrated  by  fitting  (6)  to 
the  first  10  data  points  of  the  crystallizing  suspension,  shown  by  the  dashed  line  and  the 
corresponding  parameters  are  given  in  table  2. 

As  we  have  mentioned  before,  the  parameter  y  is  inversely  proportional  to  the 
transport  mean  free  path  /*,  i.e.,  y  oc  I//*  [22].  Therefore,  the  result  that  the  magnitude 
of  y  reduces  with  the  growth  of  interaction  in  the  system  can  be  justified  if  we  can  show 
that  /*  increases  with  the  interaction.  We  do  this  only  for  the  monodisperse  suspensions 
(see  Appendix  for  details).  We  calculate  the  I*  [eq.  (17)  of  Appendix]  by  using  the  form 
factor  F(q)  from  the  Mie  scattering  theory  [eq.  (18)]  and  incorporate  the  interparticle 
interaction  through  the  structure  factor  S(q)  which  can  be  calculated  using  either 
rescaled  mean  spherical  approximation  or  Percus-Yevick  approximation.  For  the 
present  purpose,  it  suffices  to  note  that  the  interacting  charged  colloids  can  be 
approximated  to  effective  hard  spheres  by  rescaling  the  volume  fraction  and  therefore 
the  Percus-Yevick  approximation  for  the  hard  sphere  system  for  which  analytical 
results  are  known  has  been  used  as  an  input  S(q)  in  (17).  The  PY  approximation,  though 
not  as  good  as  mean  spherical  approximation  for  the  S(Q)  of  colloids,  has  been  used 
because  of  the  simple  closed  form  expression  [see  (21)  of  the  Appendix].  Our  purpose  is 
only  to  show  qualitatively  that  /*  increases  with  the  strength  of  interactions.  In  figure  6 
we  have  shown  the  Percus-Yevick  S(q)  for  nine  such  different  rescaled  volume  fractions 
0*.  The  height  of  the  first  peak  of  the  structure  factor,  5max,  can  be  taken  as  a  scale  to 
measure  the  interparticle  interaction.  Figure  7  represents  the  calculated  values  of  /*  for 
these  S(q)  as  a  function  of  the  interparticle  interaction  measured  by  Smax.  We  clearly  see 
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Figure  6.    The  static  structure  factor  S(q)  in  the  Percus-Yevick  approximation  as 
a  function  for  ten  different  values  of  the  effective  hard  sphere  volume-fraction  <£*. 
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Figure  7.    The  calculated  values  of  /*  (in  fj.m)  from  the  Mie  theory  (refer  text)  for 
different  S(q)'s  of  figure  6  as  a  function  of  Smax,  the  first  peak  height  of  the  S(q). 
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that  /*  increases  almost  linearly  with  the  interaction.  This  qualitatively  explains  our 
results  of  the  reduction  of  y  with  the  increase  of  the  interaction  strength. 

5.  Conclusions 

In  this  paper,  we  have  reported  our  observation  of  the  formation  of  a  crystalline  phase 
when  the  interparticle  interactions  are  increased  in  an  equimolar  binary  mixture  of 
polyball  suspensions  with  a  volume-fraction  <jf>  =  0-  1  .  In  comparison,  the  interactions  in 
the  same  binary  system  with  $  -  02  results  in  a  metastable  glassy  state.  This  is  in 
accord  with  our  results  in  the  BD  simulations  [13]  of  binary  colloidal  mixtures.  We 
have  also  shown  that  the  g(l}(t)  of  the  binary  mixture  in  the  non-interacting  fluid  phases 
can  be  well-reproduced  by  replacing  the  TO  in  the  functional  form  of  the  correlation 
functions  of  the  monodisperse  suspensions  by  a  properly  weighted  average  reff  [eq. 
(13)],  where  /*  are  calculated  using  Mie  scattering  theory.  A  model  of  the  harmonically 
bound  Brownian  particle  is  used  to  calculate  the  root  mean  square  displacements  and 
the  functional  form  of  g(l)(t)  suggested  by  MacKintosh  and  John  [27]  [eq.  (9)]  is  shown 
to  fit  with  a  moderate  success  the  results  from  the  binary  as  well  as  monodisperse 
polyball  suspensions  while  freezing  into  the  crystalline  or  glassy  states.  These  fits,  though 
qualitative  in  nature  due  to  the  approximate  form  of  (9),  indicate  that  the  parameter 
y  decreases  as  the  particles  in  the  suspensions  start  interacting  with  each  other.  This 
result  is  justified  in  the  case  of  monodisperse  suspensions  by  showing  that  the  transport 
mean  free;  path  /*  which  is  inversely  proportional  to  y  increases  with  the  interaction 
between  the  particles.  In  conclusion,  DWS  has  provided  very  interesting  information 
on  the  dynamics  of  dense  interacting  colloids  which  can  either  form  a  crystalline  state 
(at  a  low  volume  fraction)  or  a  glassy  state  (at  a  high  volume  fraction).  It  could  be 
worthwhile  to  carry  out  similar  DWS  experiments  on  binary  colloids  with  different 
diameter  ratios  and  relative  volume  fractions  to  map  out  the  entire  phase  diagram. 
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Appendix 

Within  the  Mie  scattering  theory  [26],  a  general  expression  of  /*  for  monodisperse 
particles,  is  given  by  [21,  30,  31] 


Here  np  is  the  number  density  of  particles  and  Q  =  (k0  —  kf)ap  is  the  dimensionless 
momentum  transfer  for  a  single-scattering  event.  For  monodisperse  particles,  we  have 
replaced  I(Q)  simply  by  the  product  of  the  form  factor  F(Q)  and  the  structure  factor 
S(Q}.  The  angular  brackets  indicate  an  average  over  all  the  scattering  paths.  The  form 
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factor  F(Q)  for  monodisperse  systems  is  given  by  the  Mie  scattering  theory  as  [26] 

F(Qap)  =  Qap\_AMA*(e)  +  A2(6)A*(e)l  (18) 

where  the  dimensionless,  single-scattering  amplitudes  A  i  (9)  and  A2  (0)  are  related  to  the 
scattered  far-fields  E^  and  Eg  as 

ieikr 
Ei=-  —  Al(G)*m<l>t 

ikr 
A2(6)cos(l>, 


00     2n+  1 


00     2n  4-  1 

(19) 


Ft=l 

with 


and    tn(cos0)  =      [Pn(cos0)].  (20) 


The  Mie  scattering  amplitudes  are  given  by 


,    = 

"      m 

where  a  =  /c0ap,  j9  =  /c0map,  «An(x)  =  (7TX/2)1/2  Jn+  1/2(x)  and  {„(*)  = 

[Jn(x)  and  Hn(x)  being  the  nth  order  Bessel  function  and  Hermite  polynomial, 

respectively]  and  m2  =  er  =  s/e0  is  the  relative  dielectric  constant  of  the  sphere. 

The  static  structure  factor  S(Q)  for  the  dense  hard-spheres  in  the  Percus-Yevick  (PY) 
approximation  is  given  in  terms  of  the  direct  correlation  function  c(Q)  as  [29,  35] 


r  \          /  r  x3 
=  <x0  +  ai(  —    +  a2    - 

°->      up/  (21) 
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Abstract.  Dielectric  properties  of  nitriles  have  been  studied  using  time  domain  reflectometry  in 
the  frequency  range  of  10  MHz  to  10  GHz  over  the  temperature  range  of  0°C  to  45°C.  The 
calculated  dielectric  relaxation  parameters  are  compared  with  values  for  the  corresponding 
n-alcohols.  The  dielectric  behaviour  in  n-nitrile  systems  is  quite  different  from  the  corresponding 
behaviour  in  n-alcohols.  The  dipoles  in  n-nitrile  systems  show  a  tendency  to  remain  antiparallel, 
unlike  in  alcohol  systems.  The  activation  energy  decreases  with  the  increase  of  molecular  size, 
whereas  in  hydrogen  bonded  liquid  it  increases. 

Keywords.    N-nitriles;  dipoles;  relaxation  times-microwaves;  dielectrics. 
PACSNo.    77-22 

1.  Introduction 

Structural  information  about  n-alcohols  in  the  liquid  state  has  been  extensively  studied 
through  dielectric  relaxation  measurements  [1-3].  Three  features  with  respect  to  the 
number  of  carbon  atoms  JVC,  in  n-alcohols  have  been  observed. 

(1)  An  increase  in  the  Kirkwood  correlation  factor  with  Nc:  this  indicates  that  the 
n-alcohols  with  larger  Nc  become  more  ordered  in  the  liquid  phase. 

(2)  Increase  of  the  relaxation  time  with  Nc:  this  indicates  a  slower  rotation  for  the 
heavier  molecules,  and 

(3)  Increase  of  the  activation  energy  with  Nc:  this  indicates  that  the  hydrogen  bonded 
energy  strengthens  with  increase  in  JVC. 

The  above  features  have  been  associated  with  hydrogen  bonding.  This  study  on  the 
nitriles  was  undertaken  to  determine  the  role  of  dipole  moment  in  the  above  mentioned 
features.  Nitriles  were  studied  because  (1)  many  nitriles  exist  in  the  liquid  state  at  room 
temperature,  (2)  these  are  not  hydrogen  bonded  and  (3)  they  have  large  dipole 
moments.  The  dielectric  constant  results  are  available  only  for  acetonitrile  [4]  and 
butanenitrile  [5].  The  dielectric  properties  on  seven  n-nitriles  with  Nc  =  1  to  5, 7, 9  and 
10  are  determined  in  this  work. 
In  §2,  the  experiment  is  described.  The  results  and  conclusions  are  given  in  §  3. 

2.  Experimental 


The  dielectric  spectra  were  obtained  by  the  time  domain  reflectometry  (TDR) 
technique  [6,  7].  In  the  TDR  technique  a  fast  rising  step  pulse  of  about  25  ps  generated 
by  S-52  pulse  generator  and  was  transmitted  through  3-5  mm  coaxial  line  with 
characteristic  impedance  50  ohms.  The  sample  cell  has  been  placed  at  the  end  of  the 
transmission  line  after  the  sampling  head.  The  SMA  (Standard  Military  Application) 
type  cell  was  used  for  the  work.  The  cell  had  outer  and  inner  dimensions  of  3-5  mm  and 
1-52  mm,  respectively.  The  effective  pin  length  of  inner  conductor  was  1-35  mm.  The 
Tektronix  7854  sampling  oscilloscope  was  used  to  sample  the  reflected  pulse.  The  time 
window  used  for  the  experiment  was  kept  5  ns.  The  reflected  pulse  was  digitized  with 
1024  points  in  the  selected  time  window  and  transferred  to  personal  computer  (PC/XT) 
through  IEEE-488  card.  The  complex  reflection  function  p*  of  the  sample  liquids  were 
determined  through  Fourier  transforming  the  measured  reflected  pulse  [7-10].  The 
spectra  show  the  single  Debye  relaxation  type  as  expected  [1  1].  The  complex  dielectric 
spectra  obtained  are  fitted  to  the  Debye  equation 

£   ~  £ 


with  e0  and  T  as  fitting  parameters.  In  the  fitting  procedure,  £w  is  not  fitted  and  assumed 
to  be  3-5.  This  is  done  because  the  data  are  not  sensitive  to  em  in  the  frequency  range 
under  consideration.  The  details  are  given  in  an  earlier  paper  [12]. 

3.  Results  and  discussion 

The  values  of  static  dielectric  constant  (e0)  and  relaxation  time  (T)  at  different 
temperatures  are  given  in  tables  1  and  2  respectively.  The  static  dielectric  constants  of 
acetonitrile  [4]  (£0  =  35-9)  and  butanenitrile  [5]  (e0  =  20-2)  as  reported  earlier  are  in 
good  agreement  with  the  values  reported  here. 

In  general,  e0  decreases  with  increase  in  the  number  of  carbon  atoms.  This  decrease 
can  be  ascribed  to  decrease  of  dipoles  per  unit  volume  with  an  increase  in  the  size  of  the 


Table  1.    Static  dielectric  constant  e0  for  n-nitrile  at  different  temperature. 


Nitrile 

t  =  0°C 

t  =  10°C 

t  =  25°C 

t  =  35°C 

t  =  45°C 

Acetonitrile 

40-0  (1) 

38-2  (1) 

36-9  (1) 

36-1  (1) 

33-7  (3) 

Butanenitrile 

26-2  (1) 

24-3  (4) 

22-0  (2) 

21-1  (1) 

20-3  (1) 

Pentanenitrile 

23-0  (1) 

21-5  (1) 

21-0  (2) 

19-1  (1) 

11-3  (1) 

Hexanenitrile 

18-1  (1) 

18-0  (1) 

16-3  (1) 

16-0  (1) 

14-2  (1) 

Octanenitrile 

14-7  (1) 

13-2  (1) 

13-0  (1) 

12-2  (1) 

10-8  (2) 

Decanenitrile 

11-4  (1) 

11-0  (1) 

10-6  (1) 

10-4  (1) 

10-2  (1) 

Undecanenitrile 

* 

11-0  (1) 

10-3  (1) 

9-1  (1) 

9-1  (5) 

*not  studied. 

Number  in  bracket  denotes  uncertainties  in  the  last  significant  digits  obtained  by 
the  least  squares  fit  method.  For  ex.  40-1  (1)  means  40-1  +  0-1.  The  realistic  errors  in 
the  values  is  about  2%  considering  other  experimental  aspects. 

20  Pramana  -  J.  Phys.,  Vol.  45,  No.  1,  July  1995 


Table  2.     Relaxation  time  T  for  n-nitrile  at  different  temperature. 

t(ps) 
Nitrile  t  =  Q°C       r=10°C      t  =  25°C      t  =  35°C      t  =  45°C 


Acetonitrile 

6-4  (7) 

3-6  (1) 

3-4  (1) 

2-7  (1) 

1-3  (8) 

Butanenitrile 

10-2  (3) 

9-9  (5) 

8-9  (7) 

8-0  (4) 

6-7  (5) 

Pentanenitrile 

16-8  (4) 

14-4  (2) 

13-0  (2) 

11-3  (1) 

10-9  (1) 

Hexanenitrile 

28-2  (3) 

22-0  (3) 

17-4  (4) 

16-3  (2) 

14-2  (3) 

Octanenitrile 

45-0  (4) 

36-6  (3) 

31-4  (3) 

25-6  (2) 

22-0  (3) 

Decanenitrile 

84-8  (13) 

73-7  (3) 

67-5  (8) 

53-6  (2) 

49-2  (4) 

Undecanenitrile 

* 

76-8  (5) 

64-6  (1) 

62-6  (1) 

58-7  (5) 

*  not  studied 


Table  3.     Kirkwood  correlation  factor  g  at  different  temperatures. 


Nitrile  r  =  0°C      t=\Q°C      t  =  25°C      t  =  35°C      t  =  45°C 


Acetonitrile 

0-87  (5) 

0-86  (5) 

0-88  (5) 

0-89  (5) 

0-85  (5) 

Butanenitrile 

0-80  (4) 

0-76  (4) 

0-73  (4) 

0-72  (4) 

0-71(4) 

Pentanenitrile 

0-74  (4) 

0-71  (4) 

0-73  (4) 

0-68  (4) 

0-66  (4) 

Hexanenitrile 

0-72  (4) 

0-74  (4) 

0-70  (4) 

0-71  (4) 

0-64  (4) 

Octanenitrile 

0-72  (4) 

0-66  (4) 

0-68  (4) 

0-66  (4) 

0-59  (3) 

Decanenitrile 

0-65  (4) 

0-65  (4) 

0-65  (4) 

0-66  (4) 

0-66  (4) 

Undecanenitrile 

* 

0-69  (4) 

0-67  (4) 

0-62  (3) 

0-61  (3) 

Number  in  brackets  denote  errors  in  least  significant  digits  by  assuming  2%  error  in 
the  values  of  e0  and  ero . 


molecules.  Structural  information  can  be  obtained  from  the  Kirkwood  correlation 
factor  [13],  g,  as 


=    oo  (  . 

9  9kTV      e(£ 


where  n  is  the  dipole  moment,  N/V  is  number  density  and  e^  is  the  square  of  the 
refractive  index.  The  values  of  p  and  £OT  have  been  taken  at  25°C,  from  the  Aldrich 
Catalog  handbook  of  Chemical  [14].  The  values  of  g  obtained  by  the  Kirkwood 
Frohlich  [13]  equation  is  given  in  table  3.  The  values  of  g  are  less  than  one,  indicating 
that  the  dipoles  are  oriented  in  antiparallel  direction.  As  the  size  of  the  molecule 
increases,  the  values  of  g  further  decreases.  The  tendency  to  remain  antiparallel 
increases  with  increase  of  chain  size  in  the  molecule.  This  tendency  is  due  to  dipole- 
dipole  interactions  between  two  molecules. 
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Figure  1.    Plot  of  relative  change  in  e0,  Ae0  with  number  of  carbon  atoms  in 
n-nitriles  and  n-alcohols  at  25  °C.  (O),  OH;  (  +  ),  CN 


1  5  * 

NC 

Figure  2.    Plot  of  Kirkwood  correlation  factor  vs  number  of  carbon  atoms  for 
n-nitriles  and  n-alcohols  at  25°C.  (O),  (OH);  (  +  ),  CN 


It  will  be  interesting  to  compare  the  results  with  the  corresponding  results  of 
hydrogen  bonded  liquids,  like  alcohols.  The  figure  1  shows  the  plot  of  change  in  As" 
with  the  number  of  carbon  atoms  in  n-nitriles  and  n-alcohols  [3,4]. 

The  value  of  As"  is  defined  as 

Ae"  =    °  Nc'a  ,„     — —  (4) 


where  (e0)aN  is  the  value  of  the  static  permittivity  for  the  system  with  N  carbon  atoms. 
The  g  values  are  given  in  figure  2.  It  shows  that  the  g  values  decrease  with  Nc  in  nitrile 
systems.  It  remains  almost  constant  in  alcohol  systems. 
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The  change  in  relaxation  times,  AT,  for  alcohol  and  nitrile  systems  are  shown  in 
figure  3. 
The  value  of  AT  is  defined  as 

AT  =  (T)'NC=O-(T)*NC=O  (5) 

where  (T)^  is  the  relaxation  time  for  the  system  a  with  N  carbon  atoms. 

In  figure  3,  the  change  in  relaxation  in  alcohols  is  larger  than  corresponding  values  in 
nitriles.  This  may  be  because  of  the  -OH  group  in  alcohol  which  is  a  hydrogen  bonded 
liquid.  As  the  chain  is  long  and  as  there  is  the  strong  hydrogen  bonding  with 
neighbouring  molecules,  relaxation  time  tends  to  increase  with  number  of  carbon 


Figure  3.    Plot  of  relative  change  in  T,  AT  with  number  of  carbon  atoms  in  n-nitriles 
arid  n-alcohols  at  25°C.  (O),  (OH);  (+ ),  CN 
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Figure  4.    Plot  of  activation  energies  (AH)  KJ/mole  vs  number  of  carbon.  (O), 
(OH);(+),CN 
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atoms.  In  case  of  nitriles  though  chain  is  long,  there  is  no  such  strong  bonding  with 
neighbouring  molecules. 

The  activation  energies  were  calculated  from  Eyring  equation  [15].  The  activation 
energies  are  associated  with  dipole-dipole  interaction.  The  results  are  shown  in  figure  4 
along  with  the  corresponding  values  for  n-alcohol  systems  [2,3].  It  can  be  seen  that 
nitrile  system  behave  in  a  very  different  manner.  This  may  be  because  of  the  fact  that  the 
activation  energies  in  nitrile  systems  are  due  to  the  dipole-dipole  interactions,  whereas, 
in  alochol  systems  it  may  be  due  to  the  hydrogen  bonding.  It  is  interesting  to  note  that 
the  trend  in  the  activation  energy  is  similar  to  the  trend  as  observed  in  the  (g)  factor, 
related  to  the  cooperative  effect.  The  higher  correlation  factor  leads  to  the  higher  value 
of  the  activation  energy. 
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Abstract.  The  phonon  spectra  of  unstrained  and  strained  quasiperiodic  semiconductor  super- 
lattices  (QSSL)  have  been  calculated  using  one-dimensional  linear  chain  model.  We  consider  two 
types  of  quasiperiodic  systems,  namely  cantor  triadic  bar  (CTB)  and  Fibonacci  sequences  (FS), 
constituting  of  AlAs,  GaAs  and  GaSb  of  which  the  latter  two  have  a  lattice  mismatch  of  about 
7%.  The  calculations  have  been  made  using  transfer  matrix  method  and  also  with  and  without 
the  inclusion  of  strain.  We  present  the  results  on  phonon  spectra  of  two  component  CTB  and  two 
as  well  as  three  component  FS  semiconductor  superlattices  (SSL),  thickness  and  order  depend- 
ence on  LO  mode  of  GaAs,  effect  of  strain  on  LO  frequency  of  GaAs.  The  calculated  results  show 
that  the  strain  generated  due  to  lattice  mismatch  reduces  significantly  the  magnitudes  of  the 
confined  optical  phonon  frequency  of  GaAs. 

Keywords.    Quasiperiodic;  phonon;  strained  layer;  transfer  matrix. 
PACS  Nos    63-20;  63-90;  68-35 

1.  Introduction 

In  the  recent  years  with  the  development  of  various  crystal  growth  techniques,  a  great 
deal  of  interest  has  been  generated  towards  the  experimental  as  well  as  theoretical 
studies  of  the  electronic  and  vibrational  properties  of  semiconductor  superlattices 
(SSL)  [1-11].  This  is  mainly  because  of  the  potentiality  of  the  applications  of  the  SSL  in 
various  high  speed  semiconductor  devices. 

Although  a  lot  has  been  studied  on  the  binary  SSL  having  perfect  lattice  matching 
between  its  components,  only  a  few  investigations  have  been  reported  on  the  periodic 
strained  layer  semiconductor  superlattices  (SLSL)  [12-14].  The  fabrication  and  study 
of  the  lattice  vibrational  properties  of  superlattices  consisting  of  lattice  mismatched 
materials  are  gaining  interest  because  of  their  applications  in  various  opto-electronic 
devices.  The  strain  due  to  lattice  mismatch  of  the  constituting  components  of  the  SLSL 
is  accommodated  by  each  layer  in  such  a  way  that  the  system  is  structurally  and 
thermodynamically  stable.  The  strain  generated  in  such  systems  is  isotropic  in  nature 
and  in  most  of  the  cases,  it  changes  the  volume  of  each  binary  layer  without  affecting  the 
crystal  symmetry  [15].  Due  to  this  fact,  it  has  now  been  possible  to  fabricate 
semiconductor  heterostructures  by  avoiding  misfit  dislocations  [14]. 

On  the  other  hand,  the  discovery  of  icosahedral  crystallographic  symmetry  has 
proved  to  be  a  new  dimension  to  the  study  of  the  electronic  and  vibrational  properties 
of  quasi-periodic  systems  [16, 17].  Among  the  quasi-periodic  superlattices,  the 
Fibonacci  superlattices  (FSL)  have  been  studied  extensively  both  theoretically  [18]  as 
well  as  experimentally  [19].  However,  in  the  case  of  FSL,  the  main  emphasis  has  been 
on  the  study  of  acoustic  phonon  propagation  and  their  transmission  [20-22].  Other 
types  of  quasi-periodic  sequences  are  Thue-Morse,  Cantor  bar  etc.  Recently,  some 


theoretical  efforts  have  been  devoted  to  investigate  the  vibrational  properti 
semiconductor  quasi-periodic  systems  [20]. 

In  a  recent  paper,  we  have  reported  the  calculated  results  of  vibrational  prc 
(GaAs)n  (GaSb)n2  SSL  [2].  In  this  investigation  the  effects  of  strain  on  the  vi 
modes  of  a  kind  of  quasiperiodic  semiconductor  superlattices,  the  cantor  bar 
ductor  superlattices  (CBSSL)  of  one  and  three  generations  comprising  of  ( 
GaAs  as  its  components.  In  addition  to  this,  we  have  also  calculated  th< 
spectra  for  two  component  GaAs,  GaSb  and  three  component  GaAs,  InAs  i 
FSL  of  four  generations.  Also,  we  have  studied  the  thickness  dependence  o 
phonon  modes  of  GaAs.  The  main  objective  in  investigating  this  problem  hi 
understand  the  effect  of  strain  on  the  phonon  properties  of  quasiperiodic  sup 
The  lattice  mismatch  in  GaAs  and  GaSb  is  quite  large  («  7%)  and  these  mal 
ideal  for  observing  the  effects  of  strain  on  the  vibrational  properties  of  su 
periodic  SSL.  The  phonon  dispersion  curves  (PDC)  in  CBSSL  have  been  c 
using  a  linear  chain  model  [2]  and  the  transfer  matrix  method.  This  method  hi 
been  applied  to  study  the  PDC  in  multicomponent  SSL  [1]  and  strai] 
superlattice  [2].  We  present  the  structure  and  generation  of  the  sequence  of  < 
§2  followed  by  a  brief  outline  of  the  method  of  calculation  of  PDC.  The  res 
been  discussed  in  §  3. 

2.  Model  structure  and  dispersion  relation 

The  one-dimensional  quasiperiodic  semiconductor  superlattices  under  cons 
are  generated  recursively  along  Z-direction  by  two  elementary  media:  med 
media  B.  The  model  structure  prescribed  in  figure  1,  maps  the  mathematic 
the  cantor  triadic  bar  sequence  infinitely  as  follows  [23],  C^  = 
C2  =  B2A2B2). . .Cn  =  BAA^Q-iA,,^  where  Bn  =  CM_l5  while  AB  is 
medium  as  layer  A:  but  with  different  thickness,  dAn  =  (2  +  a)"  MAj  v 
parameter  a  is  the  thickness  ratio  of  dAn  to  dBM.  The  total  thickness  c 
generation  one-dimensional  quasiperiodic  semiconductor  superlattice  (( 
dCn  =  (2  +  a)"dB1  where  dBx  is  the  thickness  of  the  layer  B  in  the  first  gener 
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Figure  1.    Geometry  of  a  cantor  bar  triadic  semiconductor  superlattice 
three  generations. 
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The  fractal  dimension  of  present  model  [23]  is  given  by  D  =  In  2/ln  (2  +  a).  It  is  easy 
to  find  that  when  <x=  1,  the  QSSL  becomes  usual  CBSSL  with  D  =  In  2/ln  3;  when 
a  =  A  =  (y/5  -  l)/2,  the  structure  of  QSSL  is  somewhat  like  the  Fibonacci  superlattice. 
If  we  substitute,  An  by  Cn_2  and  Bn  by  C,,_2,  it  is  just  the  usual  Fibonacci  superlattice. 
Hence,  the  physical  meaning  of  the  quantity  a  can  be  found  from  the  fact  that  it  governs 
the  formation  of  various  sequences. 

A  Fibonacci  superlattice  comprises  of  an  arrangement  of  individual  building  blocks 
of  type  A  (each  of  thickness  dA)  and  type  B  (each  of  thickness  dQ)  following  a  concatena- 
tion scheme  determined  by  the  Fibonacci  sequence  Gj  =  A,  G2  =  AB,  G3  =  ABA, . . . , 
Gr  =  Gr  _  j  Gr  _  2  [  1 7] .  The  rth  generation  FSL  Gr  consists  of  Fr  type  A  blocks  and  F,  _  1 
type  B  blocks,  where  Fr  are  the  Fibonacci  numbers  given  iteratively  Fr  =  Fr_  1  +  Fr_  2 
for  r  >  2  with  F0  =  0  and  ¥^  =  1.  In  order  that  phonons  may  recognize  the  distinct 
interfaces  spaced  in  Fibonacci  manner,  each  building  block  should  be  subdivided  into 
two  or  more  layers  with  different  elastic  properties.  In  this  paper,  we  have  considered 
that  both  type  A  and  B  blocks  consist  of  two  and  three  layers. 

In  order  to  calculate  the  phonon  dispersion  curves  for  these  QSSL,  we  have  adopted 
a  linear  chain  model  in  one-dimension,  details  of  which  have  been  given  in  our  previous 
papers  [1,2].  The  general  equation  of  motion  of  any  atom  in  either  of  the  components 
of  the  QSSL  can  be  written  as  [1] 

mjQ}2un=  -Cj(uB_ i  +  ua+l  -2un)  (1) 

where  Ct(i  =  A,  B)  are  the  nearest  neighbour  force  constants,  un  are  the  displacements, 
and  nij  are  respective  masses  of  the  atoms.  By  using  the  transfer  matrix  method  as  in  our 
earlier  papers  [1, 2],  we  get 

cosQD=l/2trT  (2) 

where  T  is  a  unimodular  (2  x  2)  transfer  matrix  which  has  the  property  of  relating  the 
coefficients  of  the  displacement  vectors  in  one  cell  (/  +  1)  to  those  in  the  previous  one  (/) 
and  is  given  by 

T==  YlKilHnKn  1Hn_iKn^1...H2K2  1H^  (3) 

where  Ht  and  K{  (i  =  l,n)  are  (2  x  2)  unimodular  matrices  and  defined  in  ref.  [1]. 

In  the  present  paper,  we  have  calculated  the  dispersion  curves  for  diatomic  one  and 
three  generation  CBSSL  and  two  and  three  component  four  generation  FSL.  The 
transfer  matrix  in  the  case  of  one  generation  CBSSL  reduces  to 

T  =  K~1H^K-iH2K-1H1.  (4) 

This  equation  is  similar  to  that  which  has  been  reported  earlier  by  us  in  the  case  of 
multicomponent  SSL  [2],  where  there  are  only  two  components,  the  middle  one  being 
different  than  the  two  adjacent  neighbours.  The  dispersion  relation  for  three  generation 
CBSSL  as  well  for  two  and  three  component  four  generation  FSL  have  been  calculated 
using  (2)  and  (3)  [24]. 

The  effect  of  strain  due  to  lattice  mismatch  at  the  interface  of  the  two  components  has 
been  considered  in  the  usual  manner,  as  discussed  in  our  previous  paper  [2].  We  have 
calculated  the  phonon  dispersion  relation  by  considering  the  effect  of  strain  also,  to 
reveal  its  role  in  the  dispersion  of  phonons  in  such  systems.  Also  we  have  investigated 
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the  effect  of  change  in  layer  thickness  on  the  highest  longitudinal  optical  mod 
by  varying  the  value  of  a. 

3.  Results  and  discussion 

We  have  calculated  the  phonon  dispersion  curves  (PDC)  of  unstrained  an< 
one,  three  generation  CBSSL  and  two,  three  component  four  generation 
constituents  of  CBSSL  and  two  component  FSL  are  GaAs  and  GaSb  and  th; 
component  FSL  are  GaAs,  InAs  and  GaSb.  The  various  parameters  i 
constants  have  been  calculated  in  the  usual  way  as  explained  in  our  earlier  paf 
The  calculated  bulk  dispersion  curves  (longitudinal  modes)  for  GaAs  ai 
compared  with  the  experimental  values  [25, 26]  have  been  reported  in  ref.  [ 
In  figures  2(a)  and  2(b),  we  have  plotted  the  PDC  of  one  and  three  generatic 
with  the  components  B  and  A  as  (GaAs)ni  and  (GaSb)n2  respectively  with  the 
Wl  =  n2  =  10.  In  this  calculation,  we  have  not  considered  the  effect  of  stre 
figures  reveal  that  they  adequately  explain  the  usual  features  of  the  su 
vibrations.  It  is  also  seen  from  these  figures  that  with  the  increase  of  the  th 
GaSb  from  one  to  three  generations,  the  number  of  confined  modes  of  GaSb 
This  fact  is  true  in  both  one  and  three  generation  CBSSL.  However,  when  th 
the  components  is  reversed  with  a  motivation  to  study  the  effect  of  increased 
of  one  of  the  constituents,  the  number  of  GaAs  layers  increases  as  the  n 
generations  increase  resulting  into  the  increase  in  the  number  of  confine 
modes.  It  is  observed  from  these  figures  that  the  self  similarity  nature  incre 
increase  in  the  generation  number.  In  addition  to  this,  we  have  also  calci 
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Figure  3.    Phonon  dispersion  curves  of  strained  CBSSL  with  components  A  and 
B  as  (GaSb)10  and  (GaAs)10  respectively  for  (a)  one  and  (b)  three  generations. 


phonon  dispersion  curves  in  another  quasiperiodic  system,  the  two  and  multicompo- 
nent  Fibonacci  SSL.  The  results  are  quite  similar  to  that  of  CBSSL. 

In  order  to  understand  the  effect  of  strain  generated  due  to  lattice  mismatch,  we  have 
plotted  in  figures  3  (a)  and  3(b),  the  PDC  of  one  and  three  generation  CBSSL  with  the 
components  GaSb  and  GaAs  for  the  above  mentioned  order  and  thickness.  It  is 
observed  from  these  figures,  that  in  addition  to  the  self  similarity  nature  and  other  usual 
features  observed  in  quasi-periodic  SSL,  there  is  a  downward  shift  of  the  highest  LO 
mode  frequency  of  GaAs  for  both  the  cases  and  generations.  This  is  attributed  to  the 
fact  that  the  lattice  mismatch  is  accommodated  by  tensile  strain  in  GaAs  and  as 
compressive  strain  in  GaSb.  Thus,  it  reduces  the  lattice  constant  of  GaSb  and  enhances 
that  of  GaAs.  However,  when  the  order  of  the  components  is  reversed,  the  downward 
shift  increases  more  because  of  the  increase  in  the  number  of  the  GaAs  layers.  Our 
arguments  can  be  justified  from  our  earlier  papers  [1, 2]  and  Raman  scattering 
measurements  made  on  InGaAs/GaAs  [13]  and  GaSb/AlSb  [27]  SLSL,  respectively. 
To  the  best  of  our  knowledge,  we  have  calculated  for  the  first  time  the  PDC  of 
a  multicomponent  four  generation  Fibonacci  SSL  without  and  with  the  effect  of  strain 
included.  The  FSL  comprises  of  building  blocks  A  as  (GaAs)10(GaSb)10(InAs)20  and 
that  of  B  as  (GaAs)  10 (GaSb)  10  (In As)4.  The  calculations  here  are  carried  out  with  the 
assumption  that  GaSb  and  In  As  are  perfectly  lattice  matched  and  there  occurs  a  lattice 
mismatch  between  In  As  and  GaAs.  The  PDC  have  been  plotted  in  figures  4  (a)  and  4(b). 
This  figure  reveals  that  there  occurs  a  considerably  large  amount  of  frequency  shift  of 
the  highest  GaAs  LO  mode  which  is  more  prominent  in  this  case  than  the  two 
component  FSL  with  all  usual  features  observed  in  quasiperiodic  systems. 

To  investigate  the  effects  of  strain  on  the  LO  phonon  modes  quantitatively,  we  plot 
in  figure  5,  the  highest  LO  mode  of  GaAs  fof  three  generation  CBSSL  for  cases 
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Figure  4.  Phonon  dispersion  curves  for  FSSL  of  four  generations  with  blocks 
A  =  (GaSb)10(InAs)20(GaAs)10,  B  =(GaSb)10(InAs)4(GaAs)10  for  unstrained  (a) 
and  strained  (b)  systems. 
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Figure  5.  Thickness  ratio  a  as  a  function  of  highest  GaAs  LO  frequency  of  strained 
CBSSL  for  three  generations  with  layers  A  and  B  as  (a)  (GaSb)10  and  (GaAs)M;  (b) 
(GaAs)10  and  (GaSb)n  respectively. 


(a)  B  ==  GaAs  and  A  =  GaSb,  (b)  B  =  GaSb  and  A  =  GaAs.  Considering  the  first  case  in 
which  the  thickness  of  the  GaSb  layer  is  kept  constant  and  that  of  Gr  As  is  varied 
resulting  into  the  variation  of  the  parameter  a  from  0-01  to  10.  It  is  observed  that  as  the 
value  of  the  parameter  a  increases  from  0-01  to  10,  the  thickness  of  GaSb  increases  as 
compared  to  that  of  GaAs  leading  to  the  downward  shift  of  the  GaAs  LO  mode.  At 
a  =  10,  the  system  behaves  as  bulk  GaSb  and  the  frequency  is  very  near  to  that  of  GaSb 
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LO  mode.  But  as  a  decreases  to  0-01,  the  strain  induced  effect  on  the  LO  modes  of  GaAs 
relaxes  thereby  saturating  at  a  frequency  quite  close  to  that  of  experimental  zone  centre 
value.  This  is  due  to  the  fact  that  around  a  =  0-01,  the  thickness  of  GaAs  layers  is  very 
large  as  compared  to  GaSb  and  hence  the  system  behaves  as  bulk  GaAs.  Similar 
arguments  hold  good  when  the  order  is  reversed  (case  (b))  thereby  showing  reverse 
nature. 

In  conclusion,  we  have  investigated  the  effect  of  built-in  strain  on  the  PDC  of 
quasiperiodic  CBSSL  and  FSL  thereby  observing  that  the  change  in  the  in-plane  lattice 
parameter  subsequently,  enhances  (or  suppresses)  the  highest  LO  mode  of  GaAs. 
However,  so  far  no  efforts  have  been  made  from  the  experimental  point  of  view  on 
the  phonon  properties  of  such  systems.  We  emphasize  the  need  of  Raman  measurement 
for  such  systems. 
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Abstract.  The  role  of  lower  symmetry  component  of  the  crystal  field  in  causing  a  mixing  of 
excited  3d*~  Ms  with  the  ground  3d*  configuration  and  leading  to  spin  density  at  the  nucleus  for 
iron  group  ions  was  suggested  by  Griffith  and  Orgel.  This  mechanism  has  been  examined  in 
detail  for  the  two  low-symmetry  copper  complexes,  one  square  planar  (D4h  symmetry)  and  the 
other  distorted  tetrahedron  (D2d  symmetry)  and  the  calculation  has  been  performed  using  the 
powerful  Racah  method  and  tensor  operator  technique.  It  is  found  that  for  the  two  types  of 
copper  complexes,  copper  pthalocyanin  (square  planar,  D4h  symmetry)  and  cesium  copper 
chloride  (distorted  tetrahedron,  D2d  symmetry)  the  contribution  from  this  mechanism  to  the  spin 
density  at  the  nucleus  vanishes  identically. 

Keywords.    Spin  density;  configuration  interaction. 
PACSNo.    31-20 


1.  Introduction 

The  problem  dealt  with  in  the  present  work  is  the  result  of  a  program  which  we  initiated 
for  studying  the  effect  of  configuration  interaction  on  the  various  physical  properties  of 
iron  group  such  as  crystal  field  splittings  [1-5],  intensity  of  crystal  field  spectra  [6, 7], 
reduction  of  orbital  moment  and  s-o  coupling  [8]  and  finally  on  the  spin  density  at  the 
nuclear  site  of  the  ion.  The  configuration  interaction  in  which  we  are  interested  in  the 
said  program  concerns  the  mixing  of  the  ground  configuration  3d*  with  the  excited 
configurations  3d*~ *  41(41  =  4s,  4p,  4d  and  4f)  in  which  a  3d  electron  is  promoted  to  the 
4s,4p,4d  or  4f  orbit. 

For  the  explanation  of  spin  density  at  the  nulcear  site  of  iron  group  ions  Abragam  et  al 
[9]  proposed  a  mechanism  where  the  ground  configuration  Is22s22p63s23p63d*  gets 
admixed  via  interelectronic  repulsion  with  the  excited  configuration  ls22s22p63s3p63d*Rs 
where  Rs  is  a  generalized  higher  s-orbital  orthogonal  to  all  the  s-orbitals.  The  method 
of  evaluating  Rs  using  variational  Hartree-Fock  calculation  has  been  illustrated  by 
Abragam  et  al  [9].  However,  this  mechanism  yielded  a  result  for  the  spin  density  equal 
to  —  0-3  a.u.  instead  of  the  experimental  value  —  3  a.u.  approximately  in  the  case  of 
Mn2  +  .  Griffith  and  Orgel  [10,11]  suggested  a  mechanism  which  is  dependent  on 
crystal  field  having  a  symmetry  lower  than  the  cubic  and  causing  an  admixture  of  the 
excited  configuration  Is2  2s2  2p6  3s2  3p6  3dx~ 1 4s  with  the  ground  configuration.  To  our 
knowledge  the  size  of  the  contribution  from  this  mechanism  to  the  spin  density, 
although,  suggested  long  ago,  was  not  quantitatively  calculated  and  reported  in  the 
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which  is  now  considered  to  be  the  most  plausible  and  successful  median 
excellent  agreement  with  the  experiment.  This  mechanism  is  based  on  th 
magnetic  polarization  of  an  atomic  or  ionic  core  of  closed  shells  by  ar 
external  shell  having  a  total  spin  S.  The  radial  functions  of  s-orbitals  of  a 
with  spin  up  and  down  are  considered  to  be  different  because  of  their 
coupling  with  the  magnetic  d-electrons  and  the  density  at  the  origin  will  be 
for  the  two  cases.  Some  details  of  the  calculation  can  be  found  in  the  revie1 
[12,13].  The  calculation  of  spin  density  involves  quantities  which  are  sei 
the 'precise  form  of  electron  wavefunctions.  Further  development  involving 
forms  of  Hartree-Fock  calculation  and  other  rigorous  techniques  : 
correlation  and  relativistic  correction  may  be  found  in  excellent  reviews 
Recent  works  [16-20]  with  the  highly  sophisticated  methods  such  as  lar 
configuration  H-F  technique,  multireference  configuration  interaction  t< 
many-body  perturbation  technique  etc.  are  confined  within  the  relativ 
atoms.  The  complexity  of  these  sophisticated  calculations  seems  to  indice 
nued  reliance  on  the  more  straightforward  spin-polarized  H-F  technique  fo 
atoms  or  ions. 

We  would  like  to  stress  that  the  present  paper  does  not  aim  at  explaining 
density  of  the  two  copper  complexes,  Cu-pthalocyanin  (CuPc)  and  Cs2  CuCl4 1 
the  size  of  the  contribution  from  the  crystal  field  dependent  mechanism  sugj 
GrifRth  and  Orgel  as  a  part  of  our  program  of  studying  the  effect  of  conf 
interaction  on  the  various  physical  properties  of  iron  group  ions  as  stated  ea 
choose  the  above  two  particular  ions  since  the  deviation  from  pure  cubic  syn 
large  in  these  cases,  a  condition  necessary  for  Griffith  and  Orgel  mechanism  to 
significance.  The  first  is  a  covalent  square  planar  complex  (D4h  symmetry)  [2] 
a  large  deviation  from  cubic  symmetry.  In  Cs2  CuCl4 ,  the  Cu2  +  ion  deviates  f: 
Td  as  observed  from  X-ray  [22, 23]  and  it  is  best  described  by  D2d  symmetry  ( 
epr  hyperfine  structures  from  which  the  experimental  values  of  spin  density 
extracted  are  also  available  for  these  two  copper  complexes  [21, 25].  To  our  su 
found  that  the  contribution  to  the  spin  density  from  Griffith  and  Orgel  m< 
vanishes  identically  in  these  two  particular  cases  even  though  the  symmetry 
appreciably  from  pure  cubic,  a  result  considered  to  be  worth  reporting.  The  n 
is  evidently  due  to  the  particular  symmetry  of  the  copper  ion.  This,  of  course, 
mean  that  in  the  case  of  other  symmetries  of  the  copper  complex,  the  contribul 
Griffith  and  Orgel  mechanism  will  also  vanish. 

The  central  density  %  of  the  unpaired  spin  is  calculated  from  the  followin: 
sion  [9] 

47C 

X  =  y<^!Z<5(rf);y«A>Si=s 

where  S  is  the  total  spin  of  the  ion  in  its  ground  configuration,  the  state  \l/  is  tl 
state  of  the  groundconfiguration  3d*  in  a  crystal  field  admixed  with  the  states  c 
configuration  3d*-1 4s  via  both  crystal  field  and  interelectronic  repulsion  co 
together  as  the  perturbation. 
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2.  Method  of  calculation 

The  lowest  state  of  the  ground  configuration  3d9  of  Cu2  +  ion  in  D2d  and  D4h  symmetry 
is  given  by  [24,  21] 


iAg(3d9)  =  (l/2)1/2[|3d92D  1/2  2>  +  |3d92D  1/2  - 

[|<A1>  +  jiA2>]    (say).  (2) 


The  upper  and  lower  signs  correspond  to  D2d(Cs2CuCl4)  and  D4h(CuPc)  systems 
respectively.  We  consider  the  mixing  of  this  lowest  state  as  given  by  (2)  with  the  various 
states  of  the  excited  configuration  3d8  4s  via  the  combined  action  of  interelectronic 
repulsion  and  crystal  field  interaction  i.e.  through  the  operator 


(3) 

i<j 

where  Hc  =  S,.  —  \e\  V(i),  and  V(i)  is  the  crystal  field  potential  of  the  ith  electron  and  the 
summation  extends  over  all  the  electrons  of  the  unfilled  shell.  For  Cs2CuCl4(D2d 
symmetry) 

V(i)  =  a, 


For  CuPc  (D4h  symmetry) 

no  =  «20  n2)co  +  « 

In  (4)  and  (5)  akq's  are  crystal  field  coefficients  and  V*®  =  rk  Y|,  7£  being  a  spherical 
harmonic  of  order  k  and  component  q.  In  the  present  calculation  we  need  only  to 
consider  the  term  a20  V(Q}  of  the  crystal  field  potential,  for  other  terms  of  the  potential 
the  matrix  elements  of  Hc  between  the  states  of  3d9  and  3d8  4s  vanish.  The  configura- 
tionally  mixed  lowest  state  becomes 


=  «Ag(3d9)  -  d/A£)Z  <>M3d84s)|    ZteVry)  +  1  -  \e\a 


20 


(6) 

where  the  general  form  of  the  states  of  the  excited  configuration  3d8  4s  is  written  as 
^e  =  |3d8(S1L1)  4s  S'I'M^ML)  and  AE  is  the  energy  separation  of  the  excited 
configuration  3d8  4s  from  the  ground  3d9  configuration.  For  the  matrix  elements  of 
2(e2/r£j.)  in  (6)  we  need  to  calculate  only 


since  the  matrix  of  Ze2/r,.  •  is  diagonal  in  L,  S.  Here  Ms  and  ML  values  are  omitted  in  the 
state  symbol  as  the  matrix  element  does  not  depend  on  them.  Racah  [26]  has  already 
given  a  very  convenient  formula  for  the  evaluation  of  such  matrix  element  which 
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reduces  to  the  following  (using  eq.  (79)  of  Racah  [26]) 
<3d92D|Xe2/r0-|3d8(1D)4s2D> 

=  (9/14)1/2[<d92D||U(2)||d92D><d92D{|d8(1D)d2D>51/2 
+  <d92D{|d8(1S)d2D><d81S||U(2)||d81D>51/2 
+  <d92D{|d8(1D)d2D><d81D||U(2)|jd81D>51/2 

+  <d92D{|d8(1G)d2D><d81G||U(2)||d81D>51/2]R^(3d3d,3d4s) 

(7) 

where  the  various  symbols  and  quantities  have  been  explained  in  Racah's  papers 
[26,  27].  Using  eq.  (74)  of  Racah  [27] 


<d92D||U(2)||d92D>=  -<d||u(2)||d>=-l  (8) 

and 

<d8(S1L1)||U(2)||d8(S1L'1)>=-<d2(SJL1)||U^!|d2(S1L'1)>  (9) 

The  r.h.s.  of  (9)  has  been  tabulated  by  Racah  [27],  eq.  (103b). 
Carrying  out  the  calculation  involved  in  (7)  we  finally  get 

<3d92D|£e2/r,.X(lD)4s2D>  =  -(2/35)1/2R(2)(3d3d,3d4s) 

where  R(2)(3d  3d,  3d  4s)  is  the  interelectronic  repulsion  integral  defined  by  Condon  and 
Shortley  [28].  The  matrix  element  of  LK(02)(i)  that  occurs  in  (6)  is  of  the  form 

<3d92Dl/2M1JX^/(o2)|d8(S1L1)4s2L1l/2MLi>  =  A    (say)  (10) 

which  may  be  written  as  (using  tensor  operator  formula  [29]) 

A  =  91/2<d92D{|d8(S1  LJd^X-  l)Ll-M'  [5(2Lt  +  I)]1/2 


Thus  the  admixed  state  \l/  in  (6)  reduces  to 


02) 
where 

D20  =  |c|o20<3d||V(2'||4s> 


l^j)  =  |3d8(1D)4s2D  1/2  2>,    |02>  =  |3d8(1D)4s2Dl/2  -2> 
|<£3>  =  |3d8(3F)4s2F  1/2  2>,     |04>  =  |3d8(3F)4s2Fl/2  -2> 
|</)5>  =  |3d8(1G)4s2Gl/22>,    |06>  =  |3dYG)4s2Gl/2-2> 
and  the  upper  sign  in  (12)  corresponds  to  Cs2CuCl4  and  the  lower  sign  to  CuPc.The 


Expanding  ij/  according  to  (12)  %  ultimately  comes  out  to  be 

X  =  Xc  +  XR  +  XcR  (13) 

where 


=  (47r/S)(l/A£)2[-(l/35)1/2t;20R(2)(3d3d,3d4s)] 


in  which  Q  =  Z(5(r,)sfz;  %c  represents  the  contributions  to  the  spin  density  at  the  nucleus 
arising  through  the  admixture  of  ground  3d9  and  excited  3d8  4s  configurations  via 
crystal  field  alone  as  was  suggested  by  Griffith  and  Orgel  [10],  XR  stands  for  the 
contribution  arising  through  the  admixture  of  the  two  configurations  via  interelec- 
tronic  repulsion  alone  and  the  contribution  ^CR  arises  from  the  cross  terms  like 


<«Ag(3d9)|Hc| 
The  type  of  matrix  element  <0j|Q|0,->  that  we  now  need  is 

<d8(S1L1)s2L1l/2MLi|j;W*uld8(S1L1)s2L1l/2MLii>«B    (say) 
where  MLi  =  2  or  -  2.  Using  (22)  and  (26)  of  Racah  [26] 

B  =  <d8(S1L1)St2L1l/2MLJ5(rt)Sfz|d8(S1L1)sI2L1l/2MLi> 

0 

/ 
l/2    St 

where  iA4s(0)  is  the  wave  function  of  4s  electron  at  r  =  0  (origin). 

3.  Results 

Calculation  yields 

Ic  =  (47r/S)[i;20/A£;]2(l/47r){(l/35)^4s(0)|2 


=  0 

XR  =  (AE)-2(l/35S)[R(2)(3d3d,3d4s)]2|iA4s(0)|2 
XcR=-(A£)-2S-1(l/35)1/2R(2)(3d3d,3d4s)i;20|»A4s(0)|2 

Thus  xc  is  found  to  vanish  identically  in  the  present  case  showing  that  the  crystal  field 
through  the  mechanism  suggested  by  Griffith  and  Orgel  has  no  contribution  to  the  spin 
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density  at  the  nuclear  site  of  the  copper  ion.  However  the  crystal  field  indirectly  gives 
a  non-zero  contribution  XCR  which  involves  the  crystal  field  parameter  through  u20.  An 
estimate  of  XCR  may  be  made  from  the  knowledge  of  a  correct  radial  wave  function  in 
the  crystal  which  is  difficult  to  obtain.  However,  a  rough  estimate  may  be  made  by 
considering  Richardson's  wave  functions  [30,  31]. 

The  energy  separation  AE  of  the  excited  configuration  3d8  4s  from  the  ground  3d9 
configuration  for  Cu2  +  ion  is  obtained  from  Moore's  table  [32] 

A£  =  0-74x  105  cm'1  =  3-371688  x  10~la.u. 
Using  Richardson's  radial  functions  we  have 


R<2)(3d  3d,  3d  4s)  =  0-006833  a.u. 

R*dr2R4/2dr=  1-3743  a.u.  (14) 


As  in  our  previous  work  [5]  the  value  of  v20  in  the  case  of  Cs2  CuCl4  can  be  calculated 
from  the  knowledge  of  metal-ligand  distance  [22,23],  ligand  charge  (=  -  \e\)  and  the 
radial  integral  given  in  (14).  The  value  is  v20  =  0-0133  a.u.  for  Cs2CuCl4. 

The  value  of  v2Q  is  difficult  to  estimate  in  the  case  of  Cu-pthalocyanin.  A  rough 
estimate  in  this  case  will  be  made  considering  v20  for  the  covalent  compound  CuPc 
larger  than  that  for  Cs2  CuCl4  by  a  maximum  of  one  order  in  magnitude.  Thus  using  the 
above  numerical  values  and  the  total  spin  S  =  1/2  for  Cu2  +  ion  we  arrive  at  the 
following  results 


10~3a.u. 
=_  0-165  x  KT2  a.u.    (for  Cs2CuCl4) 

=  -  1  -65  x  10  ~  2  a.u.      (for  CuPc  assuming  that  v20  has  a  value  10  times 
that  in  the  case  of  Cs2CuCl4) 


4.  Discussion 


The  above  calculation  shows  that  #c,  the  contribution  arising  from  the  mechanism  of 
Griffith  and  Orgel  identically  vanishes  in  the  present  case.  Further,  it  is  clear  that  XCR  is 
of  very  little  significance  when  compared  with  the  experiment  which  gives  a  total  value 
of  about  -  3-2  a.u.  in  Cu(Pc)  [21]  and  -  2-7  a.u.  in  Cs2CuCl4  [25]  for  the  spin  density 
at  the  copper  nucleus  as  derived  from  the  epr  hyperfine  spectral  data  of  these  two 
copper  complexes.  Incidentally,  we  note  that  the  magnitude  of  XR  is  even  smaller  than 
that  of  XCR-  We  conclude  that  the  role  of  crystal  field  in  contributing  to  the  spin  density 
at  the  site  of  copper  nucleus  in  the  case  of  copper  pthalocyanin  and  cesium  copper 
chloride  is  of  very  little  importance  although  the  symmetry  of  the  copper  ion  largely 
deviates  from  cubic. 


^i  y aLui  JI.KIU  unu  spin  density 
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Abstract.  Schwinger  variational  principle  has  been  used  to  calculate  triple  differential  cross- 
sections  for  ionization  of  hydrogen  atoms  by  positrons  at  intermediate  and  high  energies  for 
Ehrhardt  type  asymmetric  geometry.  The  results  agree  in  general  with  the  calculations  of 
Brauner  et  al  [8]  and  with  the  second  Born  calculation. 
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1.  Introduction 

The  study  of  ionization  collisions  of  atoms  by  incident  charged  particles  has  been  an 
active  field  of  research  for  the  past  few  decades.  Much  emphasis  has  been  laid  on  the 
study  of  triple  differential  cross  sections  (TDCS)  for  ionization  of  atoms  by  electrons. 
These  studies  focus  light  on  various  aspects  of  the  ionization  problem.  In  this 
connection  the  theoretical  work  of  Byron  et  al  [1,2],  Curran  and  Walters  [3]  and 
Curran  et  al  [4],  Schlemmer  et  al  [5],  Jones  et  al  [6],  McCarthy  et  al  [7],  Brauner  et  al 
[8],  Das  and  Seal  [9,10]  and  Seal  and  Das  [11]  and  the  experimental  works  of 
Ehrhardt  et  al  [12],  Ehrhardt  [13]  and  Lohmann  et  al  [14]  are  noteworthy. 

For  a  long  time  it  has  been  known  that  positron  is  more  efficient  in  ionization 
of  atoms  and  molecules.  Recent  measurements  of  total  cross  sections  for  hydrogen 
atoms  [15]  and  hydrogen  molecules  [16]  by  positron  impact  further  confirm  this. 
TDCS  measured  results  for  positrons  are  not  available  now.  However,  in  .near  future, 
when  such  results  will  be  available  a  new  dimension  will  be  added  to  the  ionization 
studies.  A  few  calculations  for  total  cross  sections  for  ionization  of  hydrogen  atoms 
by  positrons  exist  at  present  [17-20].  Brauner  et  al  [8]  also  recorded  TDCS  results 
for  ionization  of  hydrogen  atoms  by  positrons  along  with  those  for  ionization  by 
electrons. 

Recently  we  have  made  a  Schwinger  variational  calculation  (SVC)  of  TDCS  for 
ionization  of  hydrogen  atoms  by  electrons  and  obtained  good  results  [21].  Here  we 
want  to  report  our  TDCS  results  in  certain  cases  for  ionization  by  positrons  obtained 
with  the  same  variational  method. 
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Figure  1.    TDCS  versus  ejection  angle  B1  for  ionization  of  hydrogen  atoms  by 

positrons  at  £,  =  250  eV,  E1  =  5eV,  0t  =  3°  (a)  0,  =  8°  (b).  Theory: present 
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Joachain[22]) 


(1) 


It  is  easily  checked  that  this  is  stationary  (to  1st  order)  under  arbitrary  variations 
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Figure  2.    Same  as  figure  1  except  for  E;=150eV,  JE^SeV,  02  =  4°  (a)  and 
02  =  10°  (b). 


This  provides  the  variational  principle.  Here  |  ^{  >  and  |  $/  >  are  unperturbed  initial  and 
final  states,  |  T j + }  >  and  |  Tj." )  >  are  the  initial  and  final  channel  exact  states  respectively. 
V  is  the  interaction  potential  and  G(+)  is  the  Green's  function  for  the  perturbed 
Hamiltonian  in  which  the  interaction  V  is  omitted.  It  is  reasonable  at  high  energies  to 
approximate  |  *F{ + } >  and  |XP^~)  >  by  1 0f  >  and  | O7  >  respectively.  The  resulting  scattering 
amplitude  is  then  given  by 


When  fB2  is  small  compared  to  /B, 


(3) 


the  second  Born  amplitude. 

The  present  calculation  is  effectively  same  as  our  recent  work  [21]  except  for 
a  change  in  sign  in  the  interaction  potential. 
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Figure  3.    As  in  figure  1  with  Ex  =  5  eV,  02  =  5°,  E,  =  600  eV  (a),  E-t  =  800  eV  (b)  and 
EI  =  1000  eV.  Theory  —  SVC; FB2; FB. 


3.  Results  and  discussion 

We  have  used  here  the  usual  notations:  £  for  energy,  p  for  momentum,  d  for  scattering 
angle  (referred  to  incident  particle  momentum  direction  as  polar  axis).  £,,  p-t  refer  to  the 
incident  particle,  E^ ,  p± ,  0^  to  the  ejected  particle  and  E2,p2, 92  to  the  scattered  particle. 
The  scattering  is  considered  in  a  plane. 

Three  representative  sets  of  results  are  presented  in  figures  l(a),  l(b),  2(a),  2(b)  and 
3  (a),  3(b),  3(c)  respectively.  Figures  l(a)  and  l(b)  correspond  to  E{  =  250  eV,  Ei  —  5  eV, 
02  =  3°  (l(a))  and  62  =  8°(l(b)).  In  figures  2(a)  and  2(b)  JE,  =  150eV,  El  =  5eV,  02  =  4° 
(2(a))  and  02  =  10°,  (2(b)),  while  in  figures  3 (a),  3(b)  and  3(c),  E^SeV,  02  =  5°; 
E{  =  600eV  (3 (a)),  E,  =  SOOeV  (3(b)),  E,  =  lOOOeV  (3(c)). 

As  no  experimental  data  for  positron  impact  ionization  are  available,  we  compare 
our  results  with  the  theoretical  results  of  Brauner  et  a\  [8]  (see  figures  1  and  2),  second 
Born  approximation  (FB2)  and  the  first  Born  approximation  (FB).  At  intermediate 
energies  (150eV-200eV),  the  agreement  of  the  present  results  (SVC)  with  FB2  and 
those  of  Brauner  et  al  are  fairly  good.  Since  these  three  results  agree  well,  we  may 
conclude  that  first  Born  approximation  underestimates  cross  section  in  the  binary  peak 
region.  This  is  plausible  because  it  is  a  high  energy  approximation. 

At  high  energies  (600  eV,  800  eV,  1000  eV  presented  in  figures  3)  the  present  results 
and  FB2  results  agree  remarkably.  There  is  practically  no  difference  between  these  two. 
Even  at  1000  eV  the  first  Born  approximation  results  differ  significantly  from  that  of 
FB2  and  the  present  results  in  the  binary  peak  region,  though  a  very  slow  convergence 
to  the  SVC  results  from  600  eV  to  1000  eV  is  detectable.  In  the  recoil  peak  region  first 
Born  approximation  seems  to  work  much  better  at  these  energies. 
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Certain  points  need  to  be  noted  for  the  present  results.  Binary  peaks  are  shifted  to 
smaller  angles  compared  to  first  Born  results  and  the  peak  heights  are  much  larger. 
These  are  just  opposite  for  ionization  by  electrons.  The  recoil  peak  heights  are  smaller 
for  ionization  by  positrons  compared  to  those  by  electrons.  At  large  momentum 
transfers  the  present  results  tend  to  depart  from  those  of  Brauner  et  al  [8]. 

4.  Conclusions 

It  is  interesting  to  note  the  good  agreement  between  the  Schwinger  variational  results 
and  those  of  Brauner  et  al.  However,  at  large  momentum  transfers  these  tend  to  depart 
from  each  other.  Future  experimental  confirmation  or  otherwise  of  these  results  will  be 
interesting.  Schwinger  variational  calculation  for  ionization  of  hydrogen  and  helium 
atoms  by  electrons  and  positrons  using  more  accurate  wave  functions  such  as  that  used 
by  Das  and  Seal  [9]  may  produce  better  results. 
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Abstract.  Bound  state  population  dynamics  in  a  diatom  modelled  by  an  appropriate  Morse 
oscillator  with  a  time-dependent  well-depth  is  investigated  perturbatively  both  in  the  absence 
and  presence  of  high  intensity  radiation.  For  sinusoidally  oscillating  well-depth,  the  population 
of  the  mth  bound  vibrational  level,  Pmm(t),  is  predicted  to  be  a  parabolic  function  of  the  amplitude 
of  the  oscillation  of  the  well-depth  (AD0)  at  a  fixed  laser  intensity.  For  a  fixed  value  of  A£>0 ,  Pmm(0 
is  also  predicted  to  be  quadratic  function  of  the  field  intensity  (e0).  Accurate  numerical  calculations 
using  a  time-dependent  Fourier  grid  Hamiltonian  (TDFGH)  method  proposed  earlier  corrobor- 
ate the  predictions  of  perturbation  theory.  As  to  the  dissociation  dynamics,  the  numerical  results 
indicate  that  the  intensity  threshold  is  slightly  lowered  if  the  well-depth  oscillates.  Possibility  of 
the  existence  of  pulse-shape  effect  on  the  dissociation  dynamics  has  also  been  investigated. 

Keywords.    Morse  oscillator;  time-dependent  well-depth;  photo  dissociation. 
PACSNo.    33-80 

1.  Introduction 

The  quantum  dynamics  of  a  periodically  forced  Morse  oscillator  has  attracted  serious 
attention  in  view  of  its  usefulness  as  a  model  for  gaining  insight  into  the  response  of 
molecular  vibrations  when  interacting  with  intense  radiation  or  laser  fields  [1-8]. 
Recent  numerical  studies  on  the  dissociation  dynamics  of  a  hydrogen  fluoride  molecule 
by  monochromatic  sub-picosecond  pulses  have  shown  that  the  dissociation  probability 
(Pd)  is  less  than  10~5  when  the  laser  intensity  is  below  1014w/cm2.  Chelkowski  et  al 
[4-5]  showed  that  the  obtained  rates  of  dissociation  are  sensitive  to  the  shape  of  the 
excitation  pulse  and  an  appropriately  chirped  pulse  can  achieve  very  selective  vibra- 
tional excitation  with  high  efficiency.  Brown  and  Wyatt  [9]  analyzed  the  bottle- 
necking  problem  in  the  multiphoton  dissociation  of  a  diatom  modelled  by  a  Morse 
oscillator  under  laser  irradiation  of  high  intensities.  Recently,  Gangopadhyay  and  Ray 
[10]  proposed  a  theory  of  multiphoton  excitation  and  dissociation  of  a  Morse 
oscillator  in  the  presence  of  dissipation  and  explored  how  the  interplay  of  excitation 
and  dissipation  with  the  non-linearity  could  lead  to  observable  effects.  Adhikari  et  al 
[1 1-12]  observed  numerically,  that  the  multiphoton  dissociation  process  is  characterized 
by  the  existence  of  a  typical  threshold  intensity  and  an  induction  period  in  much  the 
same  way  as  the  multiphoton  ionization  of  atoms  in  strong  laser  fields  [13-15].  Many 
more  studies  on  different  aspects  of  the  multiphoton  dissociation  of  molecules  are  also 
available  in  the  current  literature  [16-19]. 

In  all  these  studies,  the  Morse  oscillator  is  characterized  by  a  fixed  time  invariant 
well-depth  which  is  an  acceptable  picture  for  an  isolated  oscillator.  However,  we  can 


me  weii-aepm  01  me  Morse  oscillator  about  me  mean  or  me  unperturbed  wen-aeptn 
DQB.  How  does  the  oscillating  well-depth  modify  the  dynamics  of  dissociation  of  the 
diatom  under  strong  laser  irradiation?  Does  it  alter  the  bound  state  population 
dynamics  or  affect  the  intensity  or  time  thresholds  observed  for  isolated  diatoms  or  for 
Morse  oscillator  with  constant  well-depths?  We  propose  to  probe  some  of  these 
questions  numerically  by  directly  solving  the  time-dependent  Schrodinger  equation 
(TDSE)  within  the  framework  of  the  time-dependent  Fourier  grid  Hamiltonian  method 
(TDFGH).  The  numerical  study  has  been  bacjced  up  by  a  low  order  perturbative 
analysis  wherever  feasible.  The  plan  of  the  paper  is  as  follows.  In  §  2  we  invoke  the  first 
order  time-dependent  perturbation  theory  to  obtain  the  qualitative  features  of  the 
bound  state  population  dynamics  for  short-time  scales.  The  time-dependent  Fourier 
grid  Hamiltonian  method  [11-12]  is  employed  for  obtaining  more  accurate  and 
quantitative  information  on  the  bound  state  population  as  well  as  dissociation 
dynamics  (§  3).  The  main  features  of  the  perturbative  and  numerical  results  are  analyzed 
in  §  4  for  both  continuous  and  pulsed  irradiation. 

2.  Perturbative  bound  state  dynamics 

Since  the  diatomic  molecule  is  represented  by  a  Morse  potential,  the  unperturbed 
Hamiltonian  (HQ)  is  given  by 

2 

2m  eq 

The  parameters  of  HQ  are  so  chosen  as  to  describe  the  bound  vibrational  levels  of 
hydrogen  molecule  [D0  =  0-17440  a.u.,  ft  =  1-02764  a.u.,  xeq  =  1-40201  a.u.]. 

Now  let  us  suppose  that  the  well-depth  DfXB  of  the  oscillator  oscillates  with  time.  If 
the  time-dependent  well-depth  be  represented  by  D'AB>  the  Hamiltonian  becomes 
time-dependent. 

where  we  have  chosen  DAB  =  D°B  +  AD /(£),/(?)  being  the  temporal  modulator  of  AD 
such  that  at  t  =  0,  Ht  =  H0.  A  simple  choice  for  f(t)  is  f(t)  =  sin  cot.  As  time  proceeds, 
the  well-depth  D^B  therefore  varies  sinusoidally.  The  Hamiltonian  Ht  is  manifestly 
phenomenological.  But  we  may  offer  some  rationalization  in  the  following  way.  If  the 
Morse  oscillator  (H0)  interacts  weakly  with  the  surrounding  (Hs)  through  an  interac- 
tion term  V0s,  the  equilibrated  system  eigenstates  can  be  found  by  solving  the  following 
eigenvalue  equation  [20]. 

c  j  =  £(c°)  <2a) 

^s/  \^s/ 
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Morse  oscillator 

Equation  (2a)  can  be  split  into  a  pair  of  coupled  eqs  (2b)  and  (2c) 

H0C0  +  VQSCS  =  ECQ  (2b) 

F+CO  +  HSCS  =  ECS.  (2C) 

Using  the  fact  that  Cs  =  -  (Hs  -  E}~ 1  V+Q  C0,  we  can  recast  (2a)  as 

[//0-K0s(^s-JE)-1Fs+0]C0  =  £C0 
which  can  be  written  as 


i.e., 


=  £C0. 


Over  a  long-time  scale  Keff  may  be  replaced  by  a  time-averaged  interaction  and  be 
treated  as  a  constant  quantity.  But  while  probing  on  a  very  short-time  scale  one  would 
notice  a  time-varying  Feff .  We  have  treated  Feff  as  a  sinusoidally  oscillating  term  so  that 
#eff  ->  HQ  as  t  ->  0  and  Feff  averaged  over  a  characteristic  period  of  oscillation  is  zero. 
This  simplifies  the  problem  greatly. 

The  other  perturbation  present  in  the  system  is  external  and  works  on  a  faster  time  scale. 
The  characteristics  of  the  time- varying  field  which  couples  with  the  oscillator  are  as  follows, 

V'(x,  t)  =  s0S(t)xsm  cot; 

where  e0  is  the  electric  field  strength  and  co  the  laser  frequency,  and  S(t)  is  a  pulse  shape 
function.  When  the  field  is  continuous,  S(t)  =  1.  For  pulsed  fields  S(t)  can  have  many 
different  forms.  However,  we  shall  make  use  of  the  following  three  forms  only  in  the 
present  study: 

nt 
(i) 


(ii)  5(0=1- 

(iii)  S(t)  =  exp[  —  y(t  —  tp)2] 

where  tp  stands  for  the  pulse  duration.  So  the  total  Hamiltonian  representing  a  Morse 
oscillator  with  oscillating  well-depth  and  interacting  with  an  external  time-varying 
field  is  given  by 


H(t)  =  ~-  +    D°B  +  AD  sin—   (1  -  exp[-  j8(x  -xeq)])2  +  e0S(t)xsmcot 


-  +    D°B  +  AD  sin 

(3) 


(4) 


.<.~.«....         T    DU.rr,      1/^.1     AC    XT«     1      T..K,  1OOC 


where 

-exC-^-Xec,)])2     and     H 


The  zeroth  order  Hamiltonian  (H0)  is  clearly  the  free  Morse  oscillator  Hamiltonian  of 
constant  well-depth  D°AB. 

Now  we  consider  the  different  cases  that  may  arise;  for  e.g.  (i)  when  the  external 
time-varying  field  is  present  but  the  well-depth  does  not  oscillate,  i.e.,  £0  ^  0,  but 
AD  =  0;  (ii)  when  the  well-depth  oscillates  but  the  field  is  absent,  i.e.,  £0  =  0,'but  AD  ^  0; 
(iii)  when  the  well-depth  oscillates  and  the  applied  field  intensity  is  non-zero,  i.e.,  £0  ^  6 
and  AD  ^  0.  In  what  follows  we  make  a  perturbative  estimate  of  the  population  of 
different  levels,  assuming  a  short-time  interval  and  not  too  high  fields. 

(i)  when  e0  ^0,  but  AD=0: 

If  e0  ^  0  and  AD  =  0,  the  Hamiltonian  of  the  system  becomes 


where 

H2(t)  = 

For  the  sake  of  simplicity  we  consider  that  the  field  is  continuous  i.e.  S(t)  =  1.  Thus 


Let  the  system  be  in  the  roth  eigenstate  of  H0,  ($°)  at  t  =  0.  The  perturbation  throws  $° 
into  a  linearly  superposed  state  ¥(*)  with  m 


where 


I 


In  the  first  order  therefore  the  transition  probability  from  an  initial  eigenstate  (n)  of  the 
unperturbed  Morse  oscillator  (HQ)  to  a  state  (m)  caused  by  the  applied  field  is  given  by 


r  i 

L  w"  M"  m  ^    (°t  J 


where 

ai(t,£J  =  £o<m|x|n>|  expH^^u^^ 
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and  <m|x[n>  =  <Oj|x|<&J>,comn  =  (Ej-£j)/ft;  and  e0  is  the  electric  field  strength  oi 
the  applied  radiation,  co  is  its  frequency  which  is  supposed  to  be  slightly  off-resonance. 
\a^(t,£)\2  or  PjJ,m(t,e)  is  then  the  population  of  the  wth  state  of  the  unperturbed  Morse 
oscillator  created  by  the  applied  off-resonance  time-varying  electric  field  of  the 
radiation. 

(ii)  when  e0  =  0  but  AD^O 
In  this  case, 


where 


Now  expanding  the  exponential  term  and  considering  only  terms  up  to  (x  -  xeq)2  we 
obtain, 

H,  (t)  =  H{l\t)  =  AD  sin(27rt)A  P(*  ~  *eq)2- 
The  first  order  transition  amplitude  a^(i)  is  therefore  given  by 


_  "exp  i(a)mn  -  (27i/T))f  -  1      exp  i(comn  +  (2n/ 

X 


The  first  order  transition  probability  from  the  state  n  to  the  state  m  due  to  oscillations  of 
the  well-depth  alone  is  therefore  given  by 

(ADfjS4  ,_..„_.     _. 


I6h2     v-u-      -eq/   .». 
1 


(0)     m-(27T/T))2 


X 

V      V 

+  4  sin: 


2;r\  M       .   /  67r\  /0 

+— Jt/2-smUmn— -jt/2 


r,  AD)  in  fact  gives  the  population  of  the  mth  vibrational  level  of  the  unperturbed 
Morse  oscillator  of  well-depth  DQB  created  by  a  sinusoidal  temporal  fluctuation  of  the 
well-depth  with  an  amplitude  AD. 

(iii)  when  both  EO  ^  0  and  AD  =£  0 

In  this  case 

H(t)  =  H0  +  H^t)  +  H2(t)  =  HG  +  H'(t) 

H'(t)  =  AD  sin(2?rt/T)[l  -  exp [  -  )9(x  -  xeq)] ]2  +  80x  sin  cot. 
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Now  making  approximations  already  introduced  in  (ii),  we  may  write  H'(t)  as 
H'(t)  =  tf  (1)(t)  AD  sin(27rt/r)j8(x  -  x2q)2  +  8xsin  cot. 

The  first  order  transition  amplitude  for  the  oscillator  with  oscillating  well-depth  in  the 
presence  of  the  applied  time-varying  non-resonant  field  is  then  given  by 

al(t 

ttt  V  *    0 


AD/32      ,,  x?,  vfexpzK,  -(27i/T))t-l 

^~<m|(x-xeq)2|7t>     p    mn   ;  '  ;  — 

4lft  q  |_  (fl)mB-(27C/T)) 


The  corresponding  transition  probability  from  an  initial  unperturbed  state  n  to  the 
final  perturbed  state  m  caused  by  the  oscillating  well-depth  and  time-varying  electric 
field  of  the  radiation  that  act  in  unison  is  therefore  given  by 


£2<m|x|n>T4sin2(comn-co)t/2 


n  +  —    /2 
\         \  T/  \ 

ADj3260<(x-xeci)2)mH 


t  r2cos(co-(27t/T))t-2cos(Q}m>,-(27i/T))f  - 
L  KB-(27i/t))(a)mn-a>) 

2  cos(a)  +  (27r/T))r  -  2cos(a)mB  -  (2n/r))t  -  2cos(co 


2cos(cL>  +  (27r/T)f  -  2cos(ctjmn  +  (27r/T))f  -  2cos(ajmn  -co)t  + 
(fl>m.  +  (2«/T))(a)mil-fl)) 

2cos(co  -  (27C/i))t  -  2cos(ajm>,  +  (2n/T))t  -  2cos(comn 


,  e,  AD  =  0)  +  P'mn(t,  e  =  0,  AD)  +  P'mm(t,  e0,  AD)  (g) 


T>  ______         t     T»I  ____ 


P'mm(t,  £0>  AD)  represents  the  part  of  the  time-dependent  population  of  the  mth  state 
of  the  unperturbed  Morse  oscillator  (H0)  that  is  created  by  the  interference  of  the 
population  excitation-deexcitation  dynamics  caused  by  well-depth  oscillation  and  the 
applied  time-varying  field.  The  overall  expression  of  P'mm(t,  fi0>  AD)  suggests  a  parabolic 
dependence  of  the  population  on  the  electric  field  strength  (e0)  when  all  other  system 
parameters  viz.,  AD,  j8,  etc.,  are  fixed.  The  expression  also  reveals  parabolic  dependence 
of  P'mm  on  AD  at  fixed  £0  and  /?.  However,  these  are  first  order  results  and  are  therefore  of 
limited  validity.  We  therefore  propose  to  calibrate  these  predictions  against  near  exact 
solutions  of  the  corresponding  TDSE  equation  by  invoking  the  TDFGH  method 
proposed  earlier  by  Adhikari  et  al  [11-12]  and  later  extensively  applied  by  Sarkar  et  al 
[21-22].  Since  the  TDFGH  method  is  not  yet  widely  known,  we  would  briefly  discuss 
the  salient  features  of  the  method. 

TDFGH  method 

Let  us  start  with  the  TDSE, 


&  ---  (9) 

where 

2 

H  =  H0+  V'(x,  t)  =  |-  +  F(x)  +  F'(x,  t).  (10) 

We  can  employ  the  FGH  method  [23-24]  to  evaluate  the  eigenfurictions  and  eigen- 
values of  HQ  giving 

H0|0,0(x)>  =  fi?|Oj)(x)>,     i  =  l,2,...N,  (11) 

where  N  is  the  number  of  grid  points  used  for  representing  |<$?>  in  the  co-ordinate 
space. 

|<D°>=  £  |x4>Axw9  (12) 

1=1 

{wf}  in  (12)  represent  the  values  of  the  co-ordinate  representative  of  the  state  function 
|  $9  (x)  >  at  the  grid  points,  the  values  of  which  are  obtained  by  the  standard  variational 
recipe  [23-24].  The  FGH  method  can  be  used  to  propagate  the  wave  function  on  the 
same  grid  as  shown  by  Adhikari  et  al  [1 1-12].  Thus,  when  the  perturbation  is  switched 
on,  the  state  function  ^(x,  t)  is  given  by 

N 

|xP(x,t)>=  Y"  |x£>Axw£(t).  (13) 

The  amplitudes  (wt(t))  are  now  time-dependent  quantities  and  their  evolution  equa- 
tions are  as  follows 


Once  initial  values  of  {wj  are  specified,  (14)  can  b&  numerically  integrated  on  the 
specified  grid  by  Runge-Kutta  or  Bulirsch-Stoer  method  [25].  The  matrix  elements  of 
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H0  and  V  of  (14)  can  be  evaluated  using  the  FGH  method 
<x  I  H  I  Y  \  -  *  f   v    exP  [2nl(i  -  T\ln 


where  +*<***' 


and 


and 


in  the  unperturbed  ground    ate  S  P'  f"?31™  *  f  ltched  on-  "the  system  is  initially 
change  with  time  «  ?S  ^ptturbafon  V=r  ;<0)  =  °  W  *  !)'  F°r  J  >  °'  S'  W  be«in  to 

unperturbed  system  S  (tKSn  ^  excltations  to  occ«  with  respect  to  the 

^^ 


coro      T  mPPOM  *  *e  unperturbed 

follows,  we  discuss  the  neltlsul,  °I^"  ^Sta?tfWeU-dePth.  »  «"*  case).  In  what 
expectadons  based  on  first  ™ 


3.  Results  and  discussion 

(1)  Bound  state  population  dynamics 


(AD  =  0)  computed  numericy  by   he  TDFGH  ^  C°nStam  W*dePth 

applied  radiation  interacts  with  it  FiL^l^T^i,  "  "*  deCtriC  field  of  the 
population  [eq.  (6)]  versus  time  proflk  of  he  fir  ,  "^  '  perturbati«:ly  computed 
time  profiles  for  other  excited  £*£*  *  P°PU'ati°n  versus 
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Figure  l(a).  Growth  and  decay  of  level  populations  of  the  ground  and  the  first 
three  excited  states  of  a  Morse  oscillator  with  constant  well-depth  as  the  electric 
field  of  the  applied  radiation  interacts  with  it. 
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Figure  l(b).    Perturbatively  computed  population  versus  time  profile  of  the  first 
excited  state  of  the  same  system  depicted  in  figure  l(a). 


features  of  the  population-time  profiles,  although  the  excitation  or  relaxation  rates  in 
the  presence  of  the  well-depth  oscillation  may  be  quite  different  from  their  constant 
well-depth  counterparts  (see  later). 

Figure  3  (a)  shows  the  numerically  computed  (by  TDFGH  method)  population  of 
a  vibrational  level  (k  =  2)  of  the  Morse  oscillator  with  oscillating  well-depth  (AD  =£  0)  as 
a  function  of  the  intensity  (e0)  of  the  applied  field.  The  overall  parabolic  nature  of  the 
profile  mirrors  the  prediction  of  the  first  order  TDPT.  Figure  3(b)  shows  the  behaviour 
of  the  numerically  computed  population  Pkk(t)  —  AD  profile  for  a  fixed  value  of'e0\  the 
applied  field  strength.  From  the  figure  it  is  seen  that  the  population  of  the  given  state 
monotonically  increases  as  AD  increases.  Thus,  the  perturbation  theoretical  prediction 
that  the  dependence  of  the  population  on  AD  would  be  parabolic  for  a  fixed  value  of  e0 
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population  dynamics  of  the  three  highest  bound  vibrational  levels  for  the  AD  ^  0  case 
which  is  basically  similar  to  that  observed  for  the  lower  states.  From  the  observed 
population  dynamics  it  appears  that  if  the  applied  field  intensity  is  below  a  certain 
threshold,  excitation  is  the  only  process  to  reckon  with  and  the  dissociation  probability 
is  negligible.  Once  the  threshold  is  crossed,  rapid  onset  of  dissociation  of  the  diatomic 
molecule  sets  in.  It  would  therefore  be  interesting  to  know  how  the  dissociation 
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Figure  2.  Growth  and  decay  of  level  populations  of  the  ground  and  the  first  three 
excited  states  of  the  Morse  oscillator  in  the  presence  of  the  radiation  when 
well-depth  oscillates  sinusoidally  with  time. 
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Figure  3(a)  .Numerically  computed  population  (by  the  TDFGH  method)  of 
a  v,brat,onal  level  (k  =  2)  of  a  Morse  oscillator  with  oscillating  wdl-depft MJ>1  V> 
as  a  function  of  the  intensity  (e)  of  the  applied  field.  ' 
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Figure  3(b).    Numerically  computed  population  of  the  k  =  2  level  shown  as 
a  function  of  AD  for  a  fixed  value  of  e,  the  strength  of  the  applied  field. 
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Figure  3(c).    Numerically  computed  population  versus  time  profiles  of  the  three 
highest  bound  vibrational  levels. 


probability  depends  upon  the  applied  field  intensity,  when  the  well-depth  oscillates. 
The  specific  questions  to  be  probed  are  the  following.  Does  the  well-depth  oscillation 
alter  the  dissociation  dynamics  observed  for  a  constant  well-depth  Morse  oscillator?  Is 
there  any  pulse-shape  effect  on  the  temporal  characteristics  of  the  dissociation  process? 
Before  focussing  on  these  aspects  of  the  problem,  we  would  like  to  see  whether  an 
oscillating  well-depth  affects  the  excitation  or  relaxation  rates  induced  by  the  field. 

Let  us  suppose  that  we  start  with  the  system  in  the  ground  state  of  the  unperturbed 
oscillator  at  t  =  0.  Figures  3(d)  and  (e)  show  the  growth  of  the  population  of  the  first 
excited  state  as  the  applied  field  interacts  with  the  Morse  oscillator  of  (i)  fixed,  and 
(ii)  oscillating  well-depth,  respectively.  The  limiting  slopes  of  these  curves  give  the  total 
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Figure  3  (d).    Growth  of  the  population  of  the  first  excited  state  as  the  applie 
interacts  with  the  Morse  oscillator  of  fixed  well-depth  (e  =  0-06,  AD  =  0-0). 
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Figure  3  (e).    Growth  of  the  population  of  the  first  excited  state  as  the  applie 
interacts  with  the  Morse  oscillator  of  oscillating  well-depth  (e  =  0-06,  AD  = 


rate  of  excitation  into  the  first  excited  state,  which  are  recorded  in  table  1.  For  a  I 
oscillator  with  a  time-dependent  well-depth,  the  excitation  rate  is  larger  sh« 
constructive  interference  between  the  field-induced  and  the  oscillating  well- 
controlled  excitations.  If  we  start  with  the  oscillator  in  an  excited  state,  we  can  sirr 
compute  the  rate  of  relaxation  caused  by  the  external  field  as  well  as  by  the  in 
perturbation  set  up  in  the  system  due  to  oscillation  in  the  well-depth.  For  k  =  1 
these  rates  are  also  reported  in  table  1  both  when  AD  =  0,  e0  ^  0  and  AD  ^  0,  t 
Thus,  the  well-depth  oscillation  is  seen  to  enhance  the  excitation  rates  but  dep're 
relaxation  rates  in  a  Morse  oscillator  interacting  with  the  radiation. 
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Table  1.     Computed  excitation  and  relaxation  rates  of  a  given  vibrational  level  of  a  Morse 
oscillator  in  the  absence  and  presence  of  well-depth  oscillation. 


System 


Rate  of  excitation  to  the 
first  excited  state 


Rate  of  relaxation  of  the 

first' excited  state 

(PS'1) 


i)  Morse  oscillator 
with  fixed  well-depth 

ii)  Morse  oscillator 

with  oscillating  well-depth 


i)  1-79 
ii)  2-84 
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Figure  4(a).    Dissociation  probability  versus  time  profile  for  the  oscillator  with 
(i)  fixed  well-depth  (ii)  oscillating  well-depth,  for  a  fixed  value  of  e0. 
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Figure  4  (b).  Computed  dissociation  probability  Pd  as  a  function  of  the  intensity  of 
the  applied  field  of  the  laser  for  the  oscillator  with  (i)  time-invariant  well-depth 
(ii)  time-varying  well-depth. 
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Figure  5(a-c).  Profiles  of  the  ground  state  vibrational  wave  function  of  H2  (modelled 
by  a  morse  oscillator)  as  it  dissociates  under  laser  irradiation,  (a)  ¥(x,  t)  at  t  =  0;  (b) 
x,0au  =  25fs.  The  peak  intensity  of  the •  pulse  is£o  =  0-06au. 
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(ii)  Dissociation  dynamics 

(a)  Dissociation  probability,  time  and  intensity  thresholds:  We  first  consider  the  case, 
when  the  field  is  continuous.  Figure  4(a)  shows  the  Pd-time  profile  for  the  oscillator 
(i)  with  fixed  well-depth  (ii)  with  oscillating  well-depth,  for  a  fixed  value  of  s0.  The 
oscillating  well-depth  is  seen  to  lower  the  time-threshold  slightly  with  respect  to  the 
constant  well-depth  Morse  oscillator. 

Figure  4(b)  shows  how  the  computed  dissociation  probability  of  a  Morse  oscillator 
at  a  given  time  (t)  varies  with  the  intensity  of  the  applied  time-varying  field  with  (i) 
time-invariant  well-depth,  (ii)  time- varying  well-depth.  In  both  cases,  a  clear  threshold 
intensity  (e°  =  £0)  is  seen  to  exist  beyond  which  the  dissociation  probability  rises 
sharply.  From  the  figures,  one  would  be  tempted  to  conclude  that,  the  oscillation  of 
well-depth  lowers  the  threshold  intensity  compared  to  that  observed  in  the  constant 
well-depth  case,  when  illumination  is  continuous. 

We  have  monitored  at  this  point  the  wave  function  as  the  molecule  dissociates. 
Figures  5(a-c)  show  the  wavefunction  at  different  stages  of  evolution  after  the  field  is 
switched  on.  As  the  system  gains  energy  from  perturbing  field,  more  and  more  nodes 
appear  in  ¥(*,  t)  and  the  effective  spatial  extension  of  ^(jt,  t)  increases. 

When  a  pulsed  field  is  used,  the  picture  that  emerges  from  the  continuous-field- 
oscillator  model  is  seen  to  change  a  lot.  Figure  6 (a)  shows  how  the  dissociation 
probability  varies  with  the  field  intensity  £0  in  (i)  the  fixed  well-depth  case,  (ii)  the 
oscillating  well-depth  case,  when  the  intensity  varies  as  sm2(nt/tp).  From  the  figures 
a  picture  opposite  to  that  observed  in  the  continuous  field  case  is  seen  to  emerge  i.e.,  the 
oscillating  well-depth  increases  the  threshold  intensity  of  dissociation  when  a  pulsed 
field  is  used.  Figure  6(b)  shows  the  dissociation  probability  versus  time  profile  in 
(i)  continuous  laser  field;  (ii)  pulsed  laser  field  with  sin2  dependence;  (iii)  triangularly 
pulsed  laser  field;  (iv)  laser  with  Gaussian  pulse  shape,  when  the  peak  intensity  (e0)  and 
laser  frequency  (a>)  are  the  same  in  each  case.  From  the  figures  it  is  seen  that  compared 
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Figure  6  (a).    Dissociation  probability  versus  intensity  profile  for  the  oscillator 
with  (i)  fixed  well-depth  (ii)  oscillating  well-depth  for  pulsed  laser  fields. 
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Figure  6(b).  Dissociation  probability  versus  time  profile  for  (i)  continuous  laser 
field  (ii)  pulsed  laser  field  with  sin2  dependence  (iii)  triangularly  pulsed  laser  field, 
(iv)  Gaussian  pulsed  laser  field. 


to  continuous  irradiation,  the  pulsed  laser  field  lowers  the  dissociation  probability  of 
the  diatom  at  any  given  time.  But  the  gross  features  of  the  Pd  versus  time  profile  for 
three  different  types  of  pulsed  fields  are  not  very  different  [4-5].  It  would  be  interesting 
to  analyze  the  kind  of  effects  that  a  Morse  oscillator  with  a  randomly  oscillating 
well-depth  would  bring  about.  We  will  address  ourselves  to  this  problem  in  a  future 
communication. 

4.  Conclusions 

By  studying  the  bound  state  population  and  dissociation  dynamics  of  a  model  diatomic 
species  whose  binding  energy  oscillates  with  time,  we  come  to  the  following  con- 
clusions: 
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tions  as  long  as  me  perturbation  is  small  ana  snort-time  scales  are  considered, 
(ii)  When  the  well-depth  oscillates  slowly,  which  is  the  actual  situation  when  the  system 
interacts  with  the  surrounding  medium,  the  threshold  intensity  decreases  to  a  small 
extent  for  dissociation  under  continuous  irradiation;  but,  for  pulsed  fields,  oscillating 
well-depth  is  predicted  to  increase  the  threshold  intensity  of  dissociation, 
(iii)  The  nature  of  the  dissociation  probability  versus  time  profiles  and  the  actual  extent 
of  dissociation  appears  to  be  almost  the  same  for  pulses  of  different  shapes  that  we  have 
studied.  However,  a  different  choice  of  Feff  could  change  the  picture  completely.  The 
problems  of  microscopic  derivation  of  Feff  and  thus,  Heff  and  a  fully  quantum 
dynamical  study  of  the  processes  of  our  interest  are  presently  under  way. 
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Abstract.  Temporal  evolution  of  electronegativity  and  hardness  associated  with  a  collision 
process  between  a  Be  atom  and  a  proton  has  been  studied  within  a  quantum  fluid  density 
functional  framework.  In  the  presence  of  a  third  collisional  partner  to  take  away  excess  energy,  this 
collision  may  lead  to  a  chemical  reaction  producing  a  BeH  +  molecule.  For  comparison  ab  initio  SCF 
level  calculation  (with  6-3  1G**  basis  set)  on  BeH  +  molecule  with  different  geometries  have  been 
performed.  Electronegativity  equalization  and  maximum  hardness  principles  are  analyzed. 

Keywords.    Electronegativity;  hardness;  structure;  dynamics. 
PACS  Nos    31-20;  34-50;  82-20;  82-40 

Chemical  concepts  like  electronegativity  [1-4]  and  hardness  [5-9]  have  been  found  to 
be  important  in  understanding  various  physico-chemical  processes.  Electronegativity 
was  first  defined  by  Pauling  [1]  as  the  power  of  an  atom  in  a  molecule  to  attract 
electrons  to  itself  and  was  shown  to  be  related  to  chemical  potential  (n)  in  density 
functional  theory  (DFT)  as  [3,4] 

dE 

v(r) 

where  E,  N,  v(r)  are  electronic  energy,  number  of  electrons  and  external  potential 
respectively.  The  concept  of  hardness  which  was  introduced  by  Pearson  [5]  has  been 
defined,  within  DFT,  as  [6-9] 

lf^     =  (2) 

77  2- 


2dNv(r) 

Using  finite  difference  approximations  and  assuming  Koopmans'  theorem  to  be  valid, 
fj.  and  q  become  [7-10] 

/*=*-(/  +  A)/2  *  (8LUMO  +  eHOMo)/2  (3) 

n*(I-  A)/2  ~  (8LUMO  -  eHOMo)/2.  (4) 

Here  /  and  A  are  ionization  potential  and  electron  affinity  respectively  and  £HOMO  anc* 
£LUMO  are  energies  of  highest  occupied  and  lowest  unoccupied  molecular  orbitals 
respectively. 
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In  a  chemical  reaction,  difference  of  electronegativity  drives  the  electron  transfer  from 
a  species  of  lower  electronegativity  to  one  with  higher  electronegativity  until  their  . 
electronegativity  values  become  equal.  According  to  Sanderson's  electronegativity 
equalization  (EE)  principle  [3,  11-15]  this  molecular  electronegativity  is  equal  to  the 
geometric  mean  of  isolated  atoms'  electronegativities.  An  important  principle  of  molecular 
electronic  structure  theory  associated  with  hardness  concept  is  hard-soft-acid-base 
(HSAB)  principle  [5,  16]  which  states  that  hard  acids  will  prefer  to  react  with  hard  bases 
and  soft  acids  with  soft  bases  to  form  a  stable  molecule.  This  principle  is  closely  related  to 
maximum  hardness  principle  (MHP)  [17-19]  which  states  that  "there  seems  to  be  a  rule 
of  nature  that  molecules  arrange  themselves  so  as  to  be  as  hard  as  possible".  Pearson 
[17]  postulated  MHP  on  the  basis  of  his  vast  experience  on  chemical  reactions.  The 
first  formal  proof  of  it  for  a  class  of  states  (cf.  eq.  (14)  of  ref.  18)  has  been  provided  by  Parr 
and  Chattaraj  [18].  Thus  during  molecule  formation  or  in  general  in  a  chemical  reaction, 
electronegativity  of  an  atom  changes  until  it  attains  the  molecular  electronegativity 
value  and  its  hardness  gets  maximized  which  is  the  consequence  of  MHP.  Therefore  a 
chemical  reaction  dynamics  can  be  envisaged  in  terms  of  temporal  evolution  of  electro- 
negativity  and  hardness  during  a  chemical  reaction.  Several  systems  and  processes  have 
been  studied  in  order  to  understand  the  implications  of  EE  [13-15,20,21],  MH  [22-33] 
and  HSAB  [23,  25-27]  principles.  In  this  letter  we  study  a  collision  process  between  a  Be 
atom  and  a  proton  and  study  the  temporal  evolution  of  %  and  r\  during  the  process  within 
a  quantum  fluid  density  functional  (QFDF)  framework  [37-40].  Since,  in  the  presence  of 
a  third  partner  to  take  away  the  excess  energy,  this  collision  may  lead  to  the  formation  of 
BeH+,  which  is  a  stable  closed  shell  molecule  with  D0  =  3-14  eV  and  Re  =  1-3122  A  [34-36] 
we  also  supplement  this  calculation  by  ab  initio  SCF  calculations  [41].  In  both  the  cases 
X  and  »/  have  been  expressed  as  functions  of  internuclear  distance  (R)  and  hence  have 
implicit  temporal  dependence.  Ab  initio  calculations  have  been  carried  out  at  SCF  level 
with  6-13  G**  basis  set  for  different  internuclear  distances  of  BeH+.  It  should  be  noted 
here  that  the  former  calculation  is  within  a  dynamical  context  and  hence  the  excited  states 
get  mixed  during  temporal  evolution  whereas  the  latter  calculation  refers  to  movement 
from  one  ground  state  to  another. 

In  QFDF  formalism  [37-40]  which  is  an  amalgamation  of  quantum  fluid  dynamics 
(QFD)  and  time-dependent  density  functional  theory  (TDDFT),  time-evolution  of 
electron  density  (p)  and  current  density  (7)  can  be  found  by  solving  a  generalized 
nonlinear  Schrodinger  equation  (GNLSE).  The  electron  density  and  current  density  in 
turn  uniquely  determine  all  other  properties  of  the  system  [42,43].  The  GNLSE  is 
given  by  [37-40], 


t)  =  i  (5) 

with 

and      =        V-V 


where  £'  is  the  velocity  potential.  In  eq.  (5)  the  effective  potential  ueff  is  given  by 
[20,37-39] 


ett     dp      dp 
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where  TNW  and  Exc  denote  non-Weizsacker  part  in  kinetic  energy  (KE)  and  exchange 
correlation  energy  functionals  respectively,  explicit  forms  of  which  have  been  given 
earlier  [20, 37, 38].  Although  Hohenberg-Kohn  theorems  [44]  assert  the  existence  of 
these  functionals,  their  exact  forms  are  still  not  known  even  for  the  ground  state  and 
various  approximations  with  respective  merits  and  demerits  have  been  proposed  [45]. 
The  external  potential  denoted  by  uext(r,  t)  is  as  given  below, 


where  R(t)  refers  to  the  classical  trajectory  of  two  nuclei  and  has  been  calculated  by 
standard  straightline  approximation  [37].  Since  veKi(r,  t)  vanishes  at  the  boundary,  the 
validity  of  TDDFT  [42,43]  allows  us  to  legitimately  assume  that  the  mappings 
yext(r>  t)~+p(r,  t)  and  vexl(r,  t)  -*j(r,  t)  are  uniquely  invertible  and  accordingly  the  total 
time-dependent  Hamiltonian  is  a  unique  functional  of  p(r,  t)  andj(r,  0-  Therefore,  in 
this  context  one  can  define  a  time-dependent  energy  quantity,  E(t),  as  a  density 
functional  [46]  and  the  corresponding  definition  for  the  time-dependent  chemical 
potential  can  be  given  by  [20] 


ext,. 
dp       2  op      J  \r  —  r\          r        op 

(8) 
Since  fj.  is  constant  over  the  whole  space  in  a  molecule,  if  at  a  distance  r^,  we  get  [20] 


the  chemical  potential  becomes  equal  to  total  electrostatic  potential  at  r^  in  that  time 
step  as 


-  XW  =  MO  =     irZTjdr  -7  +  »«t(r,,  ')•  (10) 

J  \rn     rl         r\i 

At  time  f  =  0,  (9)  and  (10)  become  identical  to  that  given  by  Politzer  etal  [47] 
considering  principle  of  equalization  of  electronegativity  and  the  distance  r^  was  found 
to  match  well  with  the  covalent  radii  of  atoms. 

Temporal  evolution  of  r\  has  been  calculated  at  every  time  step  from  the  following 
expression  [48,49] 


rj(t)=  \f(r,t)ri(r,t)dr  (11) 

J 

where  77  (r,  t)  is  the  local  hardness  at  time  t  given  by  [48,49] 

^(r' 0  =  ^77  hr~? — N  g  /  /  ^(r')dr'.  (I2) 

2N  J  dp(r,t)dp(r,t) 

Here  F\_p^  is  the  usual  Hohenberg-Kohn  functional  explicit  approximate  form 
of  which  has  been  given  elsewhere  [20,37].  The  Fukui  function,  /(r)  has  been  given 


by  [50] 


dN 


(13) 


This  can.  be  approximated  [32, 33]  by  neglecting  the  smaller  second  term  in  the 
expression  of  Hellmann-Feynman  force  [48]  written  as  the  sum  offerees  acting  on/(r) 
and  a  term  involving  the  product  of  the  matrix  elements  of  perturbation  caused  by 
small  changes  in  external  potential  and  local  hardness.  Thus/(r)  becomes  [32,33] 


P(r,t) 

N    ' 


(14) 


Same  approximate  form  of  f(r)  has  been  found  by  Gazquez  expressing  density  of 
charged  species  as  the  renormalized  electron  density  of  the  neutral  species  [51]. 

It  is  important  to  note  that  exact  forms  for  T\_p]  and  £xc[p]  are  not  known  even  for 
the  ground  state  [45].  As  it  has  been  done  in  all  earlier  time-dependent  calculations 
within  QFDF  framework  [37-40],  in  this  paper  also  it  has  been  assumed  that  T[p] 
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Figure  1.  The  plot  of  electronegati vity  (%)  versus  internuclear  distance  (R)  of  BeH  + 
(a  supermolecule  as  in  ref.  37)  calculated  within  a  quantum  fluid  density  functional 
(QFDF)  framework. 
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and  £xc[p]  have  the  same  forms  in  ground  and  excited  states.  This  assumption  need 
not  always  be  valid.  The  present  state  of  knowledge  of  excited-state  DFT  does  not  leave 
any  other  alternative  [38]. 

Time  evolution  of  the  system  as  well  as  its  properties  like  electronegativity  and 
hardness  has  been  found  by  solving  eq.  (5)  as  an  initial-boundary  value  problem.  The 
numerical  solution  starts  with  a  near  Hartree-Fock  [52]  density  of  Be-atom  in  a  XS 
state  and  a  proton  with  a  KE  value  of  5  keV  approaching  it  for  a  head-on  collision. 
A  leap-frog  finite  difference  scheme  which  is  stable  for  quantum  mechanical  equations 
of  motion  due  to  the  presence  of  i(i2  =  —  1)  [53]  has  been  adopted  here  to  solve  eq.  (5). 
The  numerical  method  has  been  reported  in  detail  earlier  [20, 38, 39].  In  ab  initio  SCF 
calculations  [41]  %  and  r\  have  been  calculated  from  (3)  and  (4)  using  energies  of  HOMO 
and  LUMO  obtained  at  different  internuclear  distances. 

The  electronegativity  profiles  obtained  from  QFDF  and  ab  initio  calculations  have 
been  presented  in  figures  1  and  2  respectively.  The  electronegativity  profile  obtained 
from  QFDF  calculation  (figure  1)  shows  some  initial  transients  as  the  electron  density 
gets  reoriented  around  the  nucleus  due  to  nonlinear  effective  potential  field,  yeff .  After 
elapse  of  certain  time  the  electron  density  does  not  change  appreciably  which  is  akin  to 
some  sort  of  dynamic  equilibrium.  At  this  stage,  x  attains  a  constant  value  over  a  range 
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Figure  2.    The  plot  of  electronegativity  (x)  versus  internuclear  distance  (jR)  of  BeH + 
obtained  through  an  ab  initio  SCF  calculation. 
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KE(  2i  5  keV).  Hence  m  the  encounter  regime  where  internuclear  distance  is  too  small, 
electronegativity  profile  shows  sharp  fluctuation  due  to  rapid  nonlinear  charge  oscilla- 
tions at  the  vicinity  of  the  two  nuclei.  On  the  other  hand,  the  SCF  results  being 
associated  with  ground  states  of  BeH+  at  different  internuclear  distances,  the  electro- 
negativity  profile  does  not  show  oscillations  but  increases  monotonically  in  the  process 
of  molecule  formation  and  reaches  a  plateau  corresponding  to  the  molecular  electro- 
negativity  value  (figure  2).  It  has  been  found  in  this  SCF  calculation  that  as  the  proton 
approaches  Be  atom,  charge  (Mulliken  population)  on  Be  atom  decreases  from  3-44  at 
#  =  4-3  A  to  the  minimum  value  of  3-06  at  the  equilibrium  bond  distance 
(Re=  1-3 122  A)  and  it  increases  again  as  BeH+  molecule  ion  is  squeezed  beyond  its 
equilibrium  configuration. 

Figures  3  and  4  present  the  hardness  profiles  obtained  from  QFDF  and  ab  initio  SCF 
calculations  respectively.  Both  figures  show  rise  in  hardness  as  the  two  nuclei  approach 
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Figure  3.  The  plot  of  hardness  fa)  versus  internuclear  distance  (R)  of  BeH+  (a 
supermolecule  as  in  ref.  37)  calculated  within  a  quantum  fluid  density  functional 
(QFDF)  framework. 
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Figure  4.    The  plot  of  hardness  (77)  versus  internuclear  distance  (R)  of  BeH+ 
obtained  through  an  ab  initio  SCF  calculation. 


each  other.  Hardness  in  figure  3  rises  sharply  as  the  proton  approaches  Be  atom.  In  this 
dynamical  process,  as  the  electron  density  of  Be  atom  gets  accumulated  near  the 
nucleus  and  does  not  change  appreciably  in  course  of  time,  r\  becomes  constant  after 
attaining  a  high  value  which  may  be  considered  as  the  dynamical  variant  of  maximum 
hardness  principle  [32, 33].  In  SCF  method,  however,  rise  in  77  is  not  so  sharp  along  the 
reaction  path  of  formation  of  BeH  +  molecule.  In  this  case  r\  passes  through  a  maximum 
which,  however,  is  not  at  equilibrium  bond  distance  of  BeH  +  because  the  extrema  for 
the  electronic  energy  and  the  nuclear  repulsion  energy  do  not  appear  in  the  same  point 
of  the  potential  energy  curve  [19, 22, 31]. 

It  may  be  noted  that  the  major  features  in  %  and  rj  profiles  obtained  from  structure 
and  dynamics  calculations  do  not  match  in  general.  The  differences  in  numerical  values 
as  well  as  other  features  in  the  profiles  are  probably  due  to  the  following  reasons.  The 
dynamical  formalism  involves  mixing  of  excited  state  densities  with  ground  state 
density  as  is  evidenced  from  nonvanishing  current  density,  while,  in  ab  initio  SCF 
method,  calculations  were  restricted  to  ground  states  with  different  internuclear 
geometries  of  BeH+.  The  electron  correlation  has  not  been  taken  into  account  in  ab 
initio  SCF  calculations.  Exact  forms  for  kinetic  and  exchange-correlation  energy 
functionals  of  DFT  are  not  known  [45].  Hence  approximate  forms  for  these  functionals 
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are  used  and  their  validity  in  excited  states  is  tacitly  assumed.  It  is  in  the  spirit  of  Slater  s 
transition  state  approach  [54]  where  the  ground  state  Hartree-Fock-Slater  formalism 
is  considered  to  be  applicable  to  excited  states  as  well.  Straight-line  approximation  has 
been  used  for  the  calculation  of  classical  trajectory  of  nuclei.  Finally  the  expression  for 
f(r)  given  in  (14)  is  an  approximation  which  may  be  modified  by  incorporating 
appropriate  inhomogeneity  corrections. 

This  study  compares  the  i  and  r\  profiles  generated  through  electronic  structure  and 
dynamics  calculations.  In  SCF  calculation,  electronegativity  increases  from  a  small 
value  corresponding  to  that  of  Be  atom  when  proton  is  much  away  from  it  and 
increases  monotonically  as  proton  approaches  and  finally  reaches  an  asymptote  which 
may  be  identified  as  the  electronegativity  of  BeH+  molecule.  This  is  a  consequence  of 
the  electronegativity  equalization  principle.  Hardness  also  increases  as  the  inter- 
nuclear  distance  decreases  and  passes  through  a  maximum  in  the  neighbourhood  of 
equilibrium  configuration,  as  expected  from  the  maximum  hardness  principle.  In  the 
dynamical  situation  rapid  charge  oscillation  is  observed  in  the  encounter  zone  and 
excited  state  densities  get  mixed  with  the  ground  state  density. 
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ract.  Using  ab  initio  data  of  the  potential  surface  for  the  ground  state  of  NH3  a  function  is 
ined  to  represent,  with  a  very  high  precision,  the  potential  along  the  inversion  coordinate. 
inversion  spectra  of  NH3,  ND3,  NH2D  and  NHD2  are  found  by  numerically  solving  the 
•odinger  equation  for  this  potential.  Comparison  is  made  of  the  calculated  inversion 
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ntroduction 

5  inversion  of  the  NH3  molecule,  in  which  a  nitrogen  atom  oscillates  between  both 
;s  of  the  plane  defined  by  the  hydrogen  atoms  (figure  1)  is  an  old  problem  that  has 
n  extensively  studied  by  means  of  experimental  and  empirical  methods  [1-12]  while 
ab  initio  approaches  have  been  rather  scarce  [13-15]. 

ieveral  types  of  double-well  potentials  have  been  used  [3,  8,  9,  12-14]  to  simulate  the 
srsion  barrier  of  NH3,  but  perhaps  the  simplest  analytical  way  to  represent  this 
ential  is  by  means  of  a  quartic  anharmonic  oscillator  with  a  symmetrical  double- 
1  [15]: 

K(x)=-ix2  +  ;bc4,    A>0 

h  an  energy  spectrum  consisting  of  doublets,  one  member  of  each  doublet  belonging 
i  symmetrical  state  and  the  other  to  the  antisymmetrical  state.  It  is  known  [16]  that 
Schrodingerequationforthis  potential  is  not  solvable  by  perturbation  theory  since 
perturbative  series  in  powers  of  the  parameter  X  is  divergent  for  an)  jvaluc  >  o  >  <X 
ne  years  ago,  Bunker  et  al  [13]  calculated  the  energy  surface  for  NH3  and  showed 
t  the  inversion  barrier  can  be  predicted  from  the  first  * 


>se  some  points  from  the  energy  surface  calculated  by 
*u~«  ~n  JL  cm,™  to  find  an  eiehth  degree  polynomial  to  re 


Figure  1.    The  double-well  potential  for  the  inversion  of  the  NH3  molecule. 


proposed,  from  which  the  quartic  anharmonic  oscillator  potential  with  double-well 
is  found  and  the  molecular  constants  and  the  inversion  frequencies  for  NH3  are 
determined. 

2.  The  quartic  anharmonic  potential 

If  we  assume  that  the  inversion  potential  V(x)  can  be  represented  by  a  fourth  degree 
polynomial  then  the  corresponding  Hamiltonian  can  be  written  as: 

~  Ip,      2aX      2  ^  ' 

where  /i  is  reduced  mass. 

This  Hamiltonian  can  be  reduced  to  that  of  a  quartic  anharmonic  oscillator  with 
double-well 

by  means  of  the  transformations: 

p    and    Q  =  (  TJ  1     *• 
The  energy  unit  E  and  the  anharmonic  constant  of  (2)  are  expressed  as: 

V)     ' 

3.  The  inversion  frequencies  for  NH3 

The  inversion  of  NH3  may  be  treated,  to  a  good  approximation,  as  a  problem  of  one 
particle  in  a  one-dimensional  potential  V(x\  where  x  represents  the  inversion  coordinate, 


(3a) 
(3b) 
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i.e.,  it  measures  the  distance  of  the  N  atom  to  the  plane  of  the  hydrogen  atoms.  Both 
quantities,  x  and  F(x),  are  given  in  atomic  units.  If  R  stands  for  the  distance  of  the 
N  atoms  to  one  of  the  H  atoms  and  a  is  the  angle  between  the  H-N-H  bonds,  then  the 
inversion  coordinate  will  be  given  by: 

x  =  [1  -fsin2^)]1/2^  =  R  sin  /?. 

where  /?  is  the  angle  between  the  N-H  bond  and  the  plane  of  the  H  atoms.  If  m  and 
M  are  the  masses  of  the  H  and  N  atoms,  respectively,  the  reduced  mass  may  be 
expressed  as  /x  =  3m(M  +  3msin2/?)/(M  +  3m).  Unlike  other  works  [14,15],  in  the 
present  one  the  angle  )3  is  not  taken  as  21°  49',  corresponding  to  the  experimental 
equilibrium  configuration  [11],  but  it  is  calculated  by  interpolation  from  the  data  in 
table  1  of  ref.  [14],  which  determine  an  angle  of  22° 44'  for  the  theoretical  equilibrium 
configuration. 

As  already  mentioned,  there  exists  a  number  of  suggestions  for  the  analytical  form  of 
V(x),  and  in  this  work  we  propose  the  use  of  a  fourth  degree  polynomial  which 
satisfactorily  reproduces  the  points  of  the  potential  surface.  This  is  achieved  by  taking 
the  same  six  points  (table  1,  ref.  [14])  chosen  by  Palma  et  al  [14]  from  the  54  point 
calculated  by  Bunker  et  al  [13]  using  ab  initio  methods. 

The  fourth  degree  polynomial  V(x)  =  cQ  +  cix2  +  c2x4  which  best  reproduces  the 
mentioned  data  has  c0  =  -  56-4064,  c>  =  -  3-034493  x  10"  2  and  c2  =  2-666604  x  10"  2. 
With  these  values  the  energy  unit  e,  eq.  (3  a),  the  anharmonic  constant  A,  eq.  (3b),  and  the 
other  molecular  constants  are  readily  determined  for  NH3 .  Some  of  these  constants  are 
summarized  in  table  1.  Reference  [10]  is  the  best  empirical  calculation  ever  made, 
where  a  potential  of  the  form  V(x)  —  aq2/2  +  bq4/2  +  dexp(  —  cq2)  was  used;  the 
parameters  ft,  a,  b,  c  and  d  were  varied  until  the  value  of  the  energy  closest  to  the 
experimental  value  was  obtained.  The  experimental  values  for  the  barrier  height  and 
the  equilibrium  distance  [5]  are  2020  +  12cm'1  and  0-382  +  0-004  A,  respectively. 

As  shown  in  §2,  the  Hamiltonian  associated  with  the  fourth-degree  polynomial 
potential,  eq.  (1)  reduces  to  that  of  a  double-well  anharmonic  oscillator,  eq.  (2).  The 
corresponding  Schrodinger  equation  was  numerically  solved  for  the  NH3  and  its 
isotopic  species.  Two  numerical  methods  were  employed,  the  first  being  described  in 
ref.  [14, 19],  while  the  second  is  mentioned  in  ref.  [15, 17],  and  the  results  found  were 


Table  1.    Values  of  the  molecular  constants  for  NH 


Molecular  constant 

This  work 

Ref.  [10] 

Mg) 

4-263  x  10-  24 

4-263  x  10-  24 

a(dynes/cm) 

9-45  x  10~4 

9-42  x  IO4 

&(dynes/cm3) 

2-96  x  IO21 

0-47  x  IO21 

c(cm~~2) 

0-00 

4-269  x  IO16 

d(ergs) 

0-00 

2-452  x  10-  12 

Vg^m"1) 

790-3 

789-9 

H  (Barrier  height,  cm"  x) 

1894-5 

2018 

^(Equilibrium  dist.,  A) 

0-399 

0-382 

^(Equilibrium  angle) 

22°  44' 

22°  12' 
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Table!.    Vibrational  spectrum  for  NH3(A  =  0-02607)  along  the  inver- 
sion coordinate  in  cm" 1.  (s  =  symmetrical,  a  =  antisymmetrical). 

State        Ref.  [13]       Ref.  [14]       Present  work      Experiment  [5] 


o* 

0-0 

0-0 

0-0 

0-0 

0" 

1-02 

0-995 

0-969 

0-793 

1* 

986-7 

924-12 

928-92 

932-51 

r 

1032-3 

981-04 

981-46 

986-70 

2s 

1682-0 

1559-57 

1561-41 

1597-60 

T 

2011-6 

1931-34 

1913-56 

1910-00 

3s 

2473-40 

2435-88 

2383-46 

3" 

3065-91 

2998-67 

2895-48 

Table  3.    Vibrational  spectrum  for  ND3(A  =  0-01970)  along  the  inver- 
sion coordinate  in  cm"1,  (s  =  symmetrical,  a  =  antisymmetrical). 


State 


Ref.  [14] 


Present  work          Experiment  [5] 


os 

0-0 

0-0 

0-0 

O" 

0-048 

0-047 

0-053 

r 

756-01 

757-55 

745-7 

ia 

760-84 

761-91 

749-4 

2s 

1338-91 

1344-31 

1359 

2" 

1442-98 

1438-90 

1429 

3s 

1818-64 

1811-33 

1830 

3" 

2151-52 

2127-62 

2106-6 

identical.  In  table  2  we  present  the  values  of  the  Vibrational  spectrum  of  NH3  predicted 
by  this  model,  the  experimental  values  [5],  and  the  results  obtained  by  Palma  et  al  [14] 
and  by  Bunker  et  al  [13]. 

Although  we  have  taken  only  six  points  of  the  surface  potential  calculated  by  Bunker 
to  determine  V(x)  through  a  least  squares  fitting,  from  tables  2  and  3  it  may  be  seen  that 
the  obtained  results  are  very  accurate. 


4.  Isotopic  species  of  NH3 

From  the  Born-Oppenheimer  approximation,  the  same  potential  curve  for  NH3  may 
be  used  for  the  isotopic  species  ND3,  NH2D  and  NHD2  and  the  results  must  also  be 
satisfactory  since  the  deuterium  nucleus  is  heavier  than  that  of  hydrogen.  From  table 
3  it  can  be  seen  that  our  results  are  very  similar  to  those  obtained  using  an  eight  degree 
polynomial  [14]  as  the  inversion  potential  for  ND3. 

The  results  obtained  for  the  isotopic  species  NH2D  and  NHD2  are  shown  in  table  4. 
In  these  cases,  the  motion  is  more  complex  but  we  will  still  assume  that  the  potential 
surface  is  the  same  as  for  NH3.  In  table  5,  our  results  are  compared  to  some 
experimental  values,  for  example  the  ground  state  splitting  A0  =  1 0"  -  0s  \  and  the  first 
excited  state  splitting  Aj  =  1 1°  -  15|.  Further,  also  in  that  table  we  show  the  value  of  k, 
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Table  4.  Vibrational  spectrum  for  NH2D 
(A  =  0-02354)  and  NHD2  (A  =  0-021 52)  along  the 
inversion  coordinate  in  cm" 1.  (s  —  symmetrical, 
a  =  antisymmetrical) 


State 


NH,D 


NHD, 


os 

0-0 

0-0 

O" 

0-342 

0-133 

1s 

854-16 

799-17 

1" 

877-32 

809-58 

2s 

1443-16 

1379-14 

2" 

1681-40 

1536-35 

3s 

2122 

1930-49 

3° 

2580-77 

2313-36 

Table  5.     Splitting  energies  for  NH3,  ND3,  NH2D  and  NHD2  in 

cm  ~ 1.  (s  =  symmetrical,  a  =  antisymmetrical) 


Experiment        Calculated          Ref.  [14] 


Ref.  [13] 


NH3 

AO 

0-793 

0-969 

0-995 

1-02 

AI 

35-81 

52-54 

56-92 

45-6 

1s 

932-51 

92S.-92 

924-12 

986-7 

1° 

968-32 

981-46 

981-04 

1032-3 

k 

1-0 

ND3 

AO 

0-053 

0-047 

0-048 

AI 

3-7 

4-36 

4-83 

1* 

745-7 

757-55 

756-01 

la 

749-4 

762-91 

760-84 

fc 

1-323 

NH2D 

AO 

0-406 

0-342 

A! 

19-7 

23-16 

1s 

874 

854-16 

1" 

894 

877-32 

k 

1-128 

NHD2 

A0 

0-1705 

0-133 

AI 

9-84 

10-41 

1s 

808 

799-17 

r 

818 

809-58 

k 

1-233 

the  square  root  of  the  ratio  of  the  reduced  mass  of  a  given  species  to  the  reduced  mass 


ofNH3. 


The  experimental  values  of  A0  and  At  for  NH2D  and  NHD2  have  been  taken  from 
refs.  [10, 18].  From  table  5  it  can  also  be  observed  that  the  fourth  degree  potential 


Pramana  -  J.  Phys.,  Vol.  45,  No.  1,  July  1995 


79 


s\  iv  siquiriu  KL  ui 

used  in  this  work  produces,  in  general,  better  results  than  the  eight  degree  potentials 
[13,14]. 

5.  Conclusions 

The  energy  levels  for  ND3  are  not  particularly  sensitive  to  the  type  of  potential  used  to 
model  the  inversion,  and  this  may  be  seen  from  table  4  where  the  results  obtained  using 
a  fourth  degree  polynomial  and  those  from  ref.  [14],  where  an  eighth  degree  poly- 
nomial are  practically  the  same;  we  believe  that  the  NH2  D  and  NHD2  molecules  must 
behave  similarly,  whereas  for  the  case  of  NH3  a  difference  is  indeed  observed  between 
our  results  and  those  of  ref.  [14],  ours  being  closer  to  the  experimental  values. 

The  discrepancy  between  our  calculated  values  (especially  the  higher  energy  levels) 
and  the  experimental  values  may  be  produced  by  the  fact  that  the  fourth  degree 
polynomial  increases  indefinitely  as  the  inversion  coordinate  x  increases,  while  the  true 
potential  has  a  limit  before  molecular  dissociation  takes  place  (figure  1). 

The  points  of  the  potential  surface  completely  determine  the  observables  of  the 
problem  (spectrum  and  molecular  constants),  so  that  in  the  present  work  it  has  not  been 
necessary  to  rely  on  experimental  values  of  the  geometry  or  to  use  adjustable 
parameters,  i.e.,  everything  is  determined  from  first  principles. 

The  molecular  constants  found  with  this  model  are  very  similar  to  those  obtained  by 
Swalen  and  Ibers  [10]  with  a  totally  empirical  method,  where  five  adjustable  par- 
ameters were  present.  Finally,  it  must  be  pointed  out  that  only  experimental  data  for 
NH2D  and  NHD2  are  available  in  the  literature,  whereas  no  theoretical  data  cal- 
culated from  first  principles  can  be  found. 
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Abstract.  We  describe  a  simple,  low  cost  pc  based  card  for  the  measurement  of  heat  capacity 
using  adiabatic  calorimetry  at  low  temperatures.  This  card  provides  the  control  pulse  to 
the  sample  heater  as  well  as  trigger  pulse  to  the  nano-voltmeter  which  monitors  the  sensor 
voltage  (Ge  sensor,  Lake  Shore  Inc.,  USA).  We  have  also  added  a  12  bit  DAC  on  this  card 
and  this  is  used  for  remote  setting  of  the  heater  current  of  an  old  SHE  (now  Biomagnetic, 
Inc.,  USA,  model  CCS)  analog  constant  current  source.  Although  this  card  is  used  here  for 
heat-capacity  measurements,  the  same  can  also  be  used  for  thermo-power  and  thermal- 
conductivity  measurements. 

Keywords.    Low  temperature;  automation;  heat-capacity. 
PACSNo.    07-20 

1.  Introduction 

Low  temperature  heat-capacity  measurements  are  always  very  important  to  the 
solid  state  physicists  [1,2].  Many  of  the  phase  transitions  (such  as,  magnetism  or 
superconductivity)  can  be  confirmed  only  after  performing  the  heat-capacity  measure- 
ments at  low  temperatures.  Usually,  the  variation  of  the  heat-capacity  of  solids  with 
temperatures  is  very  large  at.  low  temperatures.  Depending  on  the  nature  of  the 
samples,  the  mass  of  the  sample  used  in  the  heat-capacity  measurements  may  vary  as 
much  as  two  orders  in  magnitude.  It  is  therefore  essential  that  the  apparatus  used 
for  this  measurement  should  allow  flexible  control  of  the  heating  power  applied  to 
the  sample  at  low  temperatures.  One  normally  uses  a  programmable  current  source 
(Keithley,  model  220,  USA)  which  has  a  GPIB  interface  and  this  is  used  to  control 
the  value  of  the  heater  current  as  well  as  the  duration  of  the  heater  pulse  in  the  heat- 
capacity  measurements.  In  this  paper,  we  describe  a  PC  based  (compatible  with  IBM 
PC)  card  which  can  be  used  along  with  the  analog  current  source  in  place  of  the 
programmable  current  source  which  is  generally  used  for  heater  current  supply. 
Moreover,  the  card  provides  a  trigger  pulse  for  every  nth  (n  =  1  to  999)  second  to  the 
nanovoltmeter  which  monitor  the  sensor  voltage  within  lOjusec  after  the  heater  is 
turned  off.  This  helps  us  to  measure  accurately  the  sample  temperature  as  well  as  the 
time  to  determine  its  thermal  relaxation  time  of  the  sample  which  is  related  to  its 
heat-capacity  at  that  temperature.  Further,  this  circuit  provides  automatic  control  for 
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2.  Circuit  details 

Figure  1  shows  how  the  card  and  relay  drivers  are  connected  in  the  system.  We  have 
used  a  commercial  card  (from  M/s  Dynalog  Microsystems  Ltd.,  India)  which  provides 
the  basic  I/O  facilities  like  addresses,  chip-select  lines,  read,  write  and  other  I/O  control 
lines.  This  is  connected  to  an  I/O  slot  of  an  IBM  compatible  personal  computer  and 
uses  the  power  supply  of  the  host  computer.  On  this  card,  we  have  built  the  necessary 
interfaces  for  the  microvoltmeter  HP,  model  34401  A,  and  the  constant  current  sources 
(M/s  Keithley  model  225  and  M/s  Biomagnetic  Inc.,  model  CCS).  However,  one  can  use 


8  BIT  DATA 


SHE  CORPORATION 

CURRENT  SOURCE 

FOR  KCATER 


Figure  1.    Schematic  diagram  of  the  connection  of  the  interface  card  and  relay  unit 
to  the  host  computer,  microvoltmeter  and  the  analog  dc  current  source. 


any  IEEE-488  programmable  microvoltmeter  and  any  analog  dc  current  source  with 
voltage  programmable  facility  with  this  interface  card. 

The  voltage  output  from  a  calibrated  Germanium  sensor  (M/s  Lake  Shore  Inc., 
USA)  is  connected  to  the  nanovoltmeter  (Keithley,  model  1 82,  USA)  for  monitoring  the 
temperature.  The  card  provides  a  facility  to  measure  the  voltage  for  the  forward  and 
reverse  current  to  eliminate  the  error  due  to  thermo-emfs.  The  sensor  voltage  is 
monitored  every  two  seconds  (selectable  through  software)  for  a  period  of  2-3  min 
(programmable  using  software).  After  the  sensor  temperature  is  stabilized,  a  heat  pulse 
is  given  using  the  timer  and  DAC  built  on  the  I/O  card.  At  the  end  of  the  timer  pulse  to 
the  heater,  trigger  pulses  are  generated  from  the  card.  The  sensor  voltage  data  are  sent 
to  the  computer  every  one  second  by  this  external  trigger  pulse  to  the  nanovoltmeter. 
The  data  are  acquired  by  the  PC  using  a  IEEE-488  interface  bus.  There  is  a  very  short 
time  delay  ( <  10  /*sec)  between  the  first  trigger  pulse  and  the  end  of  heater  current  pulse. 

Figures  2  and  3  give  the  details  about  the  timer  interfaces  of  the  card  along  with  the 
DAC  for  heater  current  control  and  also  the  sensor  current  reversal.  With  the  help  of 
software  (written  in  Quick  Basic,  4.5,  Microsoft,  USA),  a  current  reversal  pulse  is  sent 
using  the  C  port  pin  1 0  of  1C  1 1  [3] .  This  1C  is  configured  with  all  ports  as  output  ports. 
The  A  and  B  ports  are  used  for  outputting  12  bit  data  to  the  DAC.  The  C  port  of  the 
IC1 1  also  provides  a  stop  pulse  option  for  switching  off  the  current  pulse.  The  timer 
gate  is  open  for  clock  pulses  with  the  start  pulse  for  the  heater  current  source.  The  clock 
frequency  is  100 kHz.  Along  with  the  0-1  s  pulse  from  pin  16  of  IC11,  Ql  of  IC2  goes 
high  and  Q2  of  IC2  gives  0-1  s  pulse.  This  opens  one  of  the  gates  of  IC3  and  gives 
100  kHz  clock  pulses  to  first  counter  in  I.C1  [3].  This  IC1  has  three  16  bit  counters.  The 
first  and  second  counters  are  configured  as  rate  generators.  These  clock  pulses  are  used 
in  generating  the  trigger  pulses  for  nanovoltmeter  and  current  pulse  for  the  heater. 
These  pulses  are  given  to  third  counter  in  IC1  and  this  is  configured  as  a  monoshot.  The 
period  of  the  monoshot  is  preselectable.  This  period  should  have  integer  values  in  order 
to  synchronize  with  trigger  pulses  and  this  period  is  obtained  by  loading  the  counter. 
The  output  of  the  monoshot  goes  low  (negative  pulse)  with  the  first  clock  pulse 
provided  the  gate  pulse  is  already  received.  The  same  gate  pulse  is  used  to  reset  the 
"divide  by  2  counter"  (first  flip-flop  of  IC6).  The  input  to  IC6  is  1  s  (selectable  using 
software)  pulses.  The  output  of  the  IC6  (two  second  pulses)  are  prevented  during  the 
heater  current  pulse  by  the  monoshot  output  of  1C  1.  At  the  end  of  this  current  pulse,  the 
two  second  pulses  come  out  of  one  of  the  gates  of  IC3,  which  are  given  to  the  first 
monoshot  of  IC7.  This  monoshot  gives  two  second  negative  pulses  (Ql)  for  triggering 
the  voltmeter.  The  monoshot  pulse  of  IC1  is  inverted  and  used  to  drive  the  reed  relays, 
which  supply  the  heater  current  pulse.  The  relays  have  2  NO  (normally  open)  contacts. 
The  relay  RL1  is  active  with  the  monoshot  pulse  of  IC1  and  this  gives  current  to  the 
heater.  The  relay  RL2  keeps  the  heater  current  source  short  circuited  when  IC1 
monoshot  pulse  is  absent.  For  sensor  current  reversal,  we  have  another  set  of  relay  RL3 
and  RL4  as  shown  in  figure  3.  The  driving  pulses  to  all  the  relays  are  fed  via  two 
opto-isolators  (MCT2E)  and  this  ensures  complete  isolation  for  these  pulses. 

The  heater  current  amplitude  is  controllable  using  an  external  dc  voltage  in  an  old 
SHE's  (model  CCS)  constant  current  source.  We  have  modified  this  current  source  so 
that  the  output  current  can  be  programmed  using  an  external  dc  voltage.  The  unit 
requires  1  V  dc  for  the  full  amplitude  in  any  current  range.  However,  one  can  use  any 

Pramana  -  J.  Phys.,  Vol.  45,  No.  1,  July  1995  83 


ixJ         ro 


<t  -j 

in  CD 
oo  2  W 
wf  2 

S*l 

f£8 

-  a. 


o 
i 
10  Q  OQ 


-  <  S 
cvT  ro  <r  «s  OD  2  ^  oc  £!  '- 


<J  OOOOOOO^O 

o. 

i. 


'     1 

N 
N 

ro 

O 

*      S 

5    s 

o 

N 
_      i 

< 

a 

s   d 

ed 

3 
or 

N 

ffi 

^ 

8 

rH 

I 


u 


84 


Pramana  -  J.  Phys.,  Vol.  45,  No.  1,  July  1995 


O  U 

h-  1 

^ 

> 

(M 

h-  5 

U 

_J 
U 

o 

Ul         \ 

\ 

N 

N 

OC 

o  v> 

tf-  o 

<    0 
111    CC 

X    U. 

— OUUUUULUJLr-— ' 


Pramana  -  J.  Phvs.,  Vol.  45,  No.  1.  Julv  1995 


R  S  Sannabhadti  et  al 

analog  voltage  programmable  current  source  with  this  card.  The  DAC  1C  is  AE 
from  Analog  Devices,  USA  [4].  The  12  bit  binary  0/P  from  A  and  B  ports  of  I1 
connected  to  this  DAC.  The  external  reference  to  this  DAC  is  derived  by  using  £ 
581  (Analog  Devices)  and  a  buffer  which  gives  a  10  V  reference.  The  DAC  ou1 
connected  to  an  operational  amplifier  LM308  [5]  which  gives  a  voltage  outpu 
feedback  path  is  completed  by  the  resistors  inside  the  DAC.  This  output  is  us 
heater  current  control. 

3.  Heat  capacity  measurements 

Heat-capacity  measurements  are  performed  in  a  standard  glass  cryostat  which  c< 
of  outer  liquid  N.2  dewar  and  an  inner  dewar  where  liquid  He  is  transferred.  The  1 
capacity  of  N2  dewar  is  3  litres  and  that  of  He  is  1-3  litres.  The  hold  time  for  He  d« 
16  hours  which  is  enough  to  perform  heat-capacity  measurements  from  1-5  K  tc 
We  have  used  adiabatic  heat  pulse  technique  to  measure  the  heat-capacity  of  ma 
in  the  above  mentioned  temperature  range.  In  this  technique,  the  base  temperate 
is  sampled  via  measuring  the  output  from  the  nanovoltmeter  (sensor  voltage)  eve 
second  for  3  minutes  and  if  the  drift  rate  is  less  than  5  //K/sec,  a  heat  pulse  is  app 
the  sample.  After  a  preselected  time  duration  (software  selectable)  of  this  heater  p 
a  given  temperature  range,  the  nanovoltmeter  is  triggered  every  second  to  measi 
time  vs  temperature  data  for  at  least  two  minutes  (software  selectable)  [7].  i 
triggering  operations,  current  settings  (heater  and  sensor  currents)  and  current  r< 
(for  sensor  current)  are  preformed  by  the'pc  card.  The  heat-capacity  is  calculate* 
the  expression, 

dT-=p- 


where  P  is  the  heating  power  of  the  heat  pulse,  Ts  is  *the  sample  temperature  i 
denotes  the  sample  temperature  in  the  absence  of  external  heating  and  is  determi 
extrapolating  the  pre-heating  drift  curve  as  shown  in  figure  4.  It  can  be  shown  t 
the  post-heating  period,  U(t)  =  Ts  —  T0  is  given  by, 


where  th  denotes  the  heating  time  and  j8  =  i'1  and  the  heating  starts  at  t  = 
relaxation  time  is  defined  as  T  =  C/K  where  C  is  the  heat-capacity  and  K:  is  the  t; 
conductance  of  the  heat  leak.  Figure  4  shows  T  vs  t  measurement  cycle  of  the  hea 
method,  when  i  is  comparable  to  th.  The  pre-  and  post-heating  drift  cun 
measured  during  a  time  tm.  The  post-heating  drift  curve  is  registered  after  a  t 
delay  time  td  following  the  heating  interval  th  in  order  to  allow  for  internal  temp< 
equilibrium  of  the  sample  and  the  sample  holder  assembly.  The  post-heating  te: 
ture  drift  curve  is  fitted  to  eq.  (2).  At  low  temperatures  (T  <  20  K),  a  linear  ver 
eq.  (2)  is  used  (retaining  only  the  first  order  terms)  and  at  high  temperatures, 
non-linear  expression  is  used.  The  fitted  expression  is  then  extrapolated  to  the  mi 
the  heating  period,  t  =  tJ2  to  obtain  U(tJ2l  The  heat-caoacitv  itself  is  cal< 


U(t) 


TP(t) 


0  th        TIME  (t) 

Figure  4.     A  schematic  plot  of  temperature  (T)  vs  time  (t)  in  the  pre-  and  post- 
heating  interval  of  the  heat-capacity  measurements. 


iteratively  from  the  expression, 


24 


(3) 


The  third  term  is  used  for  the  evaluation  of  C  at  low  temperatures.  A  mixed  language 
[6]  (BASIC-FORTRAN-BASIC)  has  been  used  for  controlling  the  instruments  (via 
IEEE-488  interface)  and  real  time  evaluation  of  heat-capacity  data. 

4.  Results 

The  pc  card  is  used  in  a  standard  adiabatic  calorimeter  to  measure  the  heat-capacity  of 
single  crystal  copper.  The  temperature  dependence  of  the  heat-capacity  (Cp)  is  shown  in 
figure  5.  The  heat-capacity  from  1-5  K  to  10  K  is  fitted  with  the  expression, 


S~i      


(4) 


The  inset  shows  the  Cp/T  vs  T2  plot  at  low  temperatures.  Our  values  (Cp  and  y)  agree 
with  the  literature  values  [7].  We  have  also  studied  the  heat  capacity  of  a  superconduc- 
tor Y  gOs^ej  0  and  the  data  are  shown  in  figure  6.  The  inset  shows  the  superconduct- 
ing transition  at  8-9  K  [8, 9]  in  this  compound.  The  heat-capacity  of  this  sample  is 
given  by, 

Cp  =  yT+  fiT3  +  ST5  (5) 

where  the  first  term  is  due  to  electronic  contribution,  the  second  and  third  terms  are 
lattice  and  anharmonic  contribution  to  the  heat  capacity  from  10  K  to  30  K.  The  values 
of  y,  ft  and  6  for  this  sample  and  for  single  crystal  copper  are  given  in  table  1. 

This  card  can  also  be  used  in  the  measurement  of  thermo-power  of  metals  and  alloys. 
The  onboard  DAC  and  timer  can  be  used  for  giving  heater  pulse  to  the  auxiliary  heater 
in  standard  thermo-power  setup  and  the  voltmeters  can  be  triggered  at  the  end  of  this 
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Figure  5.  Temperature  dependence  of  the  heat-capacity  of  single  crystal  coppei 
from  1-5  K  to  30  K.  The  inset  shows  Cp/T  vs  T2  at  low  temperatures.  The  solid  line  i; 
a  fit  to  the  eq.  (4)  (see  text). 
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Figure  6.  Temperature  dependence  of  the  heat-capacity  of  Y5  Os4Ge10  from  45 1 
to  25  K.  The  inset  shows  Cp  vs  T  at  low  temperatures.  The  solid  line  is  a  fit  to  the  ec 
(5)  (see  text). 


heat  pulse.  The  time  dependence  of  the  voltage  (dV/dt)  and  the  time  dependence  c 
temperature  (dT/dt)  can  be  computed  and  the  ratio  (dV/dT)  yields  the  thermo-powe: 
In  the  case  of  thermal-conductivity  measurements,  one  gives  known  value  of  the  hea 
pulse  (selectable  using  software)  to  the  auxiliary  heater  using  this  card  and  by  noting  th 
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Table    1.    Heat-capacity  data  of  single   crystal   copper   and   poly-crystalline 
Y5Os4Ge10.  ^^ 

Sample  Range  y  fi  <> 


K 

mJ/mole  K2 

mJ/mole  K4 

mJ/mole  K6 

Copper 

1-5-10 

0-7 

0-0476 

— 

Y5Os4Ge10 

10-30 

24-4 

2-1 

0-00013 

temperature  difference  across  the  sample.  Then  one  can  calculate  the  thermal-conduc- 
tivity of  the  specimen. 

5.  Conclusion 

We  have  described  a  simple  substitute  for  a  programmable/timer  current  source  using 
PC  based  card  in  conjunction  with  an  analog  dc  current  source.  The  card  eliminates  the 
use  of  two  such  programmable  current  sources  in  the  heat-capacity  experiments.  The 
simple  design  with  accurate  timer  and  very  low  cost  ( <  200  US  Dollars)  are  the  main 
features  of  this  interface  card.  Further,  this  card  can  also  be  used  in  other  measurements 
like  thermo-power  which  require  both  timer  and  programmable  current  sources. 
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Abstract.  We  formulate  a  theory  of  generalized  Fock  spaces  which  underlies  the  different  forms 
of  quantum  statistics  such  as  'infinite',  Bose-Einstein  and  Fermi-Dirac  statistics.  Single-indexed 
systems  as  well  as  multi-indexed  systems  that  cannot  be  mapped  into  single-indexed  systems  are 
studied.  Our  theory  is  based  on  a  three-tiered  structure  consisting  of  Fock  space,  statistics  and 
algebra.  This  general  formalism  not  only  unifies  the  various  forms  of  statistics  and  algebras,  but 
also  allows  us  to  construct  many  new  forms  of  quantum  statistics  as  well  as  many  algebras  of 
creation  and  destruction  operators.  Some  of  thess  are:  new  algebras  for  infinite  statistics, 
^-statistics  and  its  many  avatars,  a  consistent  algebra  for  fractional  statistics,  null  statistics  or 
statistics  of  frozen  order,  'doubly-infinite'  statistics,  many  representations  of  orthostatistics, 
Hubbard  statistics  and  its  variations. 

Keywords.  Fock  spaces;  quantum  statistics;  g-deformations;  quantum  groups;  Hubbard  model; 
orthostatistics. 

PACS  Nos    03-70;  05-30;  02-20;  71-27 

1.  Introduction 

Recently,  much  effort  has  been  devoted  to  ^-deformations  of  oscillators.  Both  single 
oscillators  [1-12]  as  well  as  systems  of  oscillators  [13-33]  have  been  studied.  However, 
inspite  of  the  larger  literature  which  now  exists,  a  unified  picture  of  multioscillator 
systems  covering  the  various  ^-deformations  and  algebras  that  have  been  proposed, 
has  not  emerged. 

The  aim  of  the  present  work  is  to  construct  a  general  formalism  which  may  help  one 
to  develop  such  a  unified  picture.  We  construct  a  theory  of  generalized  Fock  spaces 
which  has  sufficient  flexibility  to  encompass  all  types  of  oscillator  algebras  that  have 
been  proposed  in  the  past  as  well  as  those  that  may  be  proposed  in  the  future.  Using  this 
formalism,  we  are  able  to  classify  and  clarify  the  interconnections  that  exist  between 
different  g-deformations  and  different  algebras. 

In  another  paper  [34]  we  have  pointed  out  that  as  far  as  a  single-oscillator  is  concerned, 
g-defqrmation  does  not  lead  to  anything  fundamentally  new,  and  is  merely  equivalent 
to  a  change  of  variable.  A  q-deformed  oscillator  is  just  a  different  representation  of  the 
usual  oscillator.  On  the  other  hand,  when  we  go  to  multioscillator  systems,  new  things 
are  possible;  these  are  the  new  forms  of  statistics.  However,  even  here,  we  find  that  many 
of  the  ^-deformations  for  multioscillator  systems  proposed  in  the  literature  again 
belong  to  the  category  of  substitution  or  change  of  variables  and  should  be  regarded 
only  as  different  representations  of  the  well-known  Bose-Einstein  or  Fermi-Dirac 
statistics.  Particular  mention  must  be  made  of  the  work  of  Greenberg  [13, 14]  in  this 


nothing  but  Bose  or  Fermi  statistics  in  disguise,  Greenberg's  proposal  involves  a  new 
statistics,  called  infinite  statistics  and  this  statistics  is  infact  based  on  a  new  Fock  space 
which  is  much  larger  than  the  usual  bosonic  or  fermionic  Fock  spaces. 

In  order  to  construct  a  unified  theory,  we  find  it  convenient  to  start  with  the 
underlying  space  of  the  allowed  states  of  the  system.  We  construct  the  creation  and 
destruction  operators  <?f  and  c  as  outer  products  of  the  state  vectors.  It  is  this  device  of 
starting  with  the  set  of  state  vectors  of  the  system  as  the  primary  concept,  that  cuts 
through  the  jungle  of  different  algebras  which,  prima  facie  look  different,  but  on  closer 
examination  are  found  to  be  related.  Two  sets  of  creation  and  destruction  operators 
which  are  related  by  substitution,  operate  on  the  same  space  of  state  vectors  and  hence 
describe  essentially  the  same  class  of  systems. 

We  first  formulate  the  theory  of  the  generalized  Fock  spaces.  The  key  element  is  the 
notion  of  independence  of  the  permutation-ordered  states.  The  largest  linear  vector 
space  constructed  in  this  way  is  the  super  Fock  space.  The  subsequent  specification  of 
a  subset  of  states  in  this  space  as  null  states  leads  to  many  reduced  Fock  spaces.  The 
general  theory  which  applies  to  the  super  Fock  space  as  well  as  to  the  reduced  Fock 
spaces,  all  of  which  are  to  be  called  collectively  as  generalized  Fock  spaces,  then  allows 
us  to  construct  annihilation,  creation  and  number  operators.  Whereas  the  annihilation 
and  creation  operators  and  their  algebras  even  for  a  particular  Fock  space  are  not 
unique  and  many  representations  are  possible,  a  universal  representation  for  the 
number  operators  valid  for  all  forms  of  statistics  and  algebras  exists. 

We  apply  the  formalism  to  the  super  Fock  space  as  well  as  to  the  bosonic  and 
fermionic  Fock  spaces,  the  latter  being  the  most  familiar  examples  of  reduced  Fock 
spaces.  Super  Fock  space  is  characterized  by  a  unique  statistics  named  'infinite 
statistics'.  However,  the  same  infinite  statistics  can  be  represented  by  different  algebras 
of  c  and  cf  and  we  give  a  number  of  representations  of  infinite  statistics.  In  the  bosonic 
and  fermionic  Fock  spaces,  many  different  forms  of  statistics  can  be  defined  and  for 
each  form  of  statistics  many  different  representations  of  c  and  cf  are  possible. 

Although  the  main  aim  of  the  general  framework  presented  here  is  to  develop 
a  unified  picture  of  the  various  forms  of  statistics  and  algebras,  the  same  formalism  also 
allows  us  to  construct  a  large  number  of  new  forms  of  quantum  statistics  as  well  as  new 
algebras  of  c  and  cf.  In  fact  there  is  no  limit  to  the  number  of  possibilities. 

An  important  part  of  our  work  deals  with  two-indexed  systems.  Our  general 
formalism  for  the  single-indexed  system  applies  to  most  multi-indexed  systems  also 
since  in  most  cases  a  multiple!  of  indices  can  be  mapped  into  a  single  index.  But,  we 
show  that  there  exist  certain  situations  where  such  a  mapping  is  not  possible. 
Consequently,  we  develop  the  general  formalism  for  the  two  indexed  system  and 
discover  an  enormously  rich  variety  of  novel  forms  of  quantum  statistics  and  algebras. 

The  plan  of  the  paper  is  as  follows.  The  theory  of  the  generalized  Fock  spaces  is  given 
in  §  2.  In  §  3  we  apply  the  general  theory  to  the  super  Fock  space  and  in  §  4  we  apply  it  to 
the  reduced  bosonic  and  fermionic  Fock  spaces.  Section  4  also  contains  a  new  statistics 
called  null  statistics.  The  alternative  approach  of  starting  with  ccf  algebras  and  deriving 
cc  relations  therefrom  is  presented  in  §  5.  Two-indexed  systems  are  treated  in  §  6  and 
Appendix.  Section  7  is  devoted  to  summary  and  discussion. 

A  quick  overview  of  the  paper  may  be  obtained  from  the  pictorial  summaries  given  in 
figures  3  and  4  and  the  tabular  information  provided  in  tables  i-4. 
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*jivtju  a  aw,  i  ui  usi.uiai.uj.  a  wnu  muiucs  g,  n,  i  •  •  •  m,  we  construct  me  state  vector 

l«g5«h-"«m;l>  =  IV^lh---lh--lm--lm>  (2.1) 

"g  "h  nm 

On  the  r.h.s.  of  this  equation,  lff,  !,,-••  lm  appear  ng,nh---nm  times  respectively  and 
ng---nm  denote  the  number  of  quanta  with  indices  g  •  •  •  m  respectively.  Together  with  the 
state  (2.1)  we  consider  the  set  of  all  states  obtained  through  all  distinct  permutations  of 
the  entries  on  the  r.h.s.  of  (2.1).  Thus,  we  have 

\ns,nh---nm,2y  =  \l^\hlgl^-..l^^my  (2.2) 

Wg  ~~  1  WH  ~  1  1m 


(2.3) 


|»s,nh-..nm;s>  =  |l^m...l^V^>  (2.4) 

nm  «h  «g 

Here  we  have  given  examples  of  a  few  permutations.  In  (2.2),  the  positions  of  one 
^-quantum  and  one  /i-quantum  has  been  interchanged  and  in  (2.3),  we  have  indicated 
a  few  more  interchanges.  Collectively,  we  shall  denote  the  set  of  all  these  states  as 

\ng,nh~>nm;ny,    ^  =  l,2,...s  (2.5) 

where  s  is  the  total  number  of  distinct  permutations  and  fj,  labels  each  of  these  states.  It 
is  easy  to  see  that 

\-"-"/ 


ng\nh\---nm\ 

(In  (2.3),  n  has  been  put  35  rather  arbitrarily).  We  assume  the  existence  of  a  unique 
vacuum  state  corresponding  to  zero  occupation  number  for  all  the  oscillators: 

|0>  =  |0,0,0...0>  (2.7) 

We  shall  first  consider  the  set  of  all  the  s  states  given  in  (2.5)  as  linearly  independent. 
Although  they  are  linearly  independent,  they  may  not  be  orthogonal  to  each  other  in 
general,  nor  are  they  normalized.  However,  a  state  in  one  sector  characterized  by  the 
occupation  numbers  (ng,nh...  nj  is  orthogonal  to  any  state  in  another  sector  character- 
ized by  another  set  of  occupation  numbers  (n'g,rih...rij.  We  can  summarize  these 
statements  by  the' equation: 

ff*     Q\ 

Note  in  particular  that  the  inner  product  vanishes  even  if  a  single  occupation  number 
does  not  match.  Within  the  same  sector,  the  inner  product  is  given  by 

<n  ...nm;.a|n  ...nm;j8>  =  Ma/J  (2-8a) 
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dimensions,  one  corresponding  to  each  sector  {ng.  ..nm}.  We  choose  all  these  matric 
to  be  positive  definite.  This  set  of  inner-product  matrices  Af  plays  an  important  role 
the  general  formalism. 

From  the  set  of  linearly  independent  vectors  given  in  (2.5),  it  is  possible  to  constru 
an  orthonormal  set  of  vectors  which  we  shall  denote  by  a  double-barred  ket: 

||n,...nm;Ai»;    /*  =  !...«.  (2, 

These  satisfy  the  orthonormality  relation 

««;...  n'm\d.\\ng...nm^yy  =  5^ni...dn^md^  (2.1 

There  is  no  unique  way  of  doing  this  and  the  resulting  orthonormal  set  is  not  uniqi 
One  may  use  Gram-Schmidt  orthogonalization  procedure  or  calculate  the  eigenstat 
of  the  inner-product  matrix  M  or  follow  any  other  method.  Whatever  may  be  tl 
method,  one  can  write  the  relation  connecting  the  two  sets  of  kets: 

||«r..«m;^»=  ^\XVfl\ng...nm;vy  (2.1 

V 

and  the  inverse  relation: 

K..  .»„;£»  (2.1 


where  X  is  a  non-singular  matrix.  Although  X  is  not  unique  (since  it  depends  on  tl 
particular  orthogonalization  procedure  used),  it  is  possible  to  show,  usir  ^  (2.8),  (2.K 
(2.11)  and  (2.12),  that  XX^  is  the  inverse  of  the  innerproduct  matrix  [34a]. 

M-l  =  XX*  (2.1 

Thus,  one  simple  way  of  ensuring  positivity  of  the  inner-product  matrix  is  to  choo 
a  non-singular  matrix  X  and  then  determine  M  using  (2.13).  Again  it  must  be  kept 
mind  that  we  are  dealing  with  an  infinite  set  of  matrices,  X,  one  for  each  sect< 
{ng...nm}.  Also  the  orthonormality  relation  holds  between  vectors  from  two  differe 
sectors  and  we  have  already  used  this  in  writing  (2.10). 

The  completeness  relation  for  the  orthonormal  set  of  states  ||«9...nM;/^»  can  1 
written  in  the  form: 

/=  Z  ZK--«m;/OX<v--»^li  (2.1 


where  1  is  the  identity  operator.  Substituting  from  (2.1  1)  into  (2.14)  and  using  (2.13),  \ 
get  the  resolution  of  the  identity  operator  in  terms  of  the  non-orthonormal  set  of  state 

/=    Z    ZlV-«-;v>(M-l)¥i<«f...«m;A|  (2.14 


It  is  convenient  to  define  the  projection  operator 

**(*.-"*)  =  Z  K---»m;  /*»««,..•  «m;^ll  (2.1 


)vA<V..n(?I;A|  (2.1 

A,v 
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Generalized  Fock  spaces,  statistics  and  algebras 

so  that  we  have 

/=    Z    P(n,...nJ  (2.17) 

Flg...flm 

One  can  easily  verify  the  following  properties  of  the  projection  operators: 

P(w9...wm)||w;..,«;;^»=<5nX...^,c||ws...;;m;/2»  .  (2.18) 

P(nr..nJ\n's...n'm;iiy  =  6nin-t...Snmn-m\ng...nmiiJ.y  (2.19) 

It  is  worth  noting  that  P(n9...nm)  projects  out  any  single  state  not  only  from  the 
orthonormal  set  \\ng.  .  .nm;/*»  but  also  from  the  non-orthonormal  set  \ng.  .  .nm;|t>. 

In  terms  of  the  above  projection  operators  it  is  very  easy  to  construct  the  number 
operators: 

^fc=       Z       nkP(n,...nk...nJ  (2.20) 

ng...nic...nm 

which  satisfy  the  following  properties 

JVJ|n9...nt...nm;/i»  =  nk|!n9...Mfc...wm;/x»  (2.21) 

Nk\ng...nk...nm;ny=nk\ng...nk...nm;ny  (2.22) 

[jVk,jV.]=:0    for  any  /candy  (2.23) 

We  now  introduce  the  transition  operators  which  connect  states,  lying  in  different 
sectors.  Obviously  it  is  enough  to  define  the  so-called  annihilation  and  creation 
operators  c}  and  ct.  We  define 

c)=      Z      ZA^ng---(nj+V---nm'>»'><ng---nj-'-nm'>v\  (2.24) 

ng..,nj...nm  M'V 

and  Cj  as  the  hennitian  conjugate  of  cj,  where  A^^  are  a  set  of  arbitrary  (complex) 
numbers.  Note  that  the  span  of  /*'  is  larger  than  that  of  v  and  this  is  the  reason  for  the 
prime  on  //.  Specifically, 

'-'-*  ''' 


Hence  A  is  a  rectangular  matrix.  Since  4  is  arbitrary  in  general,  the  relation  (2.24) 
provides  the  most  general  definition  of  the  creation  operator.  Even  in  this  general  case, 
it  is  possible  to  verify  the  following  commutation  relation  between  the  number 
operator  defined  in  (2.20)  and  the  creation  operator  defined  in  (2.24) 

[c;,/VJ  =  ~ct<5jt  (2.26) 

The  projection  property  given  in  (2.19)  plays  a  crucial  role  in  the  proof  of  (2.26). 

So  far  we  did  not  specify  how  the  ordered  state  vectors  \n0.  .  .nw;/i>  are  constructed. 
In  fact,  in  general  there  is  no  need  to  specify  any  procedure  for  their  explicit 
construction.  The  formalism  given  so  far  holds  whatever  may  be  the  explicit  form  of 
their  construction.  However,  once  annihilation  and  creation  operators  c  and  cf  are 


cfand  the  vacuum  state  J0>.  This  procedure  has  the  advantage  that  the  arbitrarines; 
the  matrix  A  introduced  in  (2.24)  disappears  and  A  in  fact  gets  determined  in  terms 
M.  Hence  let  us  do  it. 

In  terms  of  creation  operators,  the  ordered  state  (2.2)  for  instance  is  constructed 
follows: 

'li^1*1*!!^---1^^"^^  (2.. 

and  other  states  are  constructed  in  a  similar  fashion.  We  may  introduce  the  notati. 


...C  (2.: 

where  (cj"«.  .  .cj"";^)  is  a  permutation  of  the  creation  operators  similar  to  the  permui 
tions  denned  in  (2.1)~(2.5)  and  (C-  •<?;*<)  is  defined  as  the  hermitian  conjugate 
(cJ"*...C",jt).  For  states  constructed  in  this  manner,  there  exists  a  simple  formi 
connecting  states  in  'adjacent'  sectors.  For  instance, 

Il;.^."^m>  =  c}|l^...l^m>  (2.3 

"g  "m  ng  "m 

More  generally,  we  may  write 


where  1  1;,  na.  .  .n}.  .  .;  A>  on  the  right  is  a  subset  of  states  in  which  one;  quantum  appea 
on  the  extreme  left.  Although  the  total  number  of  states  in  the  set  \n  (n  +  1)  •  A'  > 
5'  given  by  (2.25a),  the  total  number  of  states  in  the  subset  1  1  .,  n  .  .  «-  '  •  I>  is  s  given  t 
(2.25b).  Further  the  states  in  the  subset  |  !,,„,...  „,...;  A)  are  given  the  same  ordin 
number  A  as  m  the  set  \n,..  .«,-.  .  .;  A>.  This  is  possible  since  the  quanta  {n  n  .  }  a 
permuted  among  themselves  without  disturbing  the  extra  /-quantum  sitting  on  tf 
extreme  left. 

We  now  substitute  the  expression  for  c]  given  by  (2.24)  into  the  l.h.s.  of  (2.3  1).  We  ha; 


=         .-»v..nJ.+  l)...;/O  (23: 

/i'v  V*"-'< 

where  we  have  used  (2.8).  On  comparing  with  (2.31),  we  get 

Z^'VM»A  =  ^  (2J3 

From  (2.33)  we  see  that  ^  =  X.   This   means   that  in   (2.24),  only   the  subse 
I  iy,  nt..  .nj...i»  contributes  and  hence  A  is  in  fact  a  square  matrix  and  is  equal  to  th. 
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inverse  of  M~  l: 

A  =  M~l  (2.34) 

So,  we  may  rewrite  (2.24): 

Thus,  A  and  hence  c  and  c>f  are  completely  determined  in  terms  of  the  set  of  M  matrices. 
Next,  consider  the  expression  for  the  number  operator  (eqs.  (2.20),  (2.15)  and  (2.16)): 

...nk...«M;/i||  (2.36) 

ng...tik-..nm         Av 

One  may  try  to  express  this  in  terms  of  c  and  cf.  Using  (2.28)  and  (2.29), 

Nk=       £       nkYJ(cl"s...cl"k...c^m;l)\Qy 

ns...rik---Hm         ^v 

Apart  from  cf  and  c,  the  above  expression  contains  the  vacuum  projector  |0>  <0|.  Later 
we  shall  give  examples  where  |0>  <0|  is  determined  as  products  of  cf  and  c  so  that  Nk  can 
be  expressed  entirely  in  terms  of  c1"  and  c.  However,  in  general,  this  does  not  lead  to 
simple  results,  whereas  (2.36)  provides  us  with  a  universal  representation  of  number 
operators  which  is  valid  in  all  cases. 

So  far,  we  regarded  the  set  of  state  vectors  |  ng,  nh . . .;  fi  >;  fi  =  1 . . .  s  (where  s  is  given  by 
(2.6))  to  be  linearly  independent  and  the  resulting  generalized  Fock  space  is  the 
complete  Fock  space,  which  we  shall  call  the  super  Fock  space. 

We  shall  now  show  how  to  construct  reduced  Fock  spaces.  The  motivation  for  this  is 
that  many  Fock  spaces  of  physical  interest  such  as  the  bosonic  Fock  space  or  fermionic 
Fock  space  are  reduced  Fock  spaces.  There  are  various  ways  of  doing  this.  One  may 
postulate  relationships  between  states  connected  by  permutations,  or  one  may  disallow 
certain  permutations  by  equating  them  to  null  vectors.  Yet  another  way  to  achieve  this 
is  to  use  the  permutation  group  Sn  acting  on  the  «-particle  state.  The  super  Fock  space 
we  have  constructed  consists  of  all  the  representations  of  the  permutation  group.  If  we 
allow  only  certain  representations  of  Sn,  we  get  a  reduced  Fock  space. 

All  these  possibilities  are  contained  in  the  statement  that  in  the  space  of  vectors 
\ng,nh...',fj.y,  there  are  r  null  vectors  (r  <  s) 

r  <  s.  (2.39) 

where  B?  are  constants.  So,  the  dimension  of  the  vector  space  in  the  sector  {ng,  nh...}  is 
reduced  to  d  given  by 

d  =  s~r.  (2.40) 


AH  impuiiauL  «-»- —  ~-  -wv.— ~  * jr 

states  connected  by  permutation  of  indices  will  be  taken  to  be  related  to  each  other 
through  equations  of  the  type  (2.39).  In  other  words,  the  number  of  relations  r  in  (2.39)  is 
A-  -  l  We  shall  call  this  space  as  the  bosonic  Fock  space.  If  we  impose  the  additional 
restriction:  ng  =  0  or  1  only,  the  resulting  space  will  be  called  fermionic  Fock  space.  This 
restriction  can  also  be  stated  in  the  form  of  (2.39): 

lng,nh...^>=Q    for    n,tnh...Z2.  (2-41) 

We  can  define  a  new  reduced  Fock  space,  also  of  dimension  d=lm  each  sector,  by 
taking  the  set  of  all  the  permuted  states  as  null  states  except  a  single  state  (of  a  chosen 
order)  which  is  taken  to  be  the  allowed  state.  We  shall  call  this  as  the  Fock  space  of 

frozen  order. 

Another  important  class  of  reduced  Fock  spaces  are  those  associated  with  para- 
statistics  [35-37],  which  we  shall  call  parabosonic  and  parafermionic  Fock  spaces.  For 
these,  the  number  of  relations  r  is  smaller  than  for  bosonic  or  fermionic  Fock  spaces,  so 
that  the  dimension  d  satisfies 

Kd<s.  (2-42) 

The  formalism  constructed  in  the  present  section  is  valid  for  all  these  reduced  Fock 
spaces  also,  with  the  modification  that  all  the  summations  over  p,  v  etc.  will  now  go  over 
the  range  1 . . .  d  and  correspondingly,-  X,  M  and  A  become  d  x  d  matrices.  There  is  an 
arbitrariness  in  the  choice  of  the  d  states.  Any  choice  of  d  states  will  do,  as  long  as  they 
are  non-null  states. 

All  these  Fock  spaces,  the  super  Fock  space  as  well  as  the  reduced  ones  will  be 
collectively  called  generalized  Fock  spaces.  To  sum  up  this  discussion  we  may  note  that 
a  generalized  Fock  space  is  completely  defined  by  stating  what  are  the  allowed  states  of 
the  system. 

We  shall  define  statistics  by  the  precise  relationship  linking  states  obtained  by 
permutation.  In  general,  many  relationships  can  be  envisaged  and  hence  many  different 
forms  of  statistics  can  reside  within  a  particular  reduced  Fock  space.  However,  in  super 
Fock  space,  all  the  states  obtained  by  permutation  are  independent  and  so  there  is  a 
unique  statistics  associated  with  this  Fock  space,  namely  'infinite  statistics'.  Similarly,  in  the 
Fock  space  of  frozen  order  too,  there  exists  only  a  single  statistics,  named  'null  statistics1. 

In  this  section  we  started  with  the  generalized  Fock  space  consisting  of  the  set  of 
allowed  states  of  the  system  and  constructed  the  creation,  annihilation  and  number 
operators  in  terms  of  the  outer  products  of  state  vectors.  Do  these  c  and  c*  form  an 
operator  algebra?  In  general,  c  and  cf  constructed  in  this  way  may  not  form  a  simple 
algebra,  or  even  a  closed  algebra.  Historically,  it  is  the  reverse  route  that  has  been 
followed;  one  postulates  an  algebra  of  c  and  cf  and  then  deduces  the  states  allowed  by 
the  algebra.  In  this  sense,  a  given  relation  involving  c  and  c*  implicitly  defines  an  inner 
product  and  through  it  specifies  the  allowed  and  the  null  states  of  the  system  In 
practice,  starting  with  an  algebra  is  an  easier  procedure  and  we  shall  use  it  in  the  latter 
sections.  Actually,  it  is  complementary  to  the  approach  described  in  this  section 

Therefore  we  have  two  equivalent  ways  of  dealing  with  the  generalized  Fock  spaces. 
In  the  first  approach,  which  we  have  formulated  in  this  section  we  construct  c-  and  c'  in 
terms  of  the  allowed  states  of  the  system  and  the  algebra  of  c  and  ct  is  then  a  derived 
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consequence.  This  is  the  more  fundamental  approach  and  is  of  universal  validity.  In  the 
second  approach,  which  we  have  used  in  the  latter  sections,  we  start  with  an  algebra  of 
c  and  cf  and  then  determine  the  states  of  the  system  allowed  by  the  cc*  algebra.  Since  the 
res-  --trictions  on  the  allowed  states  of  the  system  can  generally  be  stated  in  the  form  of 
cc  relations,  the  first  approach  can  be  characterized  as  cc-+ccf  while  the  second  is 
ccf->cc.  Within  the  second  approach,  an  elegant  method  to  derive  cc  relations  from  cc1" 
algebras  will  be  described. 

For  infinite  statistics,  there  is  no  restriction  on  the  allowed  states  and  cc  relations  do 
not  exist.  Hence,  for  this  statistics,  the  first  approach  should  be  interpreted  as  'no  cc 
relation'  -+  cc1"  algebra  while  in  the  second  approach,  starting  with  any  particular  cc1" 
algebra  describing  infinite  statistics  one  shows  that  there  does  not  exist  any  cc  relation. 
In  those  cases  where  the  ccf  algebra  depends  on  a  continuous  parameter  g,  one  can 
determine  the  values  of  q  where  cc  relation  exists.  Generally,  these  values  of  q  corre- 
spond to  the  boundary  of  the  region  in  the  parameter  space  where  infinite  statistics  with 
positive  definite  M  exists.  On  this  boundary,  one  or  more  eigenvalues  of  the  M  matrices 
become  zero,  thus  leading  to  the  emergence  of  the  same  number  of  null  vectors  in  the 
Fock  space  which  can  equivalently  be  interpreted  as  the  emergence  of  cc  relations.  Thus 
the  formalism  unifies  infinite  statistics  residing  on  the  super  Fock  space  with  the 
various  forms  of  statistics  residing  on  reduced  Fock  spaces. 

It  must  be  noted  that  the  inner  product  matrices  M  occurring  in  (2.35)  are  quite 
arbitrary.  Consequently,  more  than  one  realization  or  representation  of  creation  and 
destruction  operators  is  possible.  In  fact,  it  is  this  freedom  to  select  arbitrary  M  which 
enables  one  to  construct  different  algebras  involving  c  and  c\  all  operating  over  the 
same  Fock  space. 

To  sum  up,  we  construct  a  three-tiered  structure  consisting  of  Fock  space,  statistics 
and  algebra.  Fock  space  is  specified  by  the  set  of  the  allowed  states  of  the  system. 
Statistics  is  defined  by  the  nature  of  the  symmetry  of  the  allowed  states  under 
permutation.  Algebra  of  the  creation  and  destruction  operators  is  determined  by  the 
choice  of  the  inner  product  matrices  M. 

Different  representations  of  infinite  statistics  are  constructed  in  §  3.  In  the  bosonic 
and  fermionic  Fock  spaces,  many  forms  of  quantum  statistics  which  include  Bose  and 
Fermi  statistics  are  possible.  These  and  the  null  statistics  in  the  Fock  space  of  frozen 
order  are  taken  up  in  §  4. 

3.  Super  Fock  space  and  infinite  statistics 

3.1  The  standard  representation  (M  =  1) 

Having  taken  all  the  s  states  to  be  independent,  the  simplest  choice  of  the  matrix  X  is 
the  unit  matrix  which  implies  M  and  A  also  to  be  unit  matrices  (of  appropriate 
dimensions)  for  all  the  sectors  {ng,nh. . .}: 

X  =  M  =  A  =  1.  (3.1) 

For  this  choice,  which  we  shall  call  the  standard  representation  of  infinite  statistics,  the 
ordered  states  (2.5)  themselves  form  an  orthonormal  set: 

<nfl,  nh. . . ;  /*|  ng,  nh. . . ;  v>  =  6^.  (3.2) 
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Further,  the  creation  and  annihilation  operators  are  given  Dy 

c}=    Z    Z  1 1j>/W--;j"><>V>V ••''<"!  (3.3 

ns,nh...   ft 

Cj=    Z    I,\ng,nh...;n><lj,ng,nk...;n\  (3.4 


From  (3.3)  and  (3.4),  using  the  orthogonality  and  completeness  relations,  one  can  derive 
the  trf  algebra: 

c,cj  =  $y  (3.5 

This  algebra  was  first  proposed  by  Greenberg  [13],  in  an  important  paper  on  infinite 
statistics. 
In  the  standard  representation,  we  also  have  the  useful  identity: 

(3.6] 


Although  this  identity  can  be  obtained  from  (3.3)  and  (3.4)  that  define  cf  and  c,  it  is 
important  to  note  that  within  Fock  space  it  follows  from  the  cc1  algebra  (3.5).  We  shall 
prove  it  by  showing  that  (3.6)  is  valid,  when  applied  on  any  state  in  the  super  Fock 
space.  First  applying  on  1  0>,  we  find  that  both  sides  are  zero.  Next  we  apply  on  the  other 
states  cjc}...  1  0>: 


(3.7) 


Using  (3.5)  and  noting  <0|c]  =  0  we  see  that  the  left  side  of  (3.7)  infact  vanishes,  thus 
completing  the  proof  of  (3.6). 

In  a  different  context,  Cuntz  [38]  had  studied  the  algebra  defined  by  (3.5)  and  by  the 
relation: 

V  c' c  =1  fl  o\ 

^S-h-      l-  (3.8) 

2 

Cuntz  algebra  is  inconsistent  with  Fock  space,  as  can  be  seen  by  applying  both  sides  of 
(3.8)  on  |0>.  In  contrast,  our  ^.6),  because  of  the  inclusion  of  the  vacuum  projector  term 
in  it,  is  consistent  with  Fock  space  and  is  infact  a  consequence  of  the  algebra  of  c  and  cf 
in  the  standard  representation  of  infinite  statistics. 
Putting  M  =  1,  the  number  operator  Nk  given  in  (2.38)  becomes 

Nk=      Z      wkE(cI"*---crfc---cIi"m;M)|0><0|(c^"...cJ[fc...cJ«;Ai)  (3.9) 

«g...nfc...nm         fi 

Substitution  of  |0><0|  from  (3.6)  into  this  equation  and  a  straightforward  but  tedious 
calculation  finally  leads  to 

Nk  =      Z      Z  (cl"'-  •  •<#*. .  .cj";  ii)clck(d£. .  .c£. .  .cfm)  (3.9a) 


ng...nk...nm 


This  expression  is  identical  to  Greenberg's  formula  for  Nk  [13],  though  it  is  written  in 
a  different  form.  This  illustrates  the  derivation  of  the  representation  of  Nk  in  terms  of 
c  and  c  .  As  already  mentioned,  in  general  neither  the  procedure  nor  the  result  is  simple 
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ind  in  any  case  one  does  not  need  it.  The  universal  representation  of  Nk  given  in  (2  36)  is 
ufficient.  '  ' 

i.2    q-mutators  with  real  q 

dany  other  choices  of  M  are  possible.  A  particularly  interesting  choice  is  the  one  in  which 
kf  is  given  as  a  function  of  a  real  parameter  q.  Consider  the  inner  product  between  the 
i-particle  state  vectors  with  all  occupation  numbers  unity:  |  lg,  lh.  .  .;  ^  The  inner  product 
aatrix  M  in  this  sector  has  dimension  nl  x  «!  and  its  matrix  element  is  taken  to  be 

<1(/>U--.;Ai|lfl,lh...;v>  =  ^,  (310) 

/here  q  is  a  real  number  lying  in  the  range  -  1  <  q  <  +  1  and  J  is  the  number  of 
aversions  required  to  transform  the  state  \lg,  lh...;v>  into  the  state  |1  ,lh...;^>. 
dumber  of  inversions  is  the  minimum  number  of  successive  interchanges  between 
djacent  quanta  that  will  take  the  state  |  lg,  !„.  .  .  ;  v>  to  |  lg,  lh.  .  .  ;  /*>.  For  example, 

<lfcVJlfflfclfc>  =  g  (3.11) 

OAgy*lfc>  =  <?2.  (3.12) 

The  positivity  of  the  ^-dependent  M  matrices  defined  above  has  been  proved  by  Fivel 
15]  and  Zagier  [16].  In  particular,  Zagier  has  given  the  explicit  form  of  the  determi- 
ant  of  the  (n\  x  n!)  dimensional  M  matrix  for  arbitrary  n: 


detM=  H    [i_q*<*+i>]("-«»Wk+i>  (3  13) 

k=i 

it  <j  =  0,  M  is  the  unit  matrix  and  so  is  positive-definite.  Hence,  it  will  remain 
ositive-defmite  in  —  1  •<  q  <  1,  if  det  M  has  no  zeroes  there.  According  to  (3.13),  for 
2al  q,  zeroes  of  det  M  occur  only  at  q  =  +  1,  thus  proving  the  positive-definiteness  of 
4  in  the  range  —  1  <  q  <  1. 

The  inner  product  for  states  with  occupation  numbers  larger  than  unity  (which  are 
le  same  as  states  with  repeated  indices)  is  obtained  from  the  above  inner  product  for 
tates  with  distinct  indices  by  symmetrizing  with  respect  to  the  repeated  indices.  For 
xample,  consider 

<2fljl.lml,>  =  <l.lfljl.lml,>  (3.14) 

.eplace  one  of  the  #'s  by  h  in  both  the  initial  and  final  states  and  thus  get  a  matrix 
[ement  with  distinct  indices  which  can  be  calculated  using  (3.10).  This  replacement  can 
e  done  in  (2!)2  ways.  The  sum  of  these  (2!)2  matrix  elements  divided  by  2!  is  the  required 
nswer.  Thus, 


=  q  +  q2  (3.16) 

is  clear  that  the  matrix  M  for  reoeated  indices  is  obtained  from  the  higher 
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It  is  worth  noting  that  the  norm  for  rc-particle  state  with  all  the  indices  repeated 
<n,l«5>-[»A=sW«Cn*-1V-C2],[l],  (3. 

where  the  '^-number'  [ri]q  is  defined  by 


Inverting  these  M  matrices  and  defining  the  creation  operator  through  (2.35),  c 
can  derive  the  algebra  of  c  and  c*.  This  is  more  difficult  than  the  reverse  procedi 
which  is  the  way  this  subject  really  developed.  Greenberg  [14]  proposed  the  <?-muta 
algebra: 

CjcJ-gctc^y  (3. 

The  ^-dependent  M  matrices  defined  through  (3.10),  (3.15)  and  (3.16)  follow  from  t 
algebra.  Fivel  and  Zagier  then  proved  the  positivity  of  these  M  matrices  : 
-  1  <  q  <  +  1.  In  spite  of  the  fact  that  we  have  not  derived  (3.19)  from  the  M  matrii 
defined  here,  we  can  assert  its  validity  because  the  M  matrices  completely  determi 
c  and  cf.  It  is  in  order  to  emphasize  this  point  that  we  have  presented  the  M  matrii 
first  and  the  algebra  of  c  and  cf  as  a  derived  consequence. 

3.3  New  representations  of  infinite  statistics 
First  we  briefly  consider  the  2-parameter  algebra 

W]  -  ql  cfa  -  q2du  £  c\ck  =  5{j  (3.: 

k 

where  ql  and  q2  are  real  parameters.  This  may  be  regarded  as  another  representation 
infinite  statistics.  Although  the  determination  of  the  full  region  in  the  {q1,q2}  pa: 
meter  space  for  which  M  is  positive  definite  is  still  an  unsolved  problem,  one  can  sh< 
[19]  positivity  of  M  on  the  straight  line  defined  by  ql  =  0;  —  1  <  q2  <  oo.  Infact  il 
possible  to  map  this  whole  line  on  to  the  point  ql  =  0;  q2  =  0  by  a  redefinition  of  c  anc 
so  that  we  just  get  back  the  standard  representation  defined  by  (3.5). 
We  next  present  two  new  algebras: 


and 

c/cj-  0-^  =  0    for    i 

C.ct-CfC=DNl 
cici        cici       V 

where  p  is  a  real  parameter  and  Nt  are  number  operators  already  written  down  in  §  2. 
these  two  algebras,  the  first  one  (3.21)  is  based  on  ordered  indices,  that  is,  given  any  ti 
indices  i  and;',  one  must  be  considered  larger  or  smaller  than  the  other.  So,  we  may  ta 
the  indices  to  be  the  natural  numbers  1,  2,  3  ... 

Equations  (3.21)  and  (3.22)  are  again  representations  of  the  same  infinite  statist! 
for  it  is  possible  to  map  both  these  algebras  on  to  Greenberg's  g-mutator  algebra 
(3.19)  with  the  following  identification  of  the  parameters:  p  =  q~1.  Temporar 
renaming  the  (c,  cf)  of  (3.21)  and  (3.22)  by  (b,  tf)  and  (d,  ^)  respectively,  the  mapping 

mo  ™  —  ..       _,.._.._  ____ 
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ansformation  equations  are 


4-  *""'"•  c,.  (3.24, 

ence,  it  is  clear  that  both  the  algebras  of  (3.21)  and  (3.22)  lead  to  positive  definite 
r  matrices  for  p  >  -h  I  and  p  <  —  1  (corresponding  to  —  1  <  q  <  +  l). 
An  interesting  feature  of  the  algebra  (3.21)  is  the  validity  of  ordinary  commutation 
lation  between  c(  and  ct  for  i  ^j  as  in  Bose  statistics;  nevertheless  the  full  algebra  (3.21) 
ascribes  infinite  statistics! 

In  contrast  to  the  situation  for  the  g-mutator  algebra  of  (3.19),  where  the  number 
aerators  can  be  expressed  in  terms  of  c  and  cf  only  after  considerable  algebraic 
anouvers  [16,17]  and  the  resulting  expressions  are  quite  complicated,  the  corre- 
>onding  expressions  for  the  algebras  (3.21)  and  (3.22)  are  simple.  For  (3.21),  we  get 
iking  p  >  0) 


<k 


k<t 


(325) 
or  (3.22),  the  number  operator  is  even  simpler: 

N  =  —L-  log(c,cf  -  etc,-)    for  all  i.  (3.26) 


i  spite  of  our  ability  to  write  down  such  formal  expressions  for  JV£  in  terms  of  c's  and 
s,  we  must  also  point  out  that  they  are  not  of  much  use.  All  that  one  ever  needs  of  the 
jmber  operators  are  the  properties  contained  in  (2.21)-(2.23)  and  (2.26)  and  as  for 
;plicit  representation,  (2.36)  and  (2.37)  will  do. 

4  q-mutators  with  complex  q 

re  now  generalize  Greenberg's  g-mutator  algebra  to  complex  q.  This  generalized 
gebra  is  based  on  ordered  indices  and  is  defined  by  the  following  equations. 

CfC}  —  gcjc,  =  0    for    i<j  (3.27) 

C|cl-pclc,=  l  (3-28) 

here  q  and  p  are  complex  and  real  parameters  respectively  and  the  indices  ij  etc.  refer 
any  of  the  natural  numbers  1,  2,  3,  ....  The  relation  for  the  opposite  order  i  >j  is 
irivable  from  (3.27)  by  hermitian  conjugation: 

c,.cj  -  q*  c]Ci  =  0    for     i>j  (3-27') 

id  so  it  is  not  an  independent  relation. 

Let  us  now  calculate  the  inner  product  matrix  for  this  algebra.  For  distinct  indices, 


A  K  Mlshra  and  G  Rajasekaran 
2  1 


1 


Figure  1.    Inversion  diagram  for  the  permutation  (2  13) 
Positive  inversions:  (  1  ,  3)  -»  (3,  1)  and  (2,  3)  -»  (3,  2). 
Negative  inversion:  (2,  1)  -*  (  1,  2). 


(3  12). 


where  J+  and  J  _  are  the  number  of  positive  and  negative  inversions  in  the  permutation 
v  -» j«.  The  total  number  of  inversions  (the  sum  of  positive  and  negative  inversions)  is  the 
same  as  the  number  of  inversions  already  defined  below  (3.10).  We  further  define  an  inver- 
sion as  positive  if  it  is  a  transposition  of  indices  from  the  ascending  order  to  the  descending 
order  and  as  negative  if  it  is  the  reverse  transposition.  For  example,  (1,2)-*  (2, 1)  is 
a  positive  inversion  while  (2, 1)  -*•(!,  2)  is  a  negative  inversion.  Thus  we  have 

<l3M2ll2M3>  =  (<7*)24  (3.29;) 

since  the  permutation  (21 3)-*  (3 12)  contains  two  positive  and  one  negative  inversions 
as  shown  in  figure  1. 

The  relationship  between  the  algebra  defined  by  (3.27)  and  (3.28)  with  complex  q  (and 
p  =  \q\)  and  Greenberg's  algebra  defined  by  (3.19)  with  real  q  can  be  given.  Calling  the 
creation  operators  for  the  former  and  latter  algebras  as  c*(q)  and  c\  \  q  \ )  respectively,  the 
relationship  is 


(3.30) 


where  6  is  the  phase  of  q: 


S  =  k|ew  (3.31) 

and  Nk  are  the  number  operators  defined  in  (2.37). 

As  a  consequence,  the  inner  product  matrices  for  the  two  algebras  are  related  by 
0-dependent  unitary  matrices: 


M(q)=r(0)M(\q\)T(0) 


(3-32) 
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We  have  already  given  in  (3.13),  Zagier's  result  for  the  determinant  of  M  for  real  q.  For 
our  algebra  with  complex  q,  because  of  (3.32),  the  determinant  of  M  for  n  particles  with 
distinct  indices  is 

n-  1 

Zeroes  of  detMn(g)  occur  only  on  the  circle  \q\  =  1.  Hence,  Mn(q)  for  complex  q  is 
positive  for  \q\  <  1,  and  we  have  thus  extended  the  Fivel-Zagier  result  for  positivity  to 
complex  q  (for  the  case  p  =  \q\). 

In  contrast  to  the  situation  for  Greenberg's  algebra,  the  inner  product  matrix  M  for 
states  with  repeated  indices  cannot  be  derived  from  that  for  states  with  distinct  indices, 
for  the  present  algebra.  For  n-particle  states  with  all  the  indices  repeated,  the  norm  is 
the  same  as  in  (3.17)  and  (3.18),  with  q  replaced  by  p: 

which  is  positive  for  p  >  —  1.  For  states  with  only  some  indices  repeated,  M  is 
a  function  of  both  q  and  p  and  the  problem  is  more  complicated;  but  we  have  verified 
positivity  of  the  M's  up  to  3-particle  states  for  —  1  <  p  <  \  q  \  ~  2. 

If  we  choose  p  =  —  1,  all  states  with  repeated  indices  are  forbidden.  This  is  just  Pauli's 
exclusion  principle.  Thus,  the  algebra  defined  by  (3.27)  and  (3.28)  with  p  =  —  1  leads  to 
infinite  statistics  with  exclusion  principle. 

Infact,  we  can  restrict  both  (3.5)  and  (3.19)  to  i  ^j  and  for  i  =  j  replace  them  by 

c.ct  +  CJC.  =  i  (3.36) 

In  all  these  cases,  we  have  infinite  statistics  with  exclusion  principle. 

A  compilation  of  the  various  algebras  all  representing  infinite  statistics  is  given  in 
table  1. 

3.5  Unitary  transformations 

We  must  point  out  an  important  difference  between  the  algebras  described  by  (3.5), 
(3.19)  and  (3.20)  on  the  one  hand  and  the  algebras  of  (3.21),  (3.22),  (3.27)  and  (3.28)  on  the 
other  hand.  It  is  easy  to  see  that  the  former  are  covariant  under  the  unitary  transform- 
ations on  the  indices: 

ck-+J]Uk  cm\     U*U  =  UU*=1.  (3.37) 

m 

In  fact, onei  can  show  [19]  that  under  certain  conditions  (3.20)  is  the  most  general 
bilinear  algebra  ofcf  and  c]  invariant  under  this  unitary  transformation.  This  property 
is  violated  for  the  complex  <?-algebra  of  (3.27)  and  (3.28)  as  well  as  the  algebras  of  (3.21) 
and  (3.22).  Covariance  under  the  unitary  transformation  is  desirable  in  a  general 
context  since  it  is  closely  connected  to  the  superposition  principle  in  quantum 
mechanics  [3].  Nevertheless,  algebras  violating  this  requirement  have  been  proposed 
in  the  recent  literature,  either  because  of  the  possibility  of  applications  to  specific 
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Table  1.     Representations  of  infinite  statistics. 


Statistics 

Representation 

Algebra 

Infinite 

Standard  representation 
g-mutator  (with  real  q) 
Two-parameter  algebra 

^]  =  ^ 
cic]-qc}ci  =  &l} 

M-wfa 

q-mutator,  transformed  c^-^.c^d^p2-"''  "P 

4-mutator,  transformed  V]  -p~lc]c(  =  0,    for  i 


g-mutator  (with  complex  q)       fcfc]-<5fc]c,  =  0,     for  /<./ 

1  ,-,  /!•_  nr+<-  —  1 
(.cici       Pc;c-i  ~  l 

Infinite  with 
Pauli  principle  Standard  representation 


4-mutator  (with  real  q)  jc,.c]-£/c]c,.  =  0,     for  i^j 

lciC;  +  c-!c-.=  l. 
g-mutator  (with  complex  q)      Jcjct-q'c]fl-  =  0,     for  i<j 


systems  in  condensed  matter  physics  or  because  of  mathematical  motivation.  Hence 
we  include  such  algebras  in  our  investigation. 

3.6  Thermodynamics 

A  detailed  treatment  of  thermodynamic  aspects  is  outside  the  scope  of  the  presen 
paper.  Nevertheless  a  brief  statement  is  in  order. 
Consider  the  partition  function  for  the  canonical  ensemble: 

Z=  Tre-"H  (3.38 

where  /?  =  (kT)~ l  and  H  is  the  Hamiltonian  of  the  system.  If  the  system  consists  c 
non-interacting  particles,  we  have 


where  N,-  are  the  number  operators  and  £,.  are  the  single-particle  energies.  Hence,  usin. 
the  orthonormal  set  || . . .«,-.. . ;  ju»  in  the  evaluation  of  the  trace  in  (3.38),  we  get 

Z=    £    d(...nj...)e~p*l*lllt  (3.4C 

Jii.iu... 

where  d(...Hj...)  is  the  dimension  of  the  sector  {... «,...},  defined  in  (2.40),  and  th 
summation  is  over  all  the  allowed  occupation  numbers  n{  for  all  /. 

All  the  thermodynamics  of  the  system  can  be  derived  from  the  partition  functior 
Further,  the  set  of  d(. .  .iij. . .)  on  the  r.h.s.  of  (3.40)  provide  invariant  characteristics  c 
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he  Fock  space  that  are  not  dependent  on  the  particular  statistics  or  algebra  living  in 
hat  Fock  space.  Hence,  thermodynamics  is  the  same  for  all  forms  of  statistics  and 
Igebras  residing  in  a  given  Fock  space. 

For  super  Fock  space  and  the  associated  infinite  statistics,  for  which  d  is  equal  to 
defined  by  (2.6): 

!;    «=I>,  (3.41) 

he  thermodynamics  is  independent  of  the  particular  representation  or  algebra  of  c  and 
:f.  In  particular,  if  one  uses  the  q-mutator  algebra  (3.19)  to  describe  infinite  statistics, 
he  properties  of  the  system  in  thermodynamic  equilibrium  are  independent  of  q, 
>rovided  q  lies  in  the  range  —  1  <  4  <  1.  Greenberg  [14]  had  originally  envisaged  small 
Eolations  of  Fermi  or  Bose  statistics  by  choosing  q  =  +  (1  -  e),  e  small  and  positive.  We 
;ee  that  equilibrium  thermodynamics  will  not  manifest  such  small  violations. 

The  expression  for  d  given  by  (3.41)  is  the  same  as  Boltzmann  counting  and  hence 
jreenberg  has  called  infinite  statistics  as  'quantum  Boltzmann  statistics'.  However, 
dnce  the  Gibbs  factor  1/n!  is  missing,  the  statistical  mechanics  of  a  system  of  free 
jarticles  obeying  infinite  statistics  with  the  index  i  in  (3.40)  interpreted  as  momentum 
vill  suffer  Gibbs  paradox  £20].  Therefore,  the  super  Fock  space  and  the  associated 
nfinite  statistics  cannot  be  naively  applied  to  familiar  physical  systems.  In  our 
ipproach,  super  Fock  space  plays  the  role  of  a  mathematical  template  which  can  be 
ised  for  carving  out  various  physical  systems. 

As  long  as  one  takes  the  Hamiltonian  to  be  of  the  form  (3.39)  which  is  the  only  correct 
brm  for  non-interacting  particles,  the  statistical  mechanics  and  thermodynamics  of  the 
system  are  independent  of  the  algebra  of  c  and  cf.  One  can  then  construct  interaction 
:erms  involving  c  and  ^  as  in  the  usual  many-body  theory  and  their  effect  will  of  course 
iepend  on  the  algebra. 

Recent  literature  contains  many  calculations  on  the  statistical  distributions  or  other 
:hermodynamic  quantities  for  the  Hamiltonian 

H=5>ctCl.  (3.42) 

i 

tfith  c  and  c1'  satisfying  some  ^-deformed  algebra.  Since  c[c{  is  not  a  number  operator  in 
general,  the  physical  meaning  of  the  Hamiltonian  (3.42)  is  not  a  priori  clear,  although 
Dne  may  suppose  that  it  takes  into  account  some  type  of  interactions. 

4.  Statistics  and  algebras  in  bosonic,  fermionic  and  frozen  Fock  spaces 

We  introduce  two  types  of  reduced  Fock  spaces  in  subsection  4.1.  In  one,  we  take  any 
two  states  obtained  by  permutation  to  be  related  to  each  other  and  we  allow  all 
occupation  numbers:  nt  =  0, 1, 2, . . .  for  all  i.  In  the  other,  any  two  states  obtained  by 
permutation  are  again  related  to  each  other  but  occupation  numbers  are  restricted  by 
sxclusion  principle:  nf  =  0  and  1  only.  We  shall  call  the  former  as  bosonic  Fock  space 


bosonic  and  fermionic  Fock  spaces  is  to  take 

|«1,n2...;/i>  =  4Jlnl5rt2...;l>  (4.1) 

where  q  is  a  complex  number  and  J  is  the  number  of  inversions  in  the  permutation: 
1  —  »•  jj..  The  number  of  inversions  was  already  defined  in  §  3.  Although  J  was  defined 
there  for  the  case  in  which  the  occupation  numbers  were  restricted  to  unity,  the  same 
definition  can  be  extended  to  larger  occupation  numbers.  We  assume  q  •&  0.  The  case 
q  =  o  will  be  considered  separately.  Many  different  forms  of  statistics  as  well  as  various 
algebras  of  c  and  cf  can  be  shown  to  be  contained  as  particular  cases  of  (4.1). 

We  first  make  some  general  remarks  amplifying  the  meaning  of  (4.1).  This  equation 
can  be  obtained  by  repeated  application  of  the  elementary  relation: 

|...11.1J,..>  =  9|....1J1(....>,    for    i>;  (4.2) 

where,  except  for  the  two  adjacent  quanta  of  indices  i  and  j  which  are  interchanged,  all 
other  quanta  are  left  unchanged.  In  (4.2),  we  have  used  the  same  notation  for  the  state 
vectors  as  on  the  r.h.s.  of  (2.1)-(2.4).  It  is  clear  that,  for  any  q  other  than  +  1,  (4.2)  makes 
sense  only  if  the  indices  i  andj  are  ordered  through  an  inequality  (to  be  specific,  i  >  j). 
Hence  we  have  taken  the  indices  to  be  the  natural  numbers  1,  2,  3  ...  in  writing  (4.1). 

The  relationship  among  the  state  vectors  given  in  (4.1)  or  (4.2)  can  be  equivalentiy 
expressed  as  a  quadratic  relation  between  two  creation  operators  or  two  annihilation 
operators.  For  any  state  |.  .  .>,  using  (2.30)  or  (2.31)  we  have 


cjc}|...>  HI,  I,...) 
Comparing  with  (4.2),  we  get 

ctc]|...>  =  gctct|...>,    for    i>j 
Since  this  is  valid  for  any  state  |.  .  .),  we  can  write 

c\c]  -  qc]c\  —  0    for    i>j 
or, 

cjci-q*cicj  =  ^    for    i  >;.  (4.3) 

The  above-ct  relation  (eq.  4.3)  is  common  to  both  bosonic  and  fermionic  spaces.  But 
for  the  fermionic  space  there  exists  the  additional  restriction: 

ctcJ^O,    or    c,.c,  =  0.  (4.4) 

All  the  states  \nitn2...-tfj.y  for  fixed  occupation  numbers  (nltn2...)  but  different 
values  of  ji  being  related  to  each  other  through  (4.1),  the  dimension  of  the  vector  space 
in  any  sector  (nltn2...)is  reduced  to  unity.  So,  for  each  sector  we  choose  one  standard 
vector  |  «!  ,  n2.  .  .  ;  1>  which  we  rewrite  as  |  «x  ,  n2  .  .  .>,  dispensing  with  p.  completely. 

The  matrices  X  and  M  then  become  numbers  related  by 


(4.5) 
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Equation  (2.11)  becomes 

and  we  have 

«nj,  ,n2...||n'1,n'2...))  =  <5()  n.  <5n 
From  (2.35),  we  get 


as 


(4.6) 

(4.7) 


=     I     ^t<JHI\nltn2...(nj+  I)..  ><B,,n2.  ..nj..|M-l(nMn,...) 

n,...Mj... 

where  we  have  pushed  the;  quantum  to  the  right  of  all  i  quanta  for  i  <j  using  (4.2).  Also 
using  (4.5)  and  (4.6), 


(4.8) 

Equation  (4.8)  and  its  hermitian  conjugate  define  the  most  general  form  of  creation  and 
destruction  operators  for  the  bosonic  and  fermionic  Fock  spaces. 
If  we  assume  factorization  of  the  norm  M  as  well  as  that  of  X: 


then  we  get 


c  •  ~~ 


(4.9) 


(4.10) 


(4.11) 


We  now  construct  the  operator  algebras  for  c*  and  c  given  by  (4.  10)  and  (4.  1  1  ).  First  of 
all,  for  any  form  of  </>(«;),  we  get 


For  i  = j,  the  algebra  depends  on  the  choice  of  q  and  <p  («,-).  We  find 

2 


(4 


(4.13) 


(4.14) 


where  N;,  N  -.  .  .  are  number  operators  and  0(N{)  is  the  function  introduced  in  (4.9).  So 


we  can  write  the  generalised  commutation  relation 


-p 


(4.15) 


where  p  is  any  (complex)  number.  Equations  (4.12)  and  (4.15)  constitute  the  ccf  algebra 
for  ^-statistics  defined  by  the  cc  algebra  of  (4.3). 

What  we  have  derived  above  can  be  regarded  as  the  most  general  algebra  of 
creation  and  annihilation  operators  for  ^-statistics,  subject  to  the  assumption 
of  the  factorization  of  the  normalization  factor  implied  by  (4.9).  In  particular,  since  the 
function  (f>(Nj)  occurring  in  (4.15)  is  arbitrary,  we  may  regard  our  equations  as 
constituting  an  infinite-parameter  deformation  of  the  algebra  of  c  and  c*.  Exploiting 
the  arbitrariness  of  4>(Nj)  we  can  get  many  simpler  forms  of  the  algebra  which  we 
now  describe. 

(i)  Choosing  0(n7)  to  be  a  constant  independent  of  n,-  but  noting  that  $(  —  1)  must 
vanish  [38a],  (4.15)  becomes 


where  <5^.i0  is  the  Kronecker  delta,  which  can  also  be  represented  as  si 
(ii)  Putting  p  =  0  in  (4.16)  we  get  the  simple  algebra: 

Cjc]  =  \q\2^N<  '  (4j7) 

(iii)  As  a  third  possibility,  we  may  choose  f(Nj)  in  (4.15)  to  be  unity  and  p  real  so  that 
we  have  the  algebra: 


This  choice  requires  that  the  function  <j>(n)  must  satisfy  the  equation 


-P 


=  1 


(4.18) 


(4.19) 


The  solution  of  this  functional  equation  (with  the  constraints  that  0(0)  is  finite  and 
<j>(—  1)  vanishes)  is 


where 


(4-20) 


p  — 


(4-21) 


For  p  =  1,  the  >numbers'  in  (4.20)  and  (4.21)  become  ordinary  numbers 

(ll}  ^JZ1  uh°iCe  WC  PUtp  =  k|2  in  <4-18H4.21).  As  a  consequence  of  this  choice,  the 
nh.s.  of  (4.18)  becomes  a  bilinear  function  of  c  and  c\  For,  from  (4.14),  we  can  prove 


an 


(\q\2-l) 
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=  Z>l2Mi(l<7l2Nk-l) 

=  '*'    f<J  '~l  (4.22) 

where  we  have  used  (4.20)  and  (4.21)  with  p  =  \q\\  Hence  (4.18)  can  be  replaced  by 

J    J  }    1  '    L^      k    k  (4.23) 

Any  one  of  (4.16),  (4.17),  (4.18)  or  (4.23)  together  with  (4.12)  constitute  a  cc+  algebra 
and  one  can  construct  many  more  examples  of  ccf  algebras,  all  of  which  correspond  to 
the  same  ^-statistics  specified  by  (4.3).  We  have  so  far  assumed  that  the  occupation 
numbers  are  unrestricted:  n,.^0  and  so  these  algebras  are  valid  for  the  bosonic 
g-statistics,  namely  ^-statistics  in  the  bosonic  space. 

The  fermionic  ^-statistics  is  defined  by  (4.3)  and  (4.4).  In  this  case,  n  -  in  (4.10)  and 
(4.1 1)  can  take  the  value  0  only,  while  all  the  other  occupation  numbers  will  range  over 
0  and  1.  So,  the  only  arbitrary  parameter  that  enters  the  definition  of  c  and  cf  is 
r  =  0(0)/0(1).  The  ccf  relation  for  i  ^j  is  still  the  same  as  in  (4.12)  while  the  c  cf  relation 
of  (4.15)  can  be  simplified  to 


Further,  we  can  generally  prove  the  fermionic  analogue  of  the  identity  which  was 
earlier  proved  in  (4.22)  for  the  bosonic  case  only  for  a  special  choice  of  <p(n): 

(l<z!2-D  Z  4c,  =  (k!2-i)  Z  I«l2r"-*|if  *Wfc>1 

k<j  k<j 

=  Z  \q\2*i<kNi(\q\2Nk-V\r\2  (4.25) 

*</ 


Hence,  (4.24)  can  be  rewritten  as 

Cjc}-pc]cj={\r\2  +  (\q\2-l)  Z  4^}(^.0-P^-,i)  (4-26) 

k<j 

Equations  (4.12)  and  (4.26)  constitute  the  general  algebra  for  the  fermionic  ^-statistics. 
In  contrast  to  (4.  1  5)  of  the  bosonic  case,  the  fermionic  case  does  not  have  the  freedom  of 
infinite-parameter  deformation. 

On  the  r.h.s.  of  (4.26),  apart  from  the  curly  bracket  {  }  which  is  bilinear  in  c  and  c\ 
there  are  Kronecker  deltas  which  depend  on  the  operator  Nj.  A  simpler  relation  is 
obtained  for  the  choice  p  =  -  1,  since  <5iVji0  +  5Njtl  =  1.  We  then  have 

Cjc]  +  c}Cj  =  |r|2  +  (\q\2  -  1)  Z  ctc*  (4'27) 

k<j 

A  further  choice  |r|2  =  1  gives 

cjc]  +  c]cj=\+(\q\2-l)^clck  (4-28) 

k<j 

which  is  the  fermionic  analogue  of  (4.23). 
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The  algebra  given  by  (4.3),  (4.12)  and  (4.23)  (for  real  q)  are  co variant  under  the 
quantum  group  SUq(n)  where  n  is  the  total  number  of  indices.  This  is  true  of  the 
fermionic  algebra  of  (4.3),  (4.12)  and  (4.28)  also.  (See  refs.  [21-31]  for  this  relationship  to 
the  theory  of  quantum  groups.)  However,  our  approach  based  on  states  related  by 
q-statistics  (4.1)  does  not  involve  any  notion  of  quantum  group  per  se  and  so  we  shall 
not  describe  it.  The  formalism  of  the  generalized  Fock  space  appears  to  be  capable  of 
incorporating  many  general  algebraic  structures.  In  particular,  the  c;-ct  algebra  of  (4. 1 5) 
constitutes  a  general  infinite-parameter  deformation  while  the  quantum-group  related 
algebra  of  (4.23)  is  only  a  particular  case  of  this. 

We  have  constructed  the  general  algebras  for  arbitrary  complex  q.  The  choice  of 
q  defines  the  symmetry  of  the  state  under  permutation.  Hence  we  call  the  different 
choices  of  q  as  different  forms  of  statistics.  Particular  values  of  q  such  as  q  =  ±  1  or 
q  =  eie  are  of  special  interest,  although  these  cases  are  all  contained  in  the  general 
formulae  already  given.  We  shall  call  the  statistics  obtained  for  q  =  4-1,  —  1  and  eie  as 
Bose  statistics,  Fermi  statistics  or  'fractional'  statistics  respectively.  Either  the  bosonic 
or  fermionic  Fock  space  can  be  used  to  construct  any  one  of  the  various  forms  of 
statistics.  Thus,  within  our  terminology,  it  is  perfectly  possible  for  instance  to  have 
Fermi  statistics  residing  in  bosonic  Fock  space  or  vice  versa. 

Within  a  particular  statistics,  different  choices  of  M  or  0  correspond  to  different 
representations  of  c;  and  ct,  which  in  turn  lead  to  different  ccf  algebras.  If  we  assume  the 
factorization  given  in  (4.9),  for  a  given  q,  only  the  i  =j  part  of  the  cc1"  algebra  depends  on 
the  representation. 

For  the  sake  of  clarity,  all  these  forms  of  statistics  and  algebras  are  exhibited  in 
tables  2  and  3  for  the  bosonic  and  fermionic  spaces  respectively. 

Several  comments  are  in  order,  concerning  the  contents  of  these  tables.  The  algebra 

CfCj  -  CjC{  =  0,    for    i^j  (4.29) 

c,ct-cjc,.  =  0,    for    i^j  (4.30) 

c£c]  =  l  (4.31) 

provide  the  simplest  representation  for  Bose  statistics  based  on  the  simple  choice  of 
M  or  <j&  as  unity.  This  is  analogous  to  the  standard  representation  (3.5)  for  infinite 
statistics  but  very  different  from  it.  In  fact  they  live  in  entirely  different  Fock  spaces.  In 
recent  literature,  (4.31)  has  been  sometimes  confused  with  infinite  statistics.  Equation 
(4.31)  coupled  with  (4.29)  and  (4.30)  infact  describes  Bose  statistics  only,  although  in 
a  noncanonical  representation. 

Attention  may  be  drawn  to  the  'anticommuting  bosons'  and  the  'commuting 
fermions'  shown  in  tables  2  and  3  respectively.  The  commuting  fermions  have  been 
called  hard-core  bosons  in  condensed-matter-physics  literature.  Our  terminology 
seems  more  appropriate  since  they  are  really  fermions  in  disguise,  living  in  the 
fermionic  Fock  space. 

The  precise  connection  of  our  'fractional'  statistics  to  the  exchange  property  of  the 
wavefunctions  proposed  [39, 40]  for  one  and  two-dimensional  systems  needs  further 
study.  In  particular,  a  suitable  mapping  of  the  ordered  indices  to  coordinates  in  one  and 
two-dimensions  is  necessary. 

11£  Pramano  _    T    Phvc      Vnl    4<     Mn    ">      Anmi 


Generalized  Fock  spaces,  statistics  and  algebras 

Among  all  the  different  algebras  given  in  tables  2  and  3,  only  two  of  them,  namely  the 
canonical  representations  for  the  bosons  and  fermions  have  the  distinction  of  being 
covariant  under  the  unitary  transformations  that  mix  the  indices  (see  (3.37)).  All  the 
other  algebras  violate  this  requirement,  although,  as  we  have  already  mentioned,  two  of 
them,  one  in  the  bosonic  and  the  other  in  the  fermionic  Fock  spaces,  are  covariant 
under  the  quantum  groups  SUq(n). 

The  examples  contained  in  these  tables  can  be  called  deformed-oscillator  algebras  on 
which  many  papers  [1-12,21-27,  32,41,42]  have  been  written  recently.  In  spite  of  the 
multiplicity  of  these  deformed  algebras,  we  must  not  ignore  the  basic  fact  that  they  are 
all  avatars  of  just  two  primary  constructions  which  may  be  taken  to  be  the  canonical 
bosonic  algebra  for  the  bosonic  Fock  space  and  the  canonical  fermionic  algebra  for  the 
fermionic  Fock  space.  All  the  different  forms  of  statistics  belonging  to  the  bosonic  Fock 
space  as  well  as  their  various  algebraic  representations  are  related  to  each  other  and  to 
the  canonical  bosonic  algebra  and  similar  is  the  situation  for  the  fermionic  Fock  space. 
Given  two  different  forms  of  statistics  within  the  bosonic  Fock  space  characterized  by 
the  statistics  parameters  <?:  and  q2  respectively,  or/and  two  different  representations 
characterized  by  the  functions  ^(n)  and  </>2(rc)  respectively,  it  is  easy  to  get  from  (4.10) 
the  relationship 


c,(2)  -  ;'(!)  (4  32) 

jP)~W  **<*,)     *,<W;-l)j()  (         ' 

The  corresponding  equation  for  the  fermionic  Fock  space  is 


Hence,  from  the  point  of  view  of  Fock  space,  there  is  nothing  new  in  all  these  deformed 
oscillators. 
In  the  above  equations,  the  number  operator  ATf  are  given  by 

..  nf...||  (4.34) 

and,  written  in  this  fashion  in  terms  of  the  normalized  states  ||  »,  N{  is  the  same  in  all  the 
different  forms  of  statistics  (within  the  bosonic  or  fermionic  Fock  spaces)  and  in  all  the 
different  representations. 

Using  (4.14)  it  is  also  possible  to  express  the  number  operators  in  terms  of  c  and  c*  in 
any  statistics  or  any  representation  within  the  bosonic  or  fermionic  Fock  space.  Noting 
that  for  the  canonical  bosonic  algebra 


we  get  from  (4.  14) 

A^cfc,.  (4.36) 

For  other  statistics  and  representations  also,  the  set  of  equations  obtained  from  (4.14) 
starting  with  7=  1  and  successively  increasing;,  can  be  seen  to  be  implicit  recursion 
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formulae  for  all  Nj.  Thus  we  have 


C2C2  = 


4>(N2-\) 


<t>(N 


(4.37) 


By  choosing  4>(Nj)  one  then  gets  the  desired  explicit  expressions  for  Nj.  However,  as 
already  emphasized,  such  expressions  are  not  of  much  use:  the  universal  representation 
of  NJ  given  in  (4.34)  is  sufficient  for  all  purposes. 

The  formalism  of  Fock  space  developed  here  has  sufficient  flexibility  so  as  to  allow 
many  straightforward  extensions  or  generalizations.  For  instance,  the  relationship 
between  permuted  states  defined  by  (4.1)  or  (4.2)  can  be  generalized  further.  Instead  of 
(4.3),  we  may  take 


=  0    for 


(4.38) 


where  q{j  are  complex  parameters.  This  may  be  called  multiparameter  q-statistics.  The 
algebras  corresponding  to  such  extensions  as  well  as  further  generalizations  can  all  be 
constructed  essentially  by  the  same  procedure  as  given  in  this  section.  Details  will  be 
presented  elsewhere.  Suffices  it  to  say  that,  once  the  relationship  among  the  permuted 
states  is  specified  through  the  cc  relation,  the  rest  of  the  story  follows. 

Special  cases  of  such  algebras  have  been  reported  in  the  literature  on  quantum 
groups  [21-33],  We  have  already  referred  to  the  generality  of  our  approach  as 
compared  to  quantum  groups.  As  further  points  of  comparison  between  the  two 
approaches  we  must  mention  the  following.  From  the  point  of  view  of  quantum  groups, 
the  whole  set  of  relations  among  c  and  cf  are  taken  for  granted.  In  contrast,  our  analysis 
based  on  the  underlying  Fock  space  reveals  the  cc  relation  as  the  key  to  the  whole 
algebraic  structure,  although  the  normalization  function  (p(n)  also  plays  a  role  in 
determining  the  actual  representation  of  the  operators.  Hence,  depending  on  the  mode 
of  expressing  the  cc  relation  and  the  choice  of  0(«),  one  can  generate  any  number  of 
algebras  of  c  and  cf.  Thus,  our  approach  helps  to  demystify  the  quantum-group  related 
algebras  by  reducing  them  to  their  basic  essentials  which  are  identified  to  be  simple 
properties  of  state  vectors  in  the  Fock  space.  Reversing  the  procedure,  one  can 
probably  reconstruct  the  whole  edifice  of  the  quantum  group  itself  starting  from  the 
more  elementary  notions  relating  to  states  in  the  Fock  space.  This  of  course  lies  outside 
the  purview  of  the  present  paper. 

4.2  Fock  space  of  frozen  order  and  null  statistics 

In  subsection  4.1  we  had  taken  any  two  states  obtained  by  permutation  to  be  related  to 
each  other.  We  now  consider  a  limiting  situation  in  which  the  particles  are  frozen  in 
a  particular  order,  with  no  permutation  allowed.  This  is  the  Fock  space  of  frozen  states 


this  may  be  obtained  as  the  limit  g->0  of  the  g-statistics,  more  properly,  it  must  be 
regarded  as  an  independent  statistics.  For,  whereas  the  statistics  for  two  (non-zero) 
values  of  q  are  related  to  each  other  through  (4.32)  or  (4.33),  the  statistics  for 
q  =  0  cannot  be  related  to  statistics  for  q  ^  0.  Hence,  we  shall  construct  the  algebra 
of  c  and  c1"  for  this  system  of  frozen  states  directly  from  the  definition.  Again,  depend- 
ing on  whether  the  particles  obey  exclusion  principle  or  not,  we  have  two  different 
versions  of  the  system  which  we  shall  call  the  fermionic  and  the  bosonic  versions 
respectively. 

Let  us  first  consider  the  bosonic  version.  Referring  to  (4.1)  the  Fock  space  of  frozen 
states  is  obtained  by  taking  \n^  n2  .  .  .  ;  1>  as  the  allowed  state  and  requiring 

|n1n2...;Ai>  =  0    for    ^1  (4.39) 

Or,  equivalently 

|...1J.1,....>=0    for    i<j  (4.40) 

and  hence 

c,.c,.  =  0    for    i<j  (4.41) 

Assuming  factorization  of  the  norm  as  in  (4.9),  we  can  represent  the  creation  operator 
in  the  form: 


C3=    I  TrrJi^---^-^^-1-^^! 

nj,nj-n...  <P(.nj'T~  L) 

(4.42) 

The  zeroes  in  the  state  vectors  arise  from  the  fact  that  c]  can  create  a  quantum  with 
index  j  only  if  indices  i  <j  are  unoccupied.  From  (4.41)  and  the  orthogonality  of  the 
states,  we  get 

ckc}  =  0    for    k*j  (4.43) 

The  cjc]  algebra  will  depend  on  the  choice  of  0(^)70(^+1).  We  shall  assume 
0(n.)  =  0(«j  +  1)  for  simplicity.  Then,  by  using  the  completeness  relation  for  the  states, 
one  can  verify 

cjc]=l~  14'*  <444) 

k<j 

Equations  (4.41),  (4.43)  and  (4.44)  together  define  the  algebra  for  the  bosonic  version  of 
the  statistics  of  frozen  order. 
For  the  fermionic  version,  (4.41)  is  replaced  by 

ciCj  =  Q  for  i^j  (4-45) 

and  one  again  gets 

CfeCJ  =  0  for  k*j  <4-46) 

but,  instead  of  (4.44),  one  finds 

vJ-i-14*  (447) 

ksSj 
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Thus,  the  fermionic  version  of  the  statistics  of  frozen  states  is  described  by  the  algebra  of 
(4.45)-(4.47).  It  is  interesting  to  note  that  the  replacement  of  the  sign  <  in  the  bosonic 
algebra  by  the  sign  ^  yields  the  fermionic  algebra. 

Finally,  we  note  that  the  expression  for  the  number  operators  in  terms  of  c  and  cf  for 
the  above  algebra  of  null  statistics  can  be  shown  to  be  essentially  the  same  as  that  for 
the  standard  representation  of  infinite  statistics  (eq.  3.9a),  but  because  of  (4.41)  or  (4.45), 
it  can  be  simplified  to  read  as 


\f   — 
l\— 


kkk 


where  the  sum  over  n1,n2...nk  is  unrestricted  (^0)  for  the  bosonic  version,  but  is 
restricted  for  the  fermionic  version:  n1,n2...nk_1  go  over  0  and  1  while  nk  =  0. 

4.3  Thermodynamic  aspects 

For  the  bosonic  and  fermionic  Fock  spaces,  (3.40)  for  the  partition  function  becomes 

Z=    £    e-m*t,m  (4.49) 

«,,»2... 

where  «,-  =  0,  1,  2,  .  .  .  oo  for  the  bosonic  space  and  nf  =  0,  1  for  the  fermionic  space.  This 
is  a  well-known  result  for  the  canonical  Bose  and  Fermi  statistics,  but  we  emphasize  the 
fact  that  the  partition  function  and  the  resulting  thermodynamics  is  the  same  for  all  the 
various  q-statistics  and  q-deformed  algebras  living  in  the  same  bosonic  Fock  space  and 
similarly  for  the  fermionic  Fock  space.  This  point  has  been  missed  in  much  of  the 
current  literature  and  so  considerable  confusion  has  been  created.  Part  of  this 
confusion  is  due  to  the  choice  of  the  inappropriate  Hamiltonian  (eq.  (3.42))  and  the 
wrong  emphasis  placed  on  some  particular  algebra  of  c  and  cf. 

We  further  note  that  the  above  Z  in  (4.49)  is  valid  even  for  the  Fock  space  of  frozen 
order  and  null  statistics  since  d  =  1  in  this  case  too. 

To  sum  up,  equilibrium  thermodynamics  of  a  system  of  free  particles  is  determined 
entirely  by  the  dimension  d  and  spectrum  of  allowed  occupation  numbers  which 
together  characterize  the  Fock  space  and  does  not  depend  on  the  permutation 
properties  of  the  multiparticle  states  or  the  algebras  of  c  and  cf. 

5.  Derivation  of  cc  relations  from  ccf  algebras 

In  the  last  section  construction  of  the  bilinear  algebra  of  c  and  cf  starting  from  states 
related  by  (4.1)  was  shown.  In  other  words,  ccf  algebra  has  been  derived  from  the  cc 
algebra  given  by  (4.3).  The  converse  is  also  true;  the  cc  algebra  can  be  derived  from  the 
ccf  algebra,  as  is  shown  in  this  section.  Thus,  within  the  framework  of  Fock  space  it  is 
unnecessary  to  give  both  cc  and  ccf  relations.  Either  the  cc  relation  or  the  cc1"  relation 
can  be  used  to  define  the  Fock  space  and  the  other  can  be  derived.  But  there  are  some 
caveats: 

(i)  Although  the  cc  relation  does  define  the  Fock  space,  the  ccf  algebra  does  not  follow 
uniquely;  as  already  pointed  out,  the  operators  c,  c  f  and  their  algebras  depend  on  the 
choice  of  M  . 
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(ii)  In  order  to  define  the  Fock  space  completely,  the  cc+  algebra  must  fulfill  certain 
conditions.  The  ccf  algebra  must  be  such  that  an  arbitrary  inner  product  <0|  <?,•£,•... 
c*cj,|0>  or  infact  the  vacuum  matrix  element  of  any  polynomial  in  the  c's  and  c>f's 
arbitrarily  ordered,  can  be  calculated  using  the  cc>+  algebra  and  the  definition  of  the 
vacuum  state  |0>: 

cJO>  =  0    for  all  fc.  (5.1) 

A  general  form  of  the  cc  +  algebra  that  satisfies  this  requirement  is: 

^]  =  AU+  Y.Bijkmclcm  (5.2) 


k,m 


where  A{j  and  Bijkm  can  be  constants,  but  more  generally  they  can  be  functions  of  the 
number  operators.  All  the  ccf  algebras  considered  in  this  paper  are  of  this  form. 

There  exists  an  elegant  method  [19}  to  derive  cc  relations  from  the  ccf  algebras. 
Restricting  ourselves  to  relations  quadratic  in  c,  we  define 

Qij^CfCj-q'CjCi  (5.3) 

where  q'  may  be  an  arbitrary  complex  parameter.  Suppose  that,  by  using  the  given  ccf 
algebra,  we  are  able  to  prove 


i'j'V 

for  all  i,j  and  k  where  F,.^,.,^.  may  be  a  onumber  or  operator.  Then,  by  applying  this 
equation  successively,  we  get 

QijCkCm---~    Z,      L     Fijk;i'j'k'Fi'j'm;i"j"m'Ck'Cm'---Qi"j"  (5-5) 

i'j'k'  i"j"m' 

Allowing  both  sides  of  this  equation  to  act  on  |0>,  the  right  side  vanishes  and  so  we  see 
that  QLj  acting  on  any  Fock  state  c£c£,...|0>  gives  zero.  Hence  we  may  write  the 
operator  identity: 

<20.  =  0  (5.6) 

which  is  the  cc  relation  sought  after.  It  may  also  be  pointed  out  that  often  one  finds 


so  that  (5.4)  is  simplified  to 

eyc;=/(xey  .  (5-g) 

Thus,  the  form-in  variance  of  Qfj  on  being  pushed  to  the  right  of  c£,  as  explicitly  given  in 
(5.4)  or  (5.8)  is  the  necessary  and  sufficient  condition  for  the  existence  of  the  cc  relation. 
We  may  now  apply  this  method  on  various  ccf  algebras  discussed  in  the  previous 
sections. 

5.1  q-mutator  algebra  with  real  q: 
The  cc*  algebra  is 

dij,     ViJ.  (5.9) 


We  define 


From  (5.9), 


The  form-invariance  of  Qtj  requires 

q'  =  q=  ±1. 
So,  we  get  the  cc  relations 


-  q')CjSki 


(5.10) 


(5.11) 


(5.12) 


(5.13) 


corresponding  to  Bose  and  Fermi  statistics  at  q  =  +  1.  For  —  1  <  q  <  1,  there  are  no  cc 
relations  and  we  have  infinite  statistics.  The  inner  product  matrix  M  which  remains 
positive  definite  for  -  1  <  q  <  1,  develops  zero  eigenvalues  at  q  =  ±  1.  (see  eq.  (3.13)) 
corresponding  to  the  null  states  such  as  (c\c^  +  c]ct)|0>  arising  from  (5.13). 

5.2  q-mutator  algebra  with  complex  q 
The  cc1"  relations  are 

ct.ct-qctc(.  =  0    for    i<j 

Cic\-pc\Ci=\ 


We  define 

Qu  =  CiCj~q'cjCi    for 

=  CjC,.    for    i=j 
Using  (5.14)  and  (5.15),  we  find 

QiA  =  q2clQtj    for    i 
=  q*2clQu    for 
=  \q\2cQ      for 


(5.14) 
(5.15) 

(5.16) 
(5.17) 


ij    for    fc  =  i 
for    i<;  = 
for    f=   =  k. 


(5.18) 


The  form-invariance  of  Q(j  for  i  <j  requires 

q'  =  q*  =  e-ie  '  (5.19) 

where  6  is  real.  We  thus  get  the  cc  relation  corresponding  to  fractional  statistics  at  the 
boundary  \q\  =  1  of  the  disc  \q\  <  1  in  the  complex  plane. 

We  already  know  that  the  disc  \q\<l  corresponds  to  the  region  of  positive-definite 
inner  product  matrix  M  for  the  complex  <?-mutator  algebra  of  infinite  statistics.  At  the 
boundary  of  this  positivity  region,  we  have  fractional  statistics  living  in  a  reduced  Fock 
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Im  q 


Re  q 


Figure  2.  The  complex  g-plane  of  the  g-mutator  algebra.  The  disc  \q\  <  1  corre- 
sponds to  infinite  statistics  and  the  circle  \q\  =  1  corresponds  to  fractional  statistics. 
F  and  B  are  the  Fermi-Dirac  and  Bose-Einstein  points. 


(5.20) 


space  (see  figure  2).  It  is  the  states  of  vanishing  norm  of  the  form 
(...ctc]...|0»     -*»(.. .cjcf...|0»    for     i</ 

which  contribute  to  the  zeroes  of  det  Mn  in  (3.34)  at  \q\  =  1.  Once  we  remove  these 
states,  we  get  the  reduced  Fock  space  of  positive-definite  norm. 
The  form-in  variance  of  Qtj  for  i  =jf  requites 

p=-l  (5.21) 

with  the  corresponding  cc  relation 

ctct  =  Q  (5.22) 

This  leads  to  the  exclusion  principle  namely  nt  =  0  or  1  only;  equivalently,  the  norms  of 
states  with  repeated  indices  are  zero  as  seen  by  putting  p  =  —  1  in  (3.35). 

To  sum  up,  we  may  note  four  possibilities  all  contained  in  the  algebra  of  (5.14)  and 
(5.15): 


(a)  Infinite  statistics  with  multiple  occupation 

\q\*\\    p*-L 

(b)  Infinite  statistics  with  exclusion  principle 

l*l*i;   P=-I 

(c)  Fractional  statistics  with  multiple  occupation 
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(5.23) 
(5.24) 

(5.25) 
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5.3  q-statistics 
The  ccf  relations  are 

cfc]-gcjc,.  =  0    for 

cjc]-pc]cj  =  \q\^ 
We  define 

Qij  =  cicj-q'cjc[    for 

Using  (5.27),  (5.28)  we  get 

a/-I  =  <724Qu    for    i 
=    *'c          for 


(5.27) 
(5.28) 

(5.29) 


for     i>j  = 

for    /c  =  f> 


(5.30) 


Repeating  the  argument  of  form-invariance  of  Qfj,  we  conclude 

cicj  —  q*"cjC£  =  0    for    z>;  (5.31) 

In  this  proof  we  have  not  used  any  particular  form  of  f(Nj),  but  have  used  only  the 
standard  commutation  relations  among  Nk  and  cf  (eqs.  (2.23)  and  (2.26)). 

This  derivation  of  the  c.-Cj  relation  for  i  ^j  is  valid  for  the  bosonic  as  well  as  the 
fermionic  Fock  spaces.  However,  for  the  fermionic  space,  there  exists  the  additional 
relation: 


(5.32) 


To  derive  this,  we  now  commute  cj  through  c,c{  using  (5.27-5.28),  we  get 


=  q2c]icici    for 


(5.33) 

(5.34) 

(5-35) 
we  see  that  (5.34)  is  satisfied,  since  in  the  fermionic  Fock  space,  5Ni>2  =  0  and  ctSNt  0  ==  0. 
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for     i  = 

So,  for  the  validity  of  (5.32),  we  require 
ci{pf(Ni-l)  +  f(Ni)}  =  0 
Substituting  the  form  of  f(N{)  for  the  fermionic  Fock  space  (from  eq.  (4.24)) 


One  may  also  note  that  (5.34)  is  also  satisfied  for  p  —  —  1  and  f(Nt)  =  constant 
(which  may  be  chosen  to  be  unity).  So,  for  bosonic  Fock  space,  i.e.  cici  ^  0,  we  must 
avoid  p  =  -  1  in  (5.28)  and  (4.15). 

The  above  derivations  of  cc  relations  have  used  the  general  ccf  algebra  and  hence 
includes  the  cases  of  quantum-group-covariant  algebras,  independent  deformed  oscil- 
lators, commuting  fermions,  anticommuting  bosons  etc.  Further,  it  must  be  noted  that 
the  presence  of  the  term  with  the  factor  I  q  \2Zi<JNi  in  (5.28)  is  crucial  for  the  validity  of  the 
cc  relations;  without  this  factor,  there  will  be  no  cc  relation  and  we  will  get  infinite 
statistics. 

6.  Two-indexed  systems 

We  have  so  far  considered  generalized  Fock  spaces  consisting  of  states  \ng,nh,...,;  jU> 
where  the  indices  g,h,...  may  refer  either  to  a  single  quantum  number,  or  to  a 
collection  of  quantum  numbers,  specifying  the  space,  spin  and  other  internal  degrees 
of  freedom.  In  the  latter  case,  it  may  be  supposed  that  one  has  mapped  a  collection 
of  indices  to  a  single  index.  What  is  envisaged  in  this  section  are  situations  where 
such  mapping  is  not  possible.  This  can  happen  in  various  ways.  To  be  specific, 
let  us  consider  oscillators  with  a  pair  of  indices,  a  latin  index  (g, h,...)  and  a  greek 
index  (a,/J...).  There  exists  a  class  of  systems  in  which  the  occupation  numbers 
with  a  single  index  ng  or  «a  are  defined,  but  occupation  numbers  with  both  the  indices 
nga  are  not  defined.  Such  systems  cannot  be  mapped  into  single-indexed  systems. 
In  another  class  of  systems,  nga  do  exist,  but  the  subsidiary  conditions  that  define 
the  reduced  Fock  space  depend  on  the  two  indices  g  and  a  in  such  a  way  that  prevents 
mapping  of  (g,  a)  into  a  single  index.  We  consider  these  two  classes  of  systems  in 
subections  6.1  and  6.2  respectively. 

In  may  also  be  mentioned  that  we  first  encountered  such  systems  in  the  study  of  the 
Hubbard  model  in  the  limit  of  infinite  Coulomb  repulsion  [43-46].  Although  this  was 
our  original  motivation,  this  has  now  opened  the  door  to  a  more  general  framework 
encompassing  novel  forms  of  statistics  and  algebras. 

6.1  Systems  in  which  nga  do  not  exist 

We  specify  the  states  as  \ng,  nh. . . ;  na,  n^ . . ;  /^>,  where  ng,  nh...  are  the  numbers  of  quanta 
with  indices  g,  h. . .  respectively,  while  na,  rip, . . .  are  the  numbers  of  quanta  with  indices 
a,  ^. . .  respectively  and  we  have  the  constraint: 

We  may  regard  ng  as  the  total  number  of  quanta  with  index  g  whatever  may  be  their 
greek  index  and  similarly  for  na.  In  such  a  state,  the  occupancy  number  with  both 
indices  such  as  nka  is  not  defined.  The  latin  indices  and  the  greek  indices  are  indepen- 
dently permutated  and  this  leads  to  a  much  enlarged  space  in  each  sector.  Now  jU  goes 
over  1 ...  s'  where 

,  fa +  n, +  ...)!  {6>2) 


nglnh\... 
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In  general,  5'  is  larger  than  s. 

Let  us  consider  the  two-particle  sector  as  an  example.  If  we  specify  occupation 
numbers  with  both  indices,  we  have  the  two  states  in  the  sector  (lka,  lm/?):|  lfca,  lm/J;  M> 
with  p.  -  1  or  2  which  correspond  to  |  lkx,  lm/?>  and  |  lm^,  lfca>  and  another  two  states  in 
the  sector  (lma,lk/l):|lma,lw;^>  with  /i=l  or  2  corresponding  to  |lm,,lk/,>  and 
|lt/r,  lMa>.  (Here  we  are  not  using  /ca,m/?  etc.  in  their  generic  sense;  they  are  used  to 
denote  specific  values  of  the  indices).  On  the  other  hand,  with  the  new  type  of  states  with 
decoupled  indices  for  which  occupation  number  with  both  indices  do  not  exist,  we  have 
the  two-particle  states  |lk,lm;la,lp;/i>  and  now  ^  goes  over  1  to  4  corresponding  to  the 
four  state  vectors  |lta,lM/r>,  |lm/JJ  lkst>,|lma,  I*/, XIV  !„«>•  Thus,  both  the  sectors 
considered  earlier  are  combined  to  form  a  single  enlarged  sector.  Such  a  regrouping  of 
sectors  with  consequent  enlargement  occurs  throughout  the  Fock  space,  in  the  case  of 
the  decoupled  indices. 

The  construction  of  the  orthonormal  set  as  well  as  the  other  properties  of  the 
generalized  Fock  spaces  given  in  §  2  goes  through  for  the  present  case  of  decoupled 
indices  also  except  that  the  matrices  X,M  etc.  will  be  of  higher  dimensions.  The 
creation  and  destruction  operators  c£a,  ckat  also  can  be  constructed  in  the  same  way  as  in 
§2.  The  relevant  equations  and  formulae  with  the  appropriate  changes  incorporated 
are  given  in  Appendix  A.  They  are  self-explanatory. 

Just  as  in  the  case  of  single-indexed  systems  (cf.  §2),  we  can  again  have  a  super  Fock 
space  in  which  all  the  states  connected  by  independent  permutation  of  the  latin  and 
greek  indices  are  taken  to  be  independent.  We  shall  call  the  associated  statistics  as 
'doubly-infinite1  statistics  since  it  is  infinite  statistics  in  latin  and  greek  indices  separate- 
ly. By  imposing  relations  among  the  permutated  states,  one  can  get  many  kinds  of 
reduced  Fock  spaces.  Because  of  the  larger  number  of  available  states  in  each  sector, 
many  new  types  of  statistics  become  possible.  These  can  be  discussed  and  the  associated 
algebras  can  be  constructed  by  the  same  procedure  as  in  §  3, 4  and  5.  However,  we  shall 
be  brief  and  restrict  ourselves  to  presenting  some  of  the  important  results  only.  Some  of 
these  algebras  and  the  new  kinds  of  statistics  implied  by  them  have  been  discussed  by  us 
in  greater  detail  in  the  earlier  papers  [18, 19,46]. 

Doubly-infinite  statistics 

Consider  the  a>f  algebra  described  by 

?A  ~  tf,p  I  clfcky  =  dkmdap  (6.4) 

where  q  is  a  real  parameter  lying  in  the  range 

-1<9<1.  (6.5) 

It  is  the  second  term  on  the  left  of  (6.4)  in  which  a  and  ]8  have  been  dissociated  from 
k  and  m  respectively  that  leads  to  the  decoupling  of  the  latin  and  greek  indices  and 
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prevents  the  mapping  of  latin  and  greek  indices  to  a  single  index.  One  can  show  [19] 
that  the  inner-product  matrices  following  from  this  algebra  are  all  positive-definite  for 
-  1  <  q  <  1.  Further,  one  can  show  using  arguments  similar  to  those  in  §  5  that  there  is 
no  cc  relation  for  this  algebra  in  the  same  parameter  range  and  so  all  the  states 
connected  by  independent  permutation  of  the  latin  and  greek  indices  are  independent. 
The  underlying  Fock  space  is  the  full  super  Fock  space  of  dimension  given  by  (6.2)  and 
we  have  infinite  statistics  in  latin  and  greek  indices  separately,  or  doubly-infinite 
statistics. 

For  q  =  0  in  (6.4),  the  two  indices  can  be  mapped  into  a  single  index  and  the  algebra 
reduces  to  the  standard  representation  of  single-indexed  infinite  statistics  described 
in  §3.1. 

The  algebra  of  (6.4)  is  covariant  under  the  unitary  transformations  on  the  latin 
indices: 


\  (6.6) 

m 

as  well  as  under  separate  unitary  transformations  on  the  greek  indices: 

«te=Z^cu;     KtK=KFt=1.  (6/7) 

A 

But  the  algebra  is  not  covariant  under  the  enlarged  unitary  transformations  involving 
both  the  latin  and  greek  indices  (for  q  =£  0).  A  special  case  of  the  unitary  transformations 
of  (6.6)  and  (6.7)  is  the  phase  transformation.  Equation  (6.4)  is  covariant  under  either  of 
the  following  phase  transformations: 

dka  =  e^cka  (6.8) 

eka  =  e^cka  (6.9) 

As  a  consequence,  the  number  operators  Nk  and  Na  exist.  However,  (6.4)  is  not 
covariant  under  the  transformation: 

A.  =  «*-cte  (6.10) 

and  correspondingly,  NkS  does  not  exist. 

We  can  make  q  in  (6.4)  complex  provided  we  order  the  latin  indices  and  thus  we  get 
an  algebra  which  is  the  analogue  of  the  g-mutator  algebra  with  complex  q  for  the 
single-indexed  systems  (§3): 

v-Zc-0    for    *<J  (6-n) 


where  q  is  complex,  but  p  is  real.  This  again  describes  the  same  doubly-infinite  statistics; 
only  the  representation  and  algebra  are  different.  This  algebra  is  no  longer  covariant 
under  the  unitary  transformations  of  (6.6)  and  (6.7),  but  is  still  covariant  under  the 
phase  transformations  of  (6.8)  and  (6.9).  Positivity  of  the  inner-product  matrices 
M  requires  \q\  <  1.  As  for  p,  similar  statements  as  in  §3  can  be  made. 
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Coming  back  to  (6.4),  at  the  boundary  of  the  range  of  q  given  in  (6.5),  namely  at 
q  _  +  i?  we  get  two  new  forms  of  statistics  called  orthobose  and  orthofermi  statistics 
[18,  19,46]  which  reside  in  reduced  Fock  spaces  defined  by  cc  relations.  The  algebras 
for  these  are  given  below: 

Orthobose  statistics: 

^d/.-^Ic^-^^  .     .  (6-13) 

y 
c   c    __c    c    __o  (6.14) 

CkaCm/J        Cmacfc0        u'  v  ' 

Orthofermi  statistics: 

Ck*clp  +  &*f  I  clft;  =  *****  (6L5) 

v 

Vm,  +  <W*,-0.  (6-16) 

The  inner  product  matrices  M  for  the  reduced  Fock  spaces  generated  by  these 
algebras  (eqs.  (6.  1  3)-(6.  1  6)  )  can  be  shown  to  be  positive.  In  both  these  statistics,  any  two 
states  obtained  by  permuting  the  greek  indices  are  independent.  On  the  other  hand, 
states  obtained  by  permuting  the  latin  indices  are  either  equal  to  each  other,  or  related 
by  (  —  1)J  where  J  is  the  number  of  inversions  in  the  permutation  of  the  latin  indices,  for 
the  orthobose  or  orthofermi  statistics  respectively.  Thus,  we  have  infinite  statistics  in 
the  greek  indices  and  Bose  or  Fermi  statistics  in  the  latin  indices.  For  orthofermi 
statistics,  we  have  the  further  condition: 

«fc  =  0orlonly.  (6.17) 

It  must  be  noted  that  the  exclusion  implied  by  (6.17)  is  stronger  than  the  usual  Pauli 
exclusion  principle;  (6.17)  requires  that  there  cannot  be  more  than  one  particle  with 
index  k  whatever  may  be  its  greek  index. 

We  have  constructed  a  local  relativistic  quantum  field  theory  [47]  based  on 
orthostatistics.  It  may  be  worth  mentioning  that  although  orthostatistics  does  involve 
infinite  statistics,  the  problems  faced  by  finite  statistics  are  avoided  by  associating  the 
latin  index  (pertaining  to  Fermi  or  Bose  statistics)  with  the  conventional  degrees  of 
freedom  such  as  momentum  and  spin  and  assigning  the  greek  index  (of  infinite 
statistics)  to  a  new  degree  of  freedom. 

Orthostatistics  can  be  generalized  to  g-orthostatistics.  We  keep  the  infinite  statistics 
in  the  greek  indices  but  have  <?-bose  or  <?-fermi  statistics  in  the  latin  indices.  More 
precisely,  we  must  regard  these  as  g-orthostatistics  lying  in  orthobosonic  and  orthofer- 
mionic  Fock  spaces.  We  can  call  them  g-orthobose  or  g-orthofermi  statistics. 

q-Orthobose  statistics: 

cfac#  +  ^<rfIc;yC,7  =  0    for     i<j  (6.18) 


V  k<j   y 

c:.  c  :o  +  a*  c  .-_£..„  =  0    for    i  <.  i  (6  ?(Y! 


Generalized  Fock  spaces,  statistics  and  algebras 
q-Orthofermi  statistics: 

Vj,  +  ^lcj,clr-0    for    i<j  (6.21) 

V 

CA  +  *<*  Z  cVjy  =  4*  +  MM2  ~  D  Z  Z  <V*,  (6.22) 

V  k<J    V 

CbCjf  +  q+CfrCi^Q    for    i<j  (6.23) 

V^  =  0  (6.24) 

In  (6.18)-(6.24),  q  is  an  arbitrary  complex  parameter.  For  q  —  el°,  and  q  =  0  we  shall 
have  fractional  statistics  and  statistics  of  frozen  order  respectively,  but  in  the  latin 
indices  only.  Of  course  one  can  construct  many  other  algebras  corresponding  to  the 
same  statistics,  just  as  in  the  case  of  the  single-indexed  systems. 

Further,  these  equations  are  analogous  to  (4.3),  (4.12),  (4.23)  and  (4.28)  which  are 
co variant  under  quantum  groups.  So,  another  direction  is  indicated  here  for  the  further 
generalization  of  quantum-group-theoretic  structures  to  two-indexed  systems. 

6.2  Systems  in  which  nga  exist 

We  shall  now  consider  a  different  kind  of  double-indexed  systems.  Here,  the  occupation 
numbers  ngoi  exist,  nevertheless  mapping  to  single-indexed  systems  is  not  possible 
because  of  the  subsidiary  conditions  that  define  the  reduced  Fock  space.  We  give  below 
three  examples  of  such  double-indexed  systems,  described  by  the  algebras: 

(a) 

(6.25) 


(6.26) 

(b) 

/  \ 

(6.27) 

(1  — ,5    \(c   c     —  c    c   )  =  0.  (6.28) 

(c)  , 

'    -Z4yCky)  (6'29) 

CkaCmB  -  (1  -  <5km)CmflCka  =  °'  (6.30) 

Each  of  these  algebras  is  covariant  under  the  unitary  transformation  on  the  greek 
indices  defined  by  (6.7),  but  not  covariant  under  the  unitary  transformation  on  the  latin 
indices  defined  by  (6.6).  However,  all  of  them  are  covariant  under  not  only  the  phase 
transformations  in  (6.8)  and  (6.9)  but  also  the  /ca-dependent  phase  transformation  of 
(6.10),  hence  all  the  occupation  numbers  nk,na  as  well  as  nka  exist. 

Algebra  (a)  leads  to  states  which  are  antisymmetric  for  simultaneous  interchange  of 
latin  and  greek  indices  as  in  Fermi-Dirac  statistics,  but  the  usual  Pauli  exclusion 
principle  is  replaced  bv  the  stroneer  or  more  exclusive  exclusion  principle,  as  in 


orthofermi  statistics  (see  eq.  (6.17)): 

nk  =  0  or  1  only, 
where 


(6.31) 
(6.32) 


These  are  precisely  the  states  that  are  allowed  in  the  Hubbard  model  of  strongly 
correlated  electrons  in  the  limit  of  infinite  intrasite  Coulomb  repulsion  if  we  interpret 
the  latin  index  as  the  site  and  the  greek  index  as  the  spin.  Hence,  this  algebra  and  the 
statistics  can  be  called  Hubbard  algebra  and  Hubbard  statistics  respectively.  For  more 
details,  the  reader  is  referred  to  [46].  Since  nka  exists,  the  states  of  the  system  can  be 

Table  4.    Statistics  and  algebras  for  2-indexed  systems. 


Statistics  in       Statistics  in 
Nk  latin  indices      greek  indices 


ccf  algebra 


Do  not  exist 


infinite 


infinite 


i,  c+  „ 

ka.   mfl 


c  „  — 


5  „  Y  cf  c.  =5,  <5  „    (real  q] 

afl  f—i    my    ky          km    a.p      v  "' 

y 

^ZcLvcky  =  °'    for  /c<m 

V 

t/iZ^/kv^^    (complex  q 
y  and  real  p) 


4-         ,      ^        T-n      f  0         c 

"'     -r  n       t   c'    r     =(i     (i 
~mp  —  Ua0  LJ  cmycfcy       "km0!! 


Exist 


,  x  =  |^fl2  forqr-bose;  x=-l  forg-fermi 


Fermi/Bose 

q-fermi/ 
q-bose 


Antisymmetric  for  total 
exchange,  but  nk  ^  1 


Symmetric  for  total  exchange  cfcac^  -  (1  -  ^Jc^c 

for  k?=m,  but  infinite  / 

statistics  in  greek  indices  =  km  ** \  +  7  °ky 
for  k  =  m 


Symmetric  for  total  exchange      ctoc^  —  ( 1 
but  nt  <  1  .     . 
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which  can  be  rewritten  in  the  form: 

£nte  =  0orlonly  . 

a 

this  system  cannot  be  mapped  into  a  single-indexed  system. 

Algebra  (b)  can  be  regarded  as  the  bosonic  'counterpoint'  of  algebra  (a).  States  are 
symmetric  under  simultaneous  exchange  of  latin  and  greek  indices  for  quanta  with 
different  latin  indices  (k  ^  m).  But  for  quanta  with  identical  latin  indices  (k  -  m),  there  is 
no  restriction  on  the  symmetry  with  respect  to  the  greek  indices.  In  other  words,  there  is 
infinite  statistics  in  greek  indices  if  the  corresponding  latin  indices  are  identical. 
Whereas  algebra  (a)  leads  to  more  exclusive  states  than  allowed  by  Pauli,  algebra  (b) 
leads  to  'more  inclusive'  states  than  allowed  by  Bose.  For  this  reason  we  may  call 
algebra  (b)  as  the  'inclusive  counterpoint'  to  algebra  (a).  Such  a  restriction  on  the 
allowed  states  of  a  two-indexed  system  which  distinguishes  k  =  m  from  k  ^  m  cannot  be 
mapped  into  a  condition  for  a  single-indexed  system. 

Finally,  algebra  (c)  leads  to  states  that  are  symmetric  for  simultaneous  exchange  of 
latin  and  greek  indices,  but  the  stronger  exclusion  principle  of  (6.33)  is  also  valid  as  in 
algebra  (a).  Hence,  algebras  (a)  and  (c)  represent  two  forms  of  statistics  which  may  be 
called  antisymmetric  and  symmetric  Hubbard  statistics  respectively  both  residing 
within  the  same  reduced  Fock  space.  On  the  other  hand,  algebra  (b)  and  its  statistics  lie 
in  a  different  reduced  Fock  space  which  is  a  Fock  space  with  the  new  'inclusion' 
principle. 

A  compilation  of  the  algebras  and  statistics  for  two-indexed  systems  is  given  in 
table  4.  The  cc  relations  are  not  included  since  they  can  be  shown  to  follow  from  the  ccf 
algebra,  whenever  they  exist. 

7.  Summary  and  discussion 

We  have  formulated  a  theory  of  generalized  Fock  spaces  which  is  sufficiently  general  so 
as  to  encompass  the  well-known  Fock  spaces  and  many  newer  ones.  We  have  shown 
that  such  a  theory  can  be  constructed  without  introducing  creation  and  annihilation 
operators.  The  only  requirements  for  constructing  a  generalized  Fock  space  are  to 
specify  the  set  of  allowed  states,  and  to  make  it  an  inner  product  space.  By  freeing  the 
notion  of  the  underlying  state  space  from  c  and  c\  we  are  able  to  define  different  forms 
of  quantum  statistics  in  a  representation  independent  manner.  Subsequently,  one  can 
construct  c  and  cf  and  their  algebras  in  any  desired  representation. 

Our  general  formalism  not  only  unifies  the  various  forms  of  statistics  and  algebras 
proposed  so  far  but  also  allows  one  to  construct  many  new  forms  of  quantum  statistics 
as  well -as  algebras  of  c  and  cf  in  a  systematic  manner.  Some  of  these  are  the  following: 

(a)  Many  new  algebras  for  infinite  statistics 

(b)  Complex  ^-statistics  and  a  number  of  ccf  algebras  representing  them 

(c)  A  consistent  algebra  of  c  and  cf  for  'fractional'  statistics 

(d)  Null  statistics  or  statistics  of  frozen  order 

(e)  'Doubly-infinite'  statistics  and  its  representations 

(f)  g-orthobose  and  g-orthofermi  statistics 
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(g)  A  statistics  for  two-indexed  systems  with  a  new  'inclusion  principle', 
(h)  A  symmetric  version  of  Hubbard  statistics. 

Our  primary  concept  is  that  of  generalized  Fock  space,  of  which  many  categories 
have  been  introduced  in  this  paper.  Next  comes  the  notion  of  statistics  which  is  defined 
by  the  type  of  symmetry  or  relationship  among  the  state  vectors  residing  in  the 
particular  type  of  Fock  space.  In  a  given  Fock  space,  more  than  one  type  of  symmetry 
can  be  postulated,  the  prime  example  of  this  being  the  symmetry,  antisymmetry  or 
g-symmetry  in  the  bosonic  and  fermionic  Fock  spaces.  For  a  given  statistics,  there  can 
exist  different  representations  of  c  and  cf,  leading  to  different  ccf  relations.  To 
summarize,  a  particular  Fock  space  can  admit  different  statistics,  and  a  particular 
statistics  can  be  represented  by  more  than  one  ccf  algebra.  But  the  important  point  is 
that  various  statistics  and  algebras  residing  in  a  given  Fock  space  are  all  interelated. 


GENERALISED  FOCK  SPACES 


Many 
represen- 
tations of 
q-  statis- 
tics 


Canonical 

bosonic 

and 

fermionic 

algebras 


Deformed 
oscilla- 
tors 


Commutinj 
fermions 


Anti- 
commut- 
ing 
bosons 


Figure  3.    Generalized  Fock  spaces,  quantum  statistics,  algebras  and  their  inter- 


These  interconnections  are  given  by  generalized  versions  of  the  well-known  Jordan- 
Wigner-Klein  transformations.  No  such  interconnections  exist  among  statistics  and 
algebras  belonging  to  distinct  Fock  spaces.  We  must  further  add  that  equilibrium 
thermodynamics  of  a  system  of  free  particles  is  the  same  for  all  the  different  statistics 
and  algebras  within  the  same  Fock  space. 

For  the  sake  of  clarity,  the  above-described  logical  order  of  concepts  as  well  as  their 
logical  interconnections  are  presented  in  the  form  of  flow  charts  or  block  diagrams  in 
figures  3  and  4.  The  single-indexed  systems  are  considered  in  figure  3.  The  Fock  spaces 
of  higher  dimension  are  shown  to  the  right  of  those  of  lower  dimension.  The  Fock  space 
of  frozen  order  as  well  as  the  bosonic  and  fermionic  Fock  spaces  have  the  lowest 
dimension  d  —  1  in  any  sector  {ng,  nh. . .}.  Next  come  the  parafermionic  and  para- 
bosonic  Fock  spaces  which  have  d  >  1.  At  the  extreme  right,  we  have  the  super  Fock 
space  which  has  the  largest  dimension  d  =  s  in  each  sector  with  5  given  by  (2.6).  Null 
statistics  and  infinite  statistics  can  be  regarded  as  the  opposite  limiting  cases  of 
generalized  statistics  and  hence  these  two  forms  of  statistics  along  with  their  Fock 
spaces  occupy  the  opposite  ends  of  the  diagram.  Although  not  shown  separately 
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Figure  4.    Same  as  figure  3  for  systems  with  two  indices  that  cannot  be  mapped  into 
a  single  index. 
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in  figure  3  because  of  lack  of  space,  the  bosonic  and  fermionic  Fock  spaces  are  distinct 
and  each  must  be  separately  associated  with  the  complete  set  of  statistics  and  algebras 
shown.  Same  is  true  of  the  parabosonic  and  parafermionic  Fock  spaces.  Further, 
there  are  two  Fock  spaces  of  frozen  order,  the  bosonic  and  fermionic  type.  And  finally, 
there  exists  another  super  Fock  space  with  exclusion  principle,  which  is  not  shown 
separately. 

Within  the  parafermionic  and  parabosonic  Fock  spaces  many  'deformations'  of 
parastatistics  and  many  other  representations  and  algebras  apart  from  Green's 
trilinear  algebra  [35]  are  possible.  These  are  indicated  by  the  hanging  arrows  in 
figure  3.  Further,  as  shown  by  the  dotted  lines,  there  is  enough  room  for  many  new 
varieties  of  Fock  spaces  and  associated  statistics  and  algebras.  These  possibilities  may 
be  pursued  in  the  future. 

Coming  to  figure  4  depicting  the  systems  with  two  indices,  here  again  Fock  spaces  of 
higher  dimension  generally  lie  to  the  right.  Although  shown  together,  the  orthobosonic 
and  orthofermionic  Fock  spaces  must  be  regarded  distinct.  Here,  one  can  envisage 
a  richer  harvest  of  new  Fock  spaces,  statistics  and  algebras  because  of  the  two  indices 
and  this  again  is  for  the  future. 

We  now  conclude  with  some  general  comments: 

1 .  We  must  once  again  repeat  and  emphasize  the  point  that  most  of  the  g-deformations 
on  oscillators  discussed  in  the  literature  amount  to  only  a  change  of  variable  and 
hence  must  be  regarded  as  different  avatars  of  bosonic  or  fermionic  systems. 
However,  one  must  clearly  distinguish  those  deformations  such  as  the  g-mutator 
algebra  of  Greenberg  that  require  the  construction  of  new  types  of  Fock  spaces. 
Obviously,  Greenberg-type  of  deformations  can  never  be  reduced  to  change  of 
variables  living  within  the  bosonic  or  fermionic  Fock  space.  Some  degree  of  confusion 
prevails  in  recent  literature  since  this  distinction  is  not  kept  in  mind,  (see  for  instance 
[7,32,41,42,48]). 

2.  In  §  4,  we  have  shown  that  algebras  that  are  covariant  under  quantum  groups  are 
only  a  particular  case  of  the  more  general  class  of  algebras  that  can  be  derived  from  the 
formalism  of  generalized  Fock  spaces.  This  formalism  is  based  on  linear  vector  space 
and  linear  operators  acting  on  this  space;  mathematically,  no  more  sophistication  is 
required.  And  yet  it  is  capable  of  handling  quantum-group  related  structures  in 
a  self-contained  manner.  It  would  seem  that  the  basic  concepts  of  quantum  groups  are 
contained  in  the  theory  of  generalized  Fock  spaces  and  it  must  be  possible  to  construct 
quantum  group  itself  starting  from  this  theory. 

3.  We  have  already  referred  to  the  desirability  of  co variance  under  unitary  transform- 
ations that  mix  the  indices  as  a  requirement  for  the  algebras  of  creation  and  annihila- 
tion operators.  We  shall  call  the  algebras  that  statisfy  this  requirement  as  covariant 
algebras.  This  property  stems  from  the  superposition  principle  in  quantum  mechanics. 
Since  the  indices  describe  quantum  states  of  a  single  particle,  if  we  demand  that,  for  any 
orthonormal  set  of  quantum  states  obtained  by  superposition  of  the  original  set  of 
quantum  states,  the  algebra  should  retain  the  same  form,  then  covariance  under 
unitary  transformations  follows.  Many  of  the  algebras  presented  in  this  paper  violate 
this  requirement.  Nevertheless,  these  algebras  may  be  useful  to  describe  specific  systems 
in  specific  states  such  as  those  encountered  in  condensed  matter  physics. 
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Some  of  these  noncovariant  algebras  do  have  other  nice  physical  and  mathematical 
properties.  Under  this  category,  we  may  include  the  algebras  that  are  covariant  under 
quantum  groups  or  the  algebras  representing  braid  group  statistics  [49,50]. 

Among  the  algebras  for  single-indexed  systems  that  have  been  discussed,  Green- 
berg's  (j-mutator  algebra  is  the  only  ^-deformation  that  is  covariant  under  unitary 
transformations,  but  then  one  has  to  pay  the  price  of  the  enlarged  Fock  space.  Every 
other  known  g-deformation  leads  to  a  noncovariant  algebra. 

Greenberg's  q-mutator  algebra  (including  the  case  q  =  0  which  is  the  standard 
representation),  the  canonical  bosonic  and  fermionic  algebras  and  Green's  trilinear 
algebras  for  parabosons  and  parafermions  are  the  covariant  representatives  living 
respectively  in  the  super  Fock  space,  bosonic  and  fermionic  Fock  spaces  and  the 
parabosonic  and  parafermionic  Fock  spaces.  All  the  other  algebras  living  in  these  three 
categories  of  Fock  spaces,  although  noncovariant,  can  be  transformed  to  these 
covariant  algebras  through  equations  such  as  (4.32).  This  is  not  the  case  for  the  algebra 
of  null  statistics  or  the  algebra  of  infinite  statistics  with  Pauli  principle  living  respective- 
ly in  the  Fock  space  of  frozen  order  and  the  super  Fock  space  with  Pauli  principle.  In 
these  Fock  spaces,  covariant  algebras  do  not  exist. 

4.  Quantum  mechanics  is  sometimes  viewed  as  a  deformation  of  classical  mechanics 
since  the  commutator  bracket  of  quantum  mechanics  can  be  related  to  the  deformation 
of  the  classical  Poisson  bracket,  the  Planck's  constant  playing  the  role  of  the 
deformation  parameter.  Relying  on  similar  reasoning  it  has  been  proposed  that 
a  deformation  of  canonical  commutation  relations  will  lead  to  fundamentally  new 
mathematical  or  physical  structures  [51-53].  The  analysis  presented  in  this  paper 
shows  that  nothing  of  this  sort  happens,  if  viewed  within  the  framework  of  Fock  space. 
The  transition  from  classical  to  quantum  mechanics  requires  the  replacement  of 
the  notion  of  the  phase  space  by  that  of  the  Hilbert  space  or  Fock  space.  In  contrast, 
we  have  seen  that  all  the  deformations  of  commutation  relations  can  be  formulated 
within  the  framework  of  Fock  space.  In  fact  most  of  the  deformed  structures  proposed 
in  the  literature  exist  within  the  time-honoured  bosonic  and  fermionic  Fock  space.8 
only.  Even  Greenberg's  infinite  statistics  lives  within  a  Fock  space,  although  an 
enlarged  one. 

5.  While  remaining  within  the  framework  of  quantum  mechanics,  the  general  theory  of 
Fock  spaces  presented  here  throws  light  on  the  enlarged  framework  within  which  the 
familiar  quantum  field  theory  and  statistical  mechanics  reside  and  hence  may  lead  to 
newer  forms  of  quantum  field  theory  and  statistical  mechanics.  This  is  infact  the  main 
motivation  behind  our  work.  Apart  from  earlier  work  on  parastatistics  [37],  we  may 
mention  as  examples  of  new  forms  of  quantum  field  theories,  Greenberg's  construction 
[13, 14,54]  of  a  non-relativistic  quantum  field  theory  based  on  infinite  statistics  and 
our  construction  [47]  of  a  local  relativistic  quantum  field  theory  based  on  orthostatis- 
tics.  Many  other  forms  of  quantum  field  theories  based  on  the  generalized  Fock  spaces 
may  be  possible.  Their  formulation  and  study  is  an  agenda  for  the  future. 

6.  One  may  not  be  able  to  construct  local  relativistic  quantum  field  theories  corre- 
sponding to  many  of  the  newer  forms  of  statistics  and  algebras,  since  admissible 
statistics  in  relativistic  systems   is  severely  restricted  by  the  axioms  of  local 
quantum  physics  [55].  However,  nonrelativistic  quantum  field  theories  based  on  such 
'inadmissible'  statistics  are  still  possible.  Condensed  matter  physics  is  a  rich  field  where 
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applications  of  such  theories  may  be  relevant.  In  fact  there  is  no  reason  why  the 
quasiparticles  encountered  in  condensed  matter  systems  should  be  bosons  or  fermions 
only.  We  have  shown  that  any  of  the  generalized  Fock  spaces  provides  a  perfectly  valid 
quantum-mechanical  framework  for  many-panicle  systems.  Hence,  quasiparticles 
living  in  a  generalized  Fock  space  offer  an  important  field  of  study. 

Appendix  A 

Generalised  Fock  Spaces  for  Two-indexed  Systems 

Here  we  consider  only  those  two-indexed  systems  in  which  ngaL  do  not  exist.  See  §6.1. 
The  state  vectors,  inner  products  and  orthonormal  sets: 
^n'g,n'h...;n'a,n'p...;^\ng>nh...;nx,nf...;vy 

=  <5-    <5-    ...<5  .    (5  •    ...Muv  (Al) 

ntnt  nknk       n»ni  nfnt         PV 

n,n...;n,n...;=X\ntn...;n9n...;vy  (A2) 


(A4) 
(AS) 


_     y 
Projection  operators: 


L)¥A<n....;«....;A|  (A8) 

=    £    P^n....;!!.,^...)  (A9) 
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Number  operators: 


Nk=   Z   «fcJp(>v ••«*••  •;"«•••) 

ng...nk... 

yy  _    y    n  p^n      ;n  ,nB...) 

ng... 

1\J      n        n         'MM         •//V<JS  =  M||M         n         •  M  •  YI 

iv k    rig. .  .nk. . . ,  Ma,  Up. . .,  p.//  —  rik  \\rig. . .  nk. . . ,  na,  «^. . . , , 


IN*  Njl  =  C^.,  N  /»]  = 
Total  number  operator  is 

tf=Etf*=Itf. 

fc  a 

Creation  and  destruction  operators: 

c}p=  Z  Z/ 


=  0     for  any  k,j  and  any  a, 


(A12) 
(A13) 

(A14) 
(A15) 
(A16) 
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For  some  particular  u, 


ng...nj+  l. 


—      V      V 

-    L    L 
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I»=     Z 
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Abstract.  In  this  communication,  we  reanalyze  the  causes  of  the  singularities  of  canonical 
perturbation  theory  and  show  that  some  of  these  singularities  can  be  removed  by  using 
time-dependent  canonical  perturbation  theory.  A  study  of  the  local  and  global  properties  (in 
terms  of  the  perturbation  parameter)  is  also  undertaken. 
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1.  Introduction 

For  almost  a  hundred  years,  it  has  been  known  [1]  that  canonical  perturbation  theory 
is  generically  singular.  Later  works,  most  notably  the  KAM  theorem  [2]  have  shown 
that  in  spite  of  the  singularities,  the  perturbation  series  does  make  sense  in  certain 
regions  of  phase  space  (the  irrational  tori),  and  hence,  the  orbits  of  the  perturbed  system 
lie  close  to  those  of  the  unperturbed  system.  In  the  case  of  the  rational  tori,  even  an 
infinitesimal  perturbation  is  adequate  to  completely  destroy  the  integrability  and  the 
orbits  of  the  unperturbed  system.  Our  analysis  of  perturbation  theory,  in  this  paper, 
tries  to  determine  the  exact  cause  of  the  singularities  and  shows  how  time-dependent 
perturbation  theory  can  be  used  to  study  Hamiltonian  systems  which  are  close  to 
integrable  systems. 

Consider  the  Hamiltonian 


,  0),  (1) 

where  /  denote  the  action  variables  and  6  denote  the  angle  variables  of  the  unperturbed 
Hamiltonian,  H0  .  (It  is  assumed  that  H0  defines  compact  orbits.)  In  the  language  of  Lie 
transforms,  the  aim  of  canonical  perturbation  theory  is  to  determine  a  generator 
J^(/,  6)  which  intertwines  between  the  time  evolutions  generated  by  H0  and  H: 

jFexp(-  tH0)  =  exp(-  tH)&  (2) 

as  operators  acting  on  phase  space.  To  calculate  J5",  we  write 

^  =  ...exp(e"Fn)...exp(eF1)  (3) 

where  the  action  of  the  F£  is  through  Poisson  Brackets: 

...  (4) 


for  any  function/  on  phase  space.  To  order  B,  and  in  the  limit  of  infinitesimal  t,  we  get 

(e2),  (5) 


l/VttoJ-Wi-  (6) 

We  now  recognize  that  the  fl's  represent  angles;  to  make  sense,  all  quantities,  including 
Ht  and  Fl  must  be  periodic  in  these  angles.  This  motivates  the  Fourier  expansions: 


-0)];  (7) 

Defining  the  frequencies  of  the  unperturbed  motion  by  o}  =  cH0/cL  we  get  the 
algebraic  condition: 


in,  (8) 

which  leads  to  an  obvious  singularity  whenever  (n-co)  =  0  (the  rational  case). 

1.1  Reasons  for  singularities  in  the  generator  Fl 

It  is  worthwhile  investigating  at  this  stage  the  reason  for  this  singularity.  As  we  shall  see, 
such  an  analysis  will  provide  us  with  a  way  of  circumventing  the  singularity. 

A.  Equation  (6)  can  be  considered  as  a  partial  differential  equation  for  F  l  .  Using  the 
method  of  characteristics,  we  see  that  the  solutions  of  the  characteristics  are  given  by 

0  =  d0  +  cot,    /  =  /0,  (9) 

where  00,/0  represent  the  initial  conditions.  As  can  be  seen,  in  the  rational  case, 
characteristics  are  closed;  as  a  result,  global  solutions  can  be  found  only  if  suitable 
initial  conditions  can  be  formulated  such  that  the  value  of  Ft  on  the  initial  surface 
matches  with  that  obtained  by  integrating  H^  along  the  characteristic  to  the  point 
where  it  reintersects  the  initial  surface.  This  is  possible  iff  the  integral  of  Hj  along  closed 
characteristics  vanishes,  i.e.,  iff.Hln  =  0  whenever  (n-co)  =  0.  Thus,  generically,  the  PDE 
has  no  global  solutions. 

B.  Equivalently,  (6)  can  be  solved  by  using  the  formula 

tffoJ/^dr,  (10) 


where  the  integral  is  to  be  computed  along  the  solutions  to  the  equations  of  motion 
using  H0  as  the  Hamiltonian.  In  general,  Hl  is  periodic  in  time  on  the  H0  orbits  and  can 
contain  a  piece  which  is  constant  in  time.  Such  a  piece,  on  integration  gives  rise  to 
a  term  which  is  linear  in  t  and  hence  to  a  term  which  is  linear  in  0's  (an  aperiodic  term). 
From  this  point  of  view,  the  singularity  in  Ft  arises  from  an  attempt  to  express  a  term 
linear  in  angles  as  a  periodic  series. 

C.  Equation  (6)  can  also  be  considered  as  an  operator  equation  in  a  Hilbert  space  of 
periodic  functions;  such  an  equation  admits  solutions  iff  the  RHS  of  the  equation  is 
orthogonal  to  the  null  eigenvectors  of  the  adjoint  of  the  operator,  a  condition  that  can 
easily  be  seen  to  be  equivalent  to  the  condition  that  Hn  =  Q  whenever  (n-w)  =  0. 
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D.  From  a  general  point  of  view,  any  canonical  transformation  that  is  well-defined 
(e.g.,  analytic)  and  is  time-independent  is  dynamically  trivial:  If  3?  generates  such 
a  canonical  transformation,  then  H0  and  H  have  the  same  spectrum:  there  is  a  1-1  map 
between  corresponding  orbits.  If  H  0  and  H  have  distinct  dynamical  contents  (as  would 
be  expected  to  be  the  case),  then  the  canonical  transformation  has  to  be  either 
ill-defined  (by  being  singular,  as  in  canonical  perturbation  theory  or  by  being  multiple- 
valued)  or  time-dependent. 

From  this  analysis,  it  is  clear  that  the  solution  to  opt  for  is  to  have  either  aperiodic 
terms  in  ^  or  to  allow  for  time-dependent  J^'s.  We  shall  assume  the  latter  in  this  paper. 

1.2  Derivation  of  the  time-dependent  generating  function 

To  derive  the  equation  for  F1  ,  assume  that  g  satisfies  the  equation 

^+{0,H0}=0  (11) 

and  define  h  =  exp(eFi)g  =  y  +  s{Fi,g}  +  0(e2).  Using  the  condition  that  h  be  an 
invariant  under  H,  i.e.,  (dh/dt)  +  {h,  H}  =  0  gives  the  equation  determining  Fl  : 

dF 

-^+{F1,H0}  =  Hl,  (12) 

whose  solution  is 


where  F10  is  an  arbitrary  function  of  /  and  00,  representing  the  value  on  the  initial 
surface  t  =  0,  and  where  90  =  B  —  ait.  Thus, 

-  F10.  (14) 


where  F10  is  now  an  arbitrary  function  of  /  and  6  —  cot. 

The  above  solution  can  also  be  written  in  a  different  form  which  is  sometimes  more 
useful: 


F,(I,0,t)=       HJI,6  +  o)(z-t),z)dz  +  Fi(t  (15) 

Jo 

which  follows  easily  from  the  characteristic  equations  of  the  PDE  for  Fx  (eq.  12). 

As  is  clear,  F10  corresponds  to  the  fact  that  we  can  choose  the  canonical  transform- 
ation arbitrarily  at  t  —  Q.  It  is  also  easy  to  see  that  the  time-dependent  canonical 
transformation  where  F10  is  zero  is  equivalent  to  using  the  unperturbed  Hamiltonian 
to  go  back  in  time  and  to  go  forward  in  time  using  the  perturbed  Hamiltonian.  In  what 
follows,  we  will  choose  F10  =  0. 

Another  motivation  for  considering  time-dependent  perturbation  theory  comes 
from  Hamilton-Jacobi  theory.  Usually,  while  applying  Hamilton-Jacobi  theory,  one 
considers  the  generating  function  to  be  linear  in  time  and  writes  the  Hamilton-Jacobi 
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equation  in  the  form 


S=W-Et.  (17) 

However,  it  is  also  possible  to  work  with  the  full  Hamilton-Jacobi  equation:  Assume, 
9)  =  H0(l)  +  eHl(I,6)  (18) 

"S 


s2(o,t)  +  ...  (20) 

Using  these  equations  in  the  Hamilton-Jacobi  equation 

dS 


,)  =  0  (21) 

and  equating  equal  powers  of  e  we  get  a  series  of  equations  determining  St.  It  is  then 
trivial  to  see  that  by  allowing  S  to  be  explicitly  dependent  on  time,  we  can  derive  an 
expression  for  S  which  is  equivalent  to  the  above  Lie  transform  formalism. 

To  analyze  the  effect  of  such  an  F1  and  to  draw  conclusions  about  the  integrability  of 
the  transformed  system,  it  is  convenient  to  replace  the  given  dynamical  system  with  one 
with  an  extra  degree  of  freedom;  the  two  extra  phase  space  coordinates  will  be  (t,  T), 
and  the  new  Poisson  bracket  is  defined  as, 


where  the  unprimed  bracket  refers  to  the  original  Poisson  bracket.  We  also  define 

H'0  =  H0+T,     H'  =  H+T  (23) 

and  use  z  to  denote  the  (new)  arc  parameter.  We  notice  that  there  is  no  change  in  the 
definition  of  H  L  .  It  is  then  trivial  to  see  that  the  equation  determining  F±  is  the  same  as 
(6),  using  primed  PBs,  and  that  the  solution  coincides  with  that  given  in  (15).  We  can 
thus  interpret  F^  as  a  genuine  function  on  the  extended  phase  space  defined  by 
(/,  T,  0,  t).  This  also  allows  us  to  establish  the  integrability  of  Hf  (to  order  e):  we  have 
n  +  1  invariants  (I,  T)  for  H'Q  and  their  transforms  under  the  canonical  transformation 
define  invariants  for  H'.  It  is  also  straightforward  to  derive  the  equations  for  higher 
order  perturbation  theory  (i.e.,  for  Ftt,n>  1)  in  a  similar  fashion. 

2.  Analyticity  properties  of  the  perturbation  series 

We  now  consider  the  analyticity  properties  of  the  perturbation  series  that  we  have 
developed  above.  There  are  two  aspects  to  this  study:  first,  we  consider  the  finiteness  of 
each  term  in  the  perturbation  theory,  that  is,  we  consider  the  behaviour  of  each  of  the 
generators  Ft  as  a  function  on  phase  space.  Secondly,  we  consider  the  convergence  of 
the  canonical  transformation  as  a  whole,  ^,  as  a  function  of  the  perturbation 
parameter. 
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Canonical  perturbation  theory  I 

2.1  Analyticity  properties  of  F{ 

We  note  that  the  function  F±  has  the  property  of  being  an  entire  function  of  the  two 
variables  t  and  (n-co)  (and  hence  /,  if  co's  and  Hn  are  entire  functions  of/).  This  important 
property  persists  to  all  orders  of  perturbation  theory: 

Lemma.  Assume  that  H'Q  and  Hx  are  analytic  in  I.  Define  a  sequence  of  canonical 
transformations  through  the  equations: 


[FltH'Q  }'  =  //!,  (24) 

H(B+1')  =  exp(fi(B+1)FJI+1)H("),     n^O, 
-  {^n  +  1  >#o}'  =  coefficient  of  en+1  in  H(n\    n^l.  (25) 

(Note  that  at  the  nth  stage,  F(n)  is  chosen  to  kill  the  term  of  order  e".)  Then,  Fn  is  an  entire 
function  of  /  and  t  and  is  periodic  in  0  for  all  n  =  1.  2,  ...  (In  brief,  we  shall  say  that  Fn  is 
regular.) 

Proof.  The  proof  follows  from  induction,  using  the  following  facts: 

(1)  If  H(n)  is  regular  and  Fn  is  regular,  so  is  H(n+  1}:  this  follows  from  the  fact  that  the 
computation  of  H("+1)  involves  the  computation  of  derivatives,  which  preserves 
regularity. 

(2)  The  PDE  {^4,  H'0  }'  =  G  has  the  property  that  if  G  is  regular,  then  A  is  regular;  in  fact, 
if  G  =  £Gnexp[i(n-0)],  where  the  Fourier  coefficients  are  entire  functions  of/  and  t, 
then  A  =  2L4nexprj(n-0)],  where 

r 

t.  (26) 

The  RHS  is  an  integral  of  an  entire  function  of  t  and  (n-co)  along  a  contour  which  lies 
within  the  region  of  analyticity;  hence  the  conclusion. 

Thus,  in  contrast  to  canonical  perturbation  theory,  each  of  the  generators  is  finite.  At 
this  stage  it  is  worthwhile  examining  points  (A)  and  (B)  once  again  from  the  new 
perspective. 

A'.  In  the  extended  system,  the  characteristics  are  never  closed  (the  equation  for  t  is 
t  =  t0  +  z);  thus,  it  is  possible  to  choose  as  initial  surface,  the  surface  t  =  0  and  choose 
F  arbitrarily  on  this  surface. 

B'.  Constant  terms  under  the  integral  are  retained  explicitly  as  functions  of  time,  not 
replaced  by  (aperiodic)  functions  of  0. 

2.2  Convergence  of  perturbation  theory 

As  noted  above,  the  full  canonical  transformation  3F  is  equivalent  to  transforming 
backwards  in  time  using  HQ  and  then  transforming  forwards  in  time  using  H.  Since  the 
first  step  is  independent  of  e,  the  analyticity  properties  of  3F  is  decided  by  the  analyticity 
properties  of  the  time  evolution  operator  H  as  a  function  of  e.  In  general,  of  course,  the 
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analyticity  properties  of  &  as  a  function  of  £  will  depend  on  r;  this  is  in  contrast  to  the 
time-independent  case,  where  analyticity  properties  are  independent  of  L  In  general,  it 
is  not  possible  to  derive  general  results  regarding  the  analyticity  properties  of  the 
transformation;  however,  there  are  two  special  results  that  are  of  interest: 

1.  We  assume  that  there  exists  a  time-independent  canonical  transformation 
& ,  N  (calculated  using  e.g.,  KAM  theory)  for  a  phase  space  point  and  for  a  certain 
value  of  £.  The  following  diagram  shows  the  relation  of  2F  with  the  KAM  generator. 


KAM 


H0t 


Ht 


^  KAM 

which  translates  into 

(27) 


Thus,  &  will  exist  provided  the  time  evolution  of  ^Y.MA  under  the  unperturbed 
Hamiltonian  is  well-defined  for  real  t.  In  particular,  since  <^"KAM  will  depend  on  /,  B  and 
their  evolution  under  HQ  is  trivial,  3F  will  be  analytic  in  g  provided  -FKAM  ^  analytic  in 
/,  9.  Further,  in  such  a  case,  3F  will  be  analytic  in  £  for  all  t. 

2.  For  a  class  of  Hamiltonian  systems  it  is  possible  to  relate  the  complex-time 
analytical  structure  of  fully  perturbed  Hamiltonian  systems  to  the  complex-£  analytical 
structure  of  the  canonical  transformation  which  transforms  the  Hamiltonian  with 
a  small  s  value  to  the  Hamiltonian  with  a  large  e  value. 

Take  a  Hamiltonian  system  where  the  unperturbed  part  is  a  homogeneous  function 
of  degree  m  in  phase  space  variables  (q{,  pt)  and  the  perturbation  part  is  homogeneous 
function  of  degree  n. 

H(qttPl)  =  H0(qttPt)  +  fiff  1(«,,p,);  (28) 

assuming  n  >  m,  a  scale  transformation  of  the  variables, 

q'i  =  q^>    P'i  =  Pi£a  (29) 

where  a  =  (l/(n  —  m)),  changes  the  Hamiltonian  into, 

H'te;,Pi)  =  e-"W«!,P;)  +  f*M,Pi))  (30) 

(Note  that  this  transformation  is  not  a  canonical  transformation,  but  it  transforms  the 
vector  field  of  H  at  some  £  to  the  vector  field  of  H  at  £  —  1.)  Reparametrizing  the  system 
by  inserting  £  =  (1  +  £')(1/ma>  yields, 

q'i,p'i)  +  H1(q'i,p'i)).  (31) 


This  Hamiltonian  represents  the  perturbation  of  the  Hamiltonian  #g=1(gj,p|)  = 
#o  (<?!•»  P|)  +  #1  (<?!•>  PJ)  by  itself  using  the  perturbation  parameter  e'.  i.e. 

^'(^p;-)  =  H£=1(^,p;.)  +  £'H£=1(^,p;).  (32) 
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For  this  problem,  Ft  =  He=1(gj,pj)r,  and  all  the  higher  order  generators  vanish;  thus 
Fj  alone  is  the  generator  of  the  canonical  transformation  and  so  it  can  be  used  as  the 
Hamiltonian  for  the  &'  evolution  (except  at  e  =  0,  because  there  the  scaling  transform- 
ation is  ill-defined). 

3F,       dp',         <9F, 

_  L  •     _£i  _  __  L  (33l 

dp\  '     ds'          dq'i  ' 

Now  Fi  is  the  same  as  Hiq^pf)  at  e  =  1,  but  for  an  extra  multiplier  t.  So  above  equations 
can  be  rewritten  as, 

ggiaH..1(gi.Pi).     dp'i         dHtmlWt,ti) 

'  (     } 


ds't  dPi         '     de't  dqt         ' 

But  these  equations  are  the  time  evolution  equations  for  the  Hamiltonian  He=  1  .  Thus  if 
He  =  1  has  singularities  in  the  complex  t  plane,  then  a  canonical  transformation  defined 
by  F!  also  will  have  singularities  in  the  complex  s't  plane,  and  hence  in  the  complex 
e  plane  for  a  fixed  t.  Also,  the  presence  of  a  natural  boundary  in  the  complex  t  plane  [3] 
for  the  Hamiltonian  He=l  will  manifest  itself  as  a  natural  boundary  in  the  complex 
e  plane  for  2F  .  (A  similar  technique  will  work  for  n  <  m.) 

In  case  the  singularities  of  the  time  evolution  operator  exp(—  tH)  in  the  complex 
e  plane  are  isolated,  it  is  possible  to  use  analytic  continuation  to  define  F  beyond  the 
radius  of  convergence.  However,  as  the  above  study  shows,  the  existence  of  natural 
boundaries  in  the  complex  t  plane  may  imply  the  existence  of  similar  boundaries  in  the 
complex  s  plane  also.  In  applications  to  specific  systems,  it  is  possible  to  study  the 
analyticity  properties  of  F  using  standard  tools  for  determining  the  analyticity  proper- 
ties of  the  solutions  of  the  equations  of  motion  using  H  as  the  Hamiltonian. 

3.  Conclusions 

The  following  points  can  be  noted  regarding  the  canonical  transformations  derived 
above: 

1.  Because  of  the  explicit  time-dependence  of  ^,  two  points  which  lie  on  the  same 
unperturbed  orbit  do  not  in  general  lie  on  the  same  perturbed  orbit. 

2.  The  spectra  of  the  two  Hamiltonians  H0  and  H  can  be  very  different,  even  though 
they  are  related  by  a  canonical  transformation:  this  is  because  of  the  explicit  time- 
dependence  of  &r. 

3.  It  is  trivial  to  see  that  time-dependent  perturbation  theory  works  satisfactorily  in 
two  situations  where  conventional  perturbation  theory  is  singular: 

a.  Let  Hl  be  a  function  of/  alone.  Conventional  theory  is  singular  in  this  case,  as  the 
only  non-zero  Fourier  component  corresponds  to  n  =  0.  Our  theory  allows  for 
a  regular  solution,  F1=H1t,  which  yields, 

8H 
J  =  I-     U=T+eH1;    u  =  t;    <£  =  0-e~;  (35) 

where  (/,  T,  t,  6)  denote  the  original  canonical  coordinates  and  (J,  I/,  u,  0)  the  trans- 
forms of  these  coordinates. 


b.  Let  Ht  be  such  that  E^  explicitly  depends  on  9  and  {H0,  H^ }  =  0.  (Note:  this  can 
happen  only  if  the  unperturbed  frequencies  CD  are  constant.)  Once  again,  we  have  the 
solution  F  =  H!  t  and  it  is  easy  to  check  the  correctness  of  the  transformed  invariants. 

4.  As  the  examples  described  above  clearly  show,  the  canonical  transformation  defined 
by  &  changes  not  only  the  (usual)  canonical  coordinates  (/,  0)  but  also  T.  It  is  the 
motions  defined  by  H'G  and  H'  in  the  extended  phase  space  defined  by  (/,  9,  T,  t)  which 
are  being  mapped  into  one  another.  In  principle,  we  could  try  to  reduce  the  transform- 
ation to  one  on  the  usual  phase  space  by  eliminating  t  (using,  e.g.,  the  invariant 
corresponding  to  T);  it  is  fairly  straightforward  to  see  that  this  results  in  recovering  the 
singularities  of  canonical  perturbation  theory. 

Numerical  studies  of  the  application  of  this  theory  will  be  studied  in  companion 
papers. 
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Abstract.  In  this  communication,  we  report  the  results  of  the  application  of  time-dependent 
perturbation  theory  to  the  Henon-Heiles  system.  We  show  that  the  predictions  of  the  perturba- 
tion theory  hold  good  for  short  times,  and  try  to  explain  the  increase  of  error  in  the  predicted 
results  with  the  increase  in  energy. 
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1.  Introduction 

One  of  the  fundamental  problems  in  canonical  perturbation  theory  is,  given  a  Hamiltonian 
H  =  H0  +  aH1,  (1) 

where  e  is  a  parameter,  to  determine  a  canonical  transformation  which  transforms  H0 
to  H,  as  a  power  series  in  £.  In  a  previous  paper  [1],  this  was  attempted  by  allowing  the 
canonical  transformation  to  be  time-dependent  and  it  was  shown  that  the  approach 
removes  some  of  the  singularities  of  canonical  perturbation  theory.  The  method 
introduced  a  new  degree  of  freedom,  by  using  time  as  a  canonical  coordinate,  with  Tas 
its  conjugate  variable,  and  replacing  the  original  Hamiltonian  H  by  H  +  T.  The 
salient  features  of  the  results  were: 

1.  The  generators  of  the  canonical  transformation  were  finite  at  every  order  of 
perturbation  theory. 

2.  The  generators  at  nth  order  of  perturbation  theory  were  O(£n). 

In  this  communication,  we  study  the  application  of  the  theory  to  the  Henon-Heiles 
Hamiltonian.  In  general,  perturbation  theory  can  be  tested  via  two  approaches, 
namely,  the  predictions  regarding  invariants  and  the  solutions  of  the  equations  of 
motion  (EOM).  Both  these  tests  have  been  performed  using  third  order  perturbation 
theory  for  the  system  under  study. 

The  plan  of  the  paper  is  as  follows:  In  §  2,  we  review  the  Henon-Heiles  system. 
Section  3  contains  the  basic  formulae  for  the  generators  of  the  canonical  transform- 
ation, the  invariants  and  the  solutions.  In  §4  we  present  the  numerical  results.  Section 
5  concludes  with  a  discussion  on  the  convergence  properties  of  the  perturbation  series. 


Henon  and  Heiles  [2]  to  investigate  the  integrability  of  motion  in  an  axisymmetric 
potential.  The  Hamiltonian  is  a  truncation  of  the  Toda-lattice  Hamiltonian  [3]  up  to 
third-order  terms  and  it  is  a  model  for  motion  of  a  star  in  an  axisymmetric  gravitational 
field.  The  Hamiltonian  is  given  by, 

(2) 


where  e  is  the  perturbation  parameter,  e  can  be  rescaled  using  the  transformation, 


i  —  1,  2  under  which, 


H~+~h 


where, 


Thus  solving  for  motion  of  Henon-Heiles  system  for  one  value  of  e  and  different  values 
of  energy  is  equivalent  to  solving  for  one  value  of  energy  and  different  values  of  £.  Our 
approach  here  has  been  the  first  one  i.e.  we  have  fixed  the  value  of  £  to  0-01  and  studied 
the  system  for  different  values  of  energy.  (For  comparison  with  other  results,  energies 
are  always  quoted  for  the  equivalent  Hamiltonian,  h.) 

Further  the  system  shows  a  transition  from  regular  behaviour  to  chaotic  behaviour  with 
the  change  in  energy  [2].  The  following  characteristics  are  well-known:  for  E  <  1/12  the 
Poincare  sections  indicate  regular  motion,  i.e.,  the  section  is  filled  with  closed  curves  and 
there  is  no  exponential  divergence  of  trajectories.  In  the  range  1/12  <  E  <  1/6  the  motion 
shows  a  transition  from  regular  to  chaotic  behaviour,  becoming  more  and  more  chaotic 
with  increase  in  E,  i.e,  the  area  which  is  covered  by  closed  curves  in  the  section  becomes 
smaller  with  increase  in  E.  Beyond  E  =  1/6,  the  motion  becomes  non-compact,  with 
solutions  escaping  to  infinity.  (There  are  unbounded  orbits  for  all  energies,  including 
for  those  with  E  <  1/6;  these  orbits  do  not  play  any  role  in  our  analysis,  as  a  suitable 
choice  of  initial  conditions  ensures  that  the  orbits  remain  bounded.) 

Classical  perturbation  theory  is  known  to  be  singular  for  this  Hamiltonian  because 
of  resonances  [3],  which  arise  at  second  order  in  perturbation  theory.  The  KAM 
theorem  [4]  cannot  be  applied  directly  to  the  system;  the  condition, 


is  not  satisfied,  as  the  unperturbed  frequencies  are  independent  of  the  actions.  Gustav- 
son's  method  [5]  give  good  results  for  small  energies  but  does  not  predict  the  chaotic 
behaviour  of  Poincare  sections. 
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We  take  the  unperturbed  Hamiltonian  as  the  two-dimensional  harmonic  oscillator 
Hamiltonian, 

r  r;  r  1      '      Jr  2  "  1  "  2       i        HT  /  T\ 

W0  = +  T  (3) 

with 

(.2     i     — ,—    i     ,-,2,     i     »2 


and  the  full  Hamiltonian  as, 

H'^Hc  +  eHi+T  (4) 

where, 

3 
^2  /-t\ 

-^r  (5) 


where  (r,  T)  are  extra  phase  space  coordinates  [1],  and  the  new  Poisson  bracket  is 
defined  as 


with  {/,<?}  denoting  the  usual  Poisson  bracket. 
In  unperturbed  action-angle  coordinates  we  get, 

#;>  =  /!  +  /,+  r,  (6) 


The  equations  determining  the  generators  are, 

{FlfH'0  }'  =  //,  (8) 


31 


'}'  (10) 


The  Fs  were  calculated  using  Mathematica  programs.  The  expressions  for  the  generat- 
ing functions  are  given  in  the  appendix. 
Accordingly  the  constants  of  motion  for  H  to  third  order  are  given  by, 


| 


3 


where  /  =  1,2. 
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The  solutions  of  the  EOM  for  H  can  be  obtained  from  those  of  H0  using  the 
equations 


where  /  =  1   9  ^  4  r    —I     F    —I      f    _  fi      £    _  fl     ?'  —  v     *•'  —  i'    £'  _  fl'    F  —  ft' 
wneiei  -  i,A  J,^,  £1  -^i,  C2  —  ^2>S3~  "i>  S4  —  y2>  Ci  ~  '1^2  =  ^2>  £3  =  yi'  £4-02- 

Here  I(t),  8(t)  represent  the  solutions  of  the  EOM  for  the  Henon-Heiles  Hamil- 
tonian,  while  /'(r),  d'(t)  represent  the  solutions  for  the  harmonic  oscillator  Hamiltonian. 

4.  Numerical  results 

Using  a  4th-order  Runge-Kutta-Gill  adaptive  ODE  solver,  the  EOM  for  the  Henon- 
Heiles  system  were  solved  and  the  invariants  predicted  by  third  order  perturbation 
theory  were  computed.  Figures  l(a)  and  l(b)  give  the  relative  variation  in  one  of  the 
constants  of  motion  (I\)  for  two  different  energies:  l(a)  for  the  regular  regime  of  the 
perturbed  Hamiltonian  and  l(b)  for  the  chaotic  regime  of  the  Hamiltonian,  for  first, 
second  and  third  order  perturbation  theory.  (In  the  figures,  relative  deviation  in  l\  (t)  is 
calculated  as  2  *  (I\  (t)  -  1\  (0)  )/(/;  (t)  +  1\  (0)).)  Figure  2  shows  the  third  order  prediction 
at  different  energy  values.  With  increase  in  E  the  constancy  becomes  worse  i.e.,  error 
increases  but  higher  order  results  remain  superior  to  lower  order  results  for  some  time, 
at  larger  time  the  error  become  too  large. 
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Figure  la.  Relative  variation  in  predicted  time-dependent  constant  of  motion 
1\  for  £  =  0-034,  O15  O2  and  O3  represent  results  of  first,  second  and  third  order 
perturbation  theory  results  respectively. 
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Figure  Ib.     Same  as  figure  la  for  E  =  0-1 12. 
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Figure  2.     Relative  variation  in  predicted  constant  of  motion  at  third  order  at 
different  £  values  for  the  Henon-Heiles  system. 

For  comparing  the  solutions,  the  same  Runge-Kutta-Gill  ODE  solver  was  used  and 
results  were  compared  with  the  predictions  of  (12).  These  results  are  summarized  in 
figures  3(a)  and  3(b)  (solid  lines),  which  show  the  relative  error  in  predicted  solutions 

theor"58  The  ^  nUmeriCal  solutions'  for  the  first>  sec°nd  and  third  order  perturbation 

i  ^ '    '    v*ipW     1  inW)/\f  ip(t)  —  Iin(t))  where  I,    is  the 
predicted  solution  and  /,_  is  the  numerical  snlntinn  \  Fim,r«  A  ou««,n  +u~  _lp..i._  _, 
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Figure  3a.  Relative  error  in  predicted  solution  for  /1  for  the  Henon-Heiles  system 
at  £  =  0-034.  O15  O2  and  O3  represent  results  of  first,  second  and  third  order 
perturbation  theory  results  respectively.  Dotted  curves  show  numerically  calculated 
higher  orders. 
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Figure  3b.    Same  as  figure  3a  for  E  =  0-1 1 2. 


application  of  the  third  order  perturbation  theory  for  a  spectrum  of  energies  ranging 
from  the  regular  to  the  chaotic  regime. 

We  also  tested  perturbation  theory  for  two  orbits  of  the  Henon-Heiles  system,  one  of 
which  is  known  to  be  chaotic  and  the  other  regular  (as  seen  from  their  Poincare 
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Figure  4.    Relative  error  in  third  order  predicted  solution  for  Jj  for  the  Henon- 
Heiles  system  at  different  £  values. 


sections).  Figure  5(a)  shows  the  behaviour  of  the  constants  of  motion  for  a  chaotic 
trajectory  and  5(b)  shows  the  same  for  a  regular  trajectory  at  the  same  energy  value. 
Figures  6  (a)  and  6(b)  are  the  Poincare  sections  for  the  initial  conditions  of  5  (a)  and  5(b) 
respectively. 


-0.2 


Figure  5a.    Relative  variation  in  third  order  predicted  constant  of  motion  I\  for  the 
Henon-Heiles  system  for  a  chaotic  trajectory  at  E  -  0-125. 
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Figure  5b.    Relative  variation  in  third  order  predicted  constant  of  motion  1^  for  the 
Henon-Heiles  system  for  a  regular  trajectory  at  E  =  0-125. 


5.  Convergence  of  the  series 

As  it  is  obvious  from  the  numerical  results,  the  prediction  of  the  perturbation  theory 
does  not  seem  to  be  in  agreement  with  the  numerical  results.  This  may  be  due  to  two 
reasons, 
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Figure  6a.    Poincare  section  of  the  chaotic  trajectory  of  figure  5a. 
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Figure  6b.     Poincare  section  of  the  regular  trajectory  of  figure  5b. 


1.  Only  first  few  orders  of  the  perturbation  series  are  considered,  in  which  case  the 
perturbation  theory  should  agree  with  the  numerical  results  at  higher  energies,  and 
larger  times  if  enough  number  of  terms  in  the  perturbation  series  are  considered. 

2.  Contrary  to  our  assumption  that  the  generating  function  is  analytic  in  the  complex 
e  plane,  there  exists  a  singularity;  in  that  case  one  can  expect  the  perturbation  theory  to 
converge  only  in  the  region  where  the  distance  of  £  from  origin  is  less  than  that  of  the 
radius  of  convergence. 

To  investigate  which  one  of  the  former  reasons  is  responsible  for  the  disagreement  of 
the  perturbation  theory  with  numerical  calculations  we  analyzed  the  analytic  structure 
of  the  system  in  complex  e  real  t  space  [1]. 

For  the  initial  conditions  selected  for  figure  2,  the  equations  were  integrated  in  real 
time  for  100  different  e  values  lying  on  a  circle  with  radius  0-0 1  and  centre  e  =  (0, 0).  The 
failure  of  the  integration  subroutine  was  considered  as  existence  of  singularity.  Three 
different  subroutines  drkgs  (SSP;  Runge-Kutta),  d02baf  (IMSL)  and  d02ebf  (IMSL  stiff 
equation  solver)  were  used;  the  routine  d02ebf  fails  with  ihalf  =  2,  which  indicates 
presence  of  singularity.  The  results  of  the  calculations  are  shown  in-table  1.  The  smallest 
time  at  which  one  of  the  points  on  the  circle  becomes  singular  is  given. 

It  is  not  expected  for  the  above  test  to  work  for  any  general  Hamiltonian  because  it 
does  riot  detect  branch  point  singularities,  but  in  this  specific  case  it  turns  out  that 
a  branch  point  in  qi  corresponds  to  a  branch  point  in  pi  with  negative  exponent  (this 
can  be  seen  from  Painleve  analysis  [6])  and  same  is  true  for  q2  and  p2.  Thus  even 
though  there  are  singularities  where  the  position  variables  remain  finite,  the  corre- 
sponding momentum  variables  become  infinite,  hence  the  time  at  which  one  of  the 
variables  become  infinity  is  the  same  at  which  another  variable  has  a  branch  point  with 
finite  value. 
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Table  1.    Complex-e  singularities  in  real  time. 


Initial  conditions 


Time  for  appearance  of  singularity 


Pi 


drkgs 


d02baf 


d02ebf 


4-47214 

4-89898 

0 

0 

t>200 

14-1421 

21-4476 

0 

0 

111-6145 

lll-8<£  <  111-9 

lll-8<t<  111-9 

30 

24-4949 

0 

0 

27-0544 

27-3  <  t  <  27-4 

27-3  <  t  <  27-4 

26-6648 

31-9374 

0 

0 

19-37 

19-6  <£<  19-7 

19-6  <£<  19-7 

34-9285 

34-9285 

0 

0 

13-4095 

13-6  <t<  13-7 

13-6  <t<  13-7 

This  analysis  indicate  that  the  perturbation  theory  should  be  convergent  for  given 
initial  conditions  and  e  for  larger  time  values  than  what  is  seen  in  figure  2,  thus  the 
reason  for  disagreement  should  be  truncation  of  the  series  rather  than  existence  of 
a  singularity.  To  verify  this  reasoning  we  did  a  numerical  study  as  follows. 

When  one  uses  the  generating  functions  for  mapping  of  the  solution  for  unperturbed 
Hamiltonian  into  the  solution  of  the  perturbed  Hamiltonian,  the  result  is  the  solution 
^  of  H  in  terms  of  solutions  %  of  H0,  t  and  a  power  series  in  e.  In  this  series  for  £  the 
coefficient  of  a  term  e."  is  (l/n!)(dn^/ds").  We  did  numerical  calculations  for  d"^/dfi"  using 
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Figure  7.  At  fixed  energy  1 00  different  initial  conditions  are  chosen  randomly  and 
each  initial  conditions  is  evolved  in  time  with  all  the  100  different  e  values  equally 
spaced  on  a  circle  of  radius  1  and  centre  (0, 0)  in  the  complex-e  plane.  The  minimum 
time  at  which  one  of  the  points  in  complex-e  circle  encounters  a  singularity  is  noted. 
From  the  minimum  time  thus  found  for  each  initial  conditions  at  a  fixed  energy  the 
minimum  and  maximum  times  are  plotted  against  energy. 
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contour  integral  definition  for  derivatives  and  matched  our  numerical  results  with  the 
analytical  results  predicted  by  perturbation  theory.  As  can  be  seen  from  figure  3,  where 
the  dotted  curves  denote  numerically  calculated  higher  orders,  the  third  order  analyti- 
cal predictions  are  well  in  agreement  with  numerical  values  and  the  fourth  and  fifth 
order  numerical  results  show  that  a  higher  order  analytical  calculation  would  make  the 
predictions  better. 

To  explain  the  worsening  of  perturbation  theory  at  higher  energies,  we  did  the 
following  calculation.  We  selected  100  different  initial  conditions  randomly  for  each  of 
49  different  energies  and  for  each  set  of  initial  conditions  we  fixed  100  points  on  the 
complex  s  circle  with  radius  1  and  centre  (0, 0)  and  calculated  the  minimum  value  of 
time  at  which  one  of  the  points  on  the  circle  became  singular.  This  value  of  time  is  the 
minimum  real  time  at  which  there  is  a  singularity  in  the  complex  e  plane  and  this 
singularity  will  decide  the  radius  of  convergence  for  the  perturbation  theory.  Figure  7 
shows  the  minimum  and  maximum  values  of  the  above  minimum  time  from  the  100 
initial  conditions  at  each  energy  value  versus  the  energy.  The  minimum  time  value  in 
the  figure  is  the  value  up  to  which  the  perturbation  theory  will  converge  if  enough 
number  of  orders  are  calculated,  whereas  the  maximum  at  each  energy  shows  that  there 
are  some  initial  conditions  for  which  the  perturbation  theory  will  converge  for  very 
large  time  without  any  need  for  analytical  continuation  of  the  generator. 

It  thus  appears  from  our  analysis  that  the  time-dependent  canonical  perturbation 
theory  can  be  used  for  studying  the  Henon-Heiles  system,  the  fact  that  the  results  show 
disagreement  with  numerical  results  is  to  be  attributed  to  the  low  order  of  perturbation 
theory  used. 

Appendix  A.  Generating  functions 

We  give  below  the  form  of  the  generating  functions  up  to  order  e3. 
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Abstract.  It  is  demonstrated  how  the  energy-dependent  Green's  function  for  the  Schrodinger- 
Coulomb  problem  can  be  deduced  from  a  knowledge  of  the  harmonic  oscillator  time-propaga- 
tor. All  the  known  results  of  the  Coulomb  system  are  shown  to  be  elegantly  derivable  from  such 
a  connection. 
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1.  Introduction 

The  purpose  of  this  article  is  to  show  how  the  quantum  mechanical  Green's  function  for 
the  Coulomb  potential  can  be  obtained  from  a  knowledge  of  the  Feynman  propagator 
for  the  harmonic  oscillator.  The  key  ingredient  in  the  analysis  is  a  remarkably  simple 
relation  between  the  radial  parts  of  the  Green's  functions  for  the  oscillator  and  the 
Coulomb  problems.  This  is  an  example  of  a  general  relation  between  the  Green's 
functions  for  different  potentials  [1].  That  there  exists  a  connection  between  the 
oscillator  and  Coulomb  systems  is  something  known  for  a  long  time.  What  is  more 
recent  is  the  realization  that  this  connection  takes  the  most  succinct  and  useful  form 
when  expressed  in  terms  of  the  radial  Green's  function.  It  enables  one  to  extract  all  the 
results  pertaining  to  one  physical  system  when  the  Green's  function  for  a  related  system 
is  known. 

It  is  somewhat  surprising  that  the  Greeii's  function  for  the  Schrodinger  equation 
with  the  Coulomb  potential  was  not  known  completely  until  the  sixties,  given  the  fact 
that  the  exact  treatment  of  the  hydrogen  atom  was  one  of  the  early  triumphs  of 
quantum  mechanics.  The  first  general  expression  for  the  Coulomb  Green's  function 
appears  to  be  a  double  integral  representation  due  to  Wichmann  and  Woo  [2]. 
A  one-parameter  integral  for  the  Green's  function  in  momentum  space  was  derived  by 
Schwinger  [3].  It  was  Hostler  [4, 5]  who  obtained  for  the  first  time  a  closed  form 
expression  in  coordinate  space.  Starting  with  the  bilinear  expansion  of  the  Green's 
function  in  terms  of  energy  eigenfunctions,  he  carried  out  the  summation  over  the 
bound  states  and  the  integration  over  the  continuum,  and  expressed  the  resulting 
one-parameter  integral  as  a  sum  of  products  of  Whittaker  functions  of  different 
arguments.  More  recently,  several  authors  [6-8]  have  computed  the  Coulomb  Green's 
function  in  the  context  of  solving  the  hydrogen  atom  problem  by  path  integration. 

The  starting  point  of  our  method  is  the  time-dependent  propagator  for  the  three- 
dimensional  isotropic  harmonic  oscillator.  By  Fourier-transforming  the  radial  part  of 
the  propagator,  we  derive  first  an  expression  for  the  radial  part  of  the  Green's  function. 
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Invoking  the  relation  referred  to  earlier,  we  go  from  the  oscillator  radial  Green's 
function  to  the  Coulomb  radial  Green's  function.  Once  the  latter  is  known,  all  the 
results  of  the  Coulomb  problem  can  be  deduced:  the  discrete  spectrum  and  the 
normalized  eigenfunctions,  the  continuum  and  its  ^-normalized  eigenfunctions,  as  well 
as  the  completeness  property  of  the  eigenfunctions.  By  attaching  spherical  harmonics 
to  the  radial  Green's  function  appropriately,  and  summing  the  resulting  partial  wave 
series,  we  obtain  an  expression  for  the  complete  Coulomb  Green's  function  as  an 
integral  over  a  single  variable. 

The  main  advantage  of  our  method  is  that  there  is  no  need  to  deal  with  the 
continuum  and  discrete  states  of  the  Coulomb  potential  separately.  This  is  because  we 
consider  the  partial  wave  expansion  of  the  Green's  function,  rather  than  its  eigenfunc- 
tion  expansion,  and  focus  attention  on  the  radial  part  which  carries  all  the  essential 
information,  and  contains  so  to  speak  both  the  discrete  and  continuum  states.  The 
angular  degrees  of  freedom  can  thus  be  decoupled  at  the  beginning  and  brought  back  at 
the  end.  Further,  our  method  is  simple  and  transparent,  and  leads  to  the  final 
expression  for  the  Green's  function  with  much  less  labour. 

2.  Radial  Green's  function  for  the  harmonic  oscillator 

The  time-dependent  propagator  for  a  particle  of  unit  mass  moving  from  a  position  r'  at 
time  t'  to  a  position  r  at  time  t  in  a  potential  V  satisfies  the  equation  (h  =  1) 

0-  (1) 

If  the  potential  is  time-independent  the  propagator  depends  on  time  only  through 
T  =  r  —  t',  and  if  V  is  spherically  symmetric,  then  it  has  also  the  partial  wave  expansion 
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The  function  K:  (r,  r',  T)  defined  by  (2)  is  the  radial  propagator.  For  the  isotropic 
harmonic  oscillator  with  V  =  -|co2r2,  the  radial  propagator  is  known  to  be  given  by  the 
expression  [9] 


sino)T 


1+1/2 


tan' 


(3) 


where  Jl+1/2  is  the  Bessel  function  of  the  first  kind. 

The  Green's  function  G(r,  r',£)  which  is  a  function  of  the  energy  E  is  related  to  the 
propagator  by  a  Fourier  transformation.  For  a  central  potential  the  angular  depend- 
ence of  G  can  be  separated  out  exactly  as  in  (2),  with  the  radial  part  0,(r,  r',  E)  replacing 
Kt  on  the  rhs.  For  gl  we  adopt  the  following  definition: 


9l(r,r',E)  =  i       dTexp(i£T) *,(?•,/•',  T).  (4) 

Jo 

In  (4)  £  is  to  be  understood  as  £  -1-  id  whenever  necessary,  together  with  the  limit  5  ->  0  4- . 


Our  objective  now  is  to  obtain  an  expression  for  the  oscillator  Green's  function 
sc  using  (3)  and  (4).  To  this  end  we  consider  the  integral  [10] 
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This  integral  exists  provided  the  constants  a,/?,  /I  and  p.  satisfy  the  following  two 
conditions: 
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where  M  and  W  are  the  Whittaker  functions  defined  by  the  relations 

r),  (8) 


In  (8)  <X>(a,  c,  z)  is  the  confluent  hypergeometric  function  which  is  regular  at  the  origin. 
We  now  change  the  variable  in  (5)  by  writing  sinh  x  =  cosech  t.  This  transforms  the 
integral  into 


/=         dt  exp  (2At)  cosech  t  exp  [  —  /J  coth  f]  J2^  (a  cosech  t).  (10) 


The  two  conditions  given  in  (6)  ensure  the  convergence  of  the  integral  at  the  upper  and 
lower  limits  respectively.  Further,  the  first  condition  also  ensures  that  the  integrand  in 
(10)  [which  we  denote  by/(t)]  vanishes  exponentially  as  1  1  1  ->  oo  in  the  first  quadrant  of 
the  complex  t  plane.  Let  us  consider  now  a  quarter  circle  of  radius  R  in  the  first 
quadrant  centred  at  the  origin  and  bounded  by  the  axes.  Within  this  closed  contour  the 
function/(r)  has  no  singularities.  Therefore  an  application  of  Cauchy's  theorem  yields, 
in  the  limit  R-+  oo,  the  result 


Using  this  in  (10)  and  writing  t  =  it,  we  get 

r^ 

I  =  i\     dTexp(2£AT)cosecTexp(iJ&cotT)J2/((  —  iacosect).  (11) 

Jo 

If  we  make  the  replacements 

T  =  coT,     2Aco  =  E,    j8  =  ^co(r2  +  r'2),    a  =  icorr',    2ju  =  /  +  4  (12) 

in  (11),  the  result  is 
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It  follows  from  (3),  (4)  and  (13)  that  the  oscillator  radial  Green's  function  is  given  by 

(14) 

Using  (7)  to  replace  /  on  the  rhs  of  (14),  we  finally  arrive  at  the  explicit  expression 
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The  above  form  holds  for  r  >  r';  for  r  <  r'  the  arguments  of  the  two  Whittaker  functions 
must  be  interchanged.  In  obtaining  (15),  a  phase  factor  (—  i)l+3/2  has  been  absorbed,  by 
making  use  of  the  reflection  property 


We  note  that  whenever  the  argument  of  the  gamma  function  in  the  numerator  of  (15) 
is  a  non-positive  integer  —  nr,  gfsc  has  a  pole  in  the  energy  plane  at  E  =  (2nr  +  I  +  3/2) ox 
These  are  the  allowed  energies  of  the  oscillator  for  a  fixed  value  of/.  There  are  no  other 
singularities  because  MA  M  and  W^  are  entire  functions  of  L  This  follows  from  the 
definitions  (8)  and  (9)  and  the  fact  that  <£(a,  c,  z)  is  an  entire  function  of  a. 

3.  Coulomb  radial  Green's  function 

The  energy-dependent  Green's  function  associated  with  the  Coulomb  potential 
V(r)  =  —e2/r  (hydrogen  atom)  can  be  determined  from  a  knowledge  of  g°sc.  We  have 
shown  in  an  earlier  work  [1]  that  the  radial  part  of  the  Coulomb"  Green's  function  is 
related  in  a  remarkably  simple  manner  to  the  oscillator  radial  Green's  function.  If  we 
denote  by  gfc(r,r',£,co)  the  latter,  corresponding  to  an  oscillator  of  frequency  co  (in- 
dicated explicitly  now),  energy  e  and  angular  momentum  L,  the  precise  connection 
between  the  two  is 


gfov(r,  r',  E)  =  2(rr')1'4ffi,  ll2(Jr,  Jr',  e  =  4e2,  co  =  J-  8E).  (17) 

In  other  words,  the  Coulomb  radial  Green's  function  for  a  given  /  and  energy  E  is 
determined  by  the  radial  Green's  function  of  an  oscillator  of  frequency  ^/—  8£,  energy 
4e2  and  angular  momentum  continued  to  the  value  21  +  1/2.  Inserting  (  1  5)  into  (  1  7),  we 
obtain  the  expression 


,  r',  E)  =  M^fror')  W^(cor),  (18a) 


with 

co  =  v/-8£,    A  =  2e2/o>,    /*  =  /  +  £.  (18b) 

Using  (18)  we  can  determine  the  complete  Green's  function  for  the  Coulomb 
problem.  Deferring  this  to  a  later  section,  we  now  consider  how  the  Coulomb  wave 
functions  can  be  deduced  from  gfou  which  will  be  simply  called  gt  hereafter  for 
convenience. 


The  Coulomb  Green's  function 
Energy  eigenfunctions  of  the  hydrogen  atom 

e  singularities  of  the  function  gl  in  the  £  plane  constitute  the  spectrum  of  allowed 
;rgies  of  the  hydrogen  atom.  We  see  from  (17)  that  gt  has  simple  poles  when 


ese  correspond  to  bound  states  with  energies  (cf.  18b) 

£  =  £„=-  e*/2(nr  +  I  +  I)2  =  -  **/2n2,  (19) 

ere  n  =  nr  +  I  +  1  is  the  principal  quantum  number. 

Hie  bound  state  wave  functions  are  given  by  the  residue  of  0,  at  its  poles.  If  we  denote 

unl  the  reduced  radial  wave  function,  we  have 

,r',£)]£=£n=  lim  [(£„  -  £)&]  =«»un,(r').  (20) 


the  pole  co  =  con  =  2/no,  where  a  =  e'2  is  the  Bohr  radius,  and 

MrKi(r')=   lim     -(co2-  0)^)0,  .  (21) 

<o-"n|_5       f  J 

;o,  1=  —  n  in  the  limit;  for  this  value  of  /  the  Whittaker  functions  MA/  +  1/2  and 
.,1+1/2  are  both  expressible  in  terms  of  Laguerre  polynomials  in  a  standard  way. 
straightforward  evaluation  of  the  above  limit  yields  the  wave  function 


nay  be  noted  that  the  wave  function  comes  out  with  the  correct  normalization  factor. 
e  reason  for  this  is  to  be  traced  ultimately  to  the  normalization  of  the  propagator 
which  is  specified  by  the  inhomogeneous  eq.  (1). 

[n  addition  to  the  poles  g,  has  a  branch  point  at  E  =  0.  To  see  this  we  must  consider 
:  behaviour  of  the  Green's  function  near  co  =  0.  First,  we  note  that  the  Whittaker 
iction  MA/  +  1/2  for  A  =  2e2/a>  has  the  following  behaviour  as  co  ->  0: 


.is  may  be  readily  deduced  from  (8)  on  substituting  the  basic  series  representation  of 
:  confluent  hypergeometric  function  $.  Next  we  make  use  of  an  explicit  series 
pansion  of  the  Whittaker  function  Witlt  which  is  valid  when  2ju  +  1  is  a  positive 
eger  [11].  This,  coupled  with  the  asymptotic  expansion  of  F(z  +  a),  shows  that  for 
iall  to 


mce  the  behaviour  of  the  Green's  function  is  given  by 

gi(r,r>9  E)  ~  C0(r,r')  +  C^r^co  +  0(co2).  (23) 

lis  result  implies  that  g,  has  a  square-root  branch  point  in  the  energy  plane  at  the 
igin.  The  associated  branch  cut  can  be  taken  to  run  along  the  positive  real  axis.  This 
rresnonds  to  choosing  the  nrincioal  value  for  the  argument  of  £:  —  n  <  are  £  ^  n. 
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The  occurrence  of  the  branch  point  singularity  is  indicative  of  a  continuum  of 
positive  energy  eigenvalues  for  the  Coulomb  potential.  The  corresponding  eigenfunc- 
tions are  determined  by  the  discontinuity  of  gl  across  the  branch  out.  Denoting  by 
vt(r,E)  the  continuum  eigenfunction  belonging  to  some  £  >  0,  we  have  the  relation 

disc 9l(r, r', E)  =  lim  [0,(r, r', E  +  i6)  - gt(r, r', E  -  iS)~\ 

=  2mvl(r,E)v*(r',E).  (24) 

Since  for  £>0,  <z>(E±  10)  =  +  i\co\,  we  introduce  the  asymptotic  wave  number 
k  =  ^/2E  and  write  co(E  +  io)  =  +  2ik,  k>0.  Then  the  discontinuity  can  be  written  as 

-ik(2l+l)!discgl  =  r(-X  +  n  +  %)M^u(-2ikrf)  W^^(-2ikr] 

(25) 


where  A'  =  ie2/k  and  fi  =  l  +  1/2.  In  order  to  identify  the  eigenfunction  vlt  we  must  cast 
the  rhs  of  (25)  in  a  factorized  form.  To  this  end,  we  first  replace  the  W  functions  in  (25)  in 
terms  of  the  M's,  using  the  relation  (9).  The  resulting  expression  is  rewritten  by  invoking 
the  reflection  property  (16)  of  the  M  function,  and  then  simplified  by  using  the  standard 
property  r(z)F(l  -  z)  =  n  cosec nz  of  the  gamma  function.  The  entire  calculation  is 
best  done  keeping  \L  arbitrary,  and  putting  \JL  =  /  4- 1/2  only  at  the  end.  The  result  is  the 
product  form 


(26) 

From  (26),  (24)  and  (8),  we  get  (apart  from  an  over  all  phase  factor)  the  continuum  wave 
function: 

r(£  +  /  +  i) 

4-1J!          \k 

* ,  ie* 
x<D|— +  /+l,2/  +  2,2ikr).  (27) 

This  is  the  correct  reduced  radial  wave  function  which  is  normalized  according  to 


E').  (28) 

v     ' 

5.  Completeness  of  the  Coulomb  eigenfunctions 

The  important  property  of  completeness  possessed  by  the  Coulomb  eigenfunctions  can 
be  deduced  from  the  Green's  function  gl  as  a  direct  consequence  of  its  analytic 
properties  in  the  energy  plane.  We  present  here  an  explicit  demonstration. 

For  the  present  purpose  it  is  more  advantageous  to  consider  gt  as  a  function  of 
co  rather  than  of  E.  In  the  w  plane  the  only  singularities  are  simple  poles  located  on  the 
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E  plane,  there  is  no  singularity  at  co  =  0,  as  shown  by  (23).  As  co  -»  oo  in  the  half  plane 
Reco  >  0,  the  Whittaker  functions  have  the  asymptotic  behaviour 

(2/+  1)L.     /I      A      „,  /     1 


(^) 


•exp    -cor   |,     »rl.I+1/2-*exp    --cor  . 

,     2 

Therefore,  the  asymptotic  form  of  gl  in  the  right  half  plane  is 
2 


ar^L--a>(r-r') 

Consider  now  a  contour  in  the  co  plane  made  up  of  the  imaginary  axis  from  co  =  iR  to 
co  =  —  iR  and  a  semicircle  CR  lying  in  the  right  half  plane  with  centre  at  the  origin  and 
radius  R  >  2e2/(l  +1).  All  the  poles  of  gt  lie  within  the  contour.  Integrating  o)gl  around 
the  contour,  we  have 

/»  /» •« 

co^(co)dco+  co#,(co)dco  =  27n£  (residues  of  cogf,).  (29) 

J  CR  J  iR 

The  rhs  can  be  immediately  written  down  from  (21): 

00 

y  residues  =     Y    FCco 


=  4    £    "»">')•  (30) 

»  =  /+! 

Evaluating  the  Ihs  in  the  limit  R->  oo,  the  integral  over  the  semicircle  becomes 


dco 

CK 


r      r  i 

2  exp    --co(r-r') 
Jc«          L     z 

?*  iR  r"      1  — i 

=  lim  2  exp    —  -co(r  —  r'}    dco 

R-*oo       J   -iR  L       2  J 


r').  (31) 

As  for  the  other  integral  on  the  Ihs,  we  can  write  it  as 

~iR  ro  r-iR 

a)0j(co)dco=         co6f,(z|co|)dcL)+  co^(—  i|co|)dco. 

iR  J  iR  Jo 

In   view   of  the   relation   co  =  *J  —  8£,   we   can   identify,   as   was    done   earlier, 
#i(±  1  1  co  |)  =  ^(Zi1  +  to)  with  E  >  0.  Thus  we  have 


io). 

0 

Now,  using  (24)  and  proceeding  to  the  limit  R  -»•  co,  we  get 

(32) 


-ioo 


O 
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Putting  (30),  (31)  and  (32)  in  (29),  we  have  the  completeness  relation 

,E)v*(r',E)  =  d(r-r').  (33) 


0 

It  may  be  noted  that  the  above  relation  was  also  demonstrated  by  Mukunda  [12] 
starting  from  the  explicit  expressions  for  the  Coulomb  wave  functions. 

6.  The  complete  Green's  function 

The  function  gl  that  we  have  considered  so  far  is  the  radial  part  of  the  complete  Green's 
function  G(r,r',£),  the  connection  between  the  two  functions  being  given  by  the 
expansion 


G(r,r',E)  = 
Making  use  of  the  relation 


where  y  is  the  angle  between  the  two  vectors  r  and  r'  and  the  explicit  form  (18)  of  #„  we 
obtain  the  expression 


r).  (35) 

Now  the  use  of  the  formula  [10] 


exp    -- 


(coth 

\ 


^ 


which  holds  when  Re(l/2  +  p.  -  /.)  >  0,  Re  t  >  0  and  a  >  b  >  0,  enables  us  to  rewrite 
(35)  as 


G  = 


i 


2n*/rr' 


dxexp 


/          X\2A  r- 

X  VC°th2  J    ^i+i^jrr'smhx).  (36) 


Interchanging  the  order  of  the  summation  and  the  integration  and  applying  the 
identity 


2     V      2; 
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The  Coulomb  Green's  function 
the  summation  in  (36)  can  be  performed.  The  result  is 


co 
G=  — 

471 


dxexp  \  -- 

o          L   2 


r')coshx      coth-      sinhx 

V 


x  /Ol  co^/rr'  cos  -sinhx  I. 


(37) 


By  the  change  of  variable  cosech  t  =  sinh  x,  (37)  can  be  transformed  into 

cosech2  1 


=  —        diexp    --< 
47rJo  L     2 


x  70I  co^/rr'  cos-  cosech  M. 

In  this  expression,  co  =  ^/—  8E  and  1  =  2e2/co. 
Alternatively,  we  may  proceed  as  follows.  Starting  with 


(38) 


=  Z 


21+1 


we  use  (17),  (14)  and  (13)  to  get 


\2l+l 


dTexp(i4e2T) 


(39) 


Using  now  the  identity 


we  obtain  a  different  representation 


, 

4ie 


G  =  —        dTexp 

4ni 


-zJ0    zcos-   , 
2       \ 


•ico(r  +  r')cota>T 


x  cosec2coTJ0  I  co^/rr'  cos~cosec  coT  J. 


(40) 


The-above  expression  for  the  Coulomb  Green's  function  is  in  agreement  with  those  of 
Hostler  [5]  and  Ho  and  Inomata  [7]  (modulo  overall  constant  factors  arising  out  of 
different  definitions).  At  this  point  one  can  follow  Hostler's  steps  to  carry  out  the 
integration  and  express  G  in  closed  form  in  terms  of  Whittaker  functions. 

The  logical  end  to  the  above  analysis  would  be  to  compute  the  Coulomb  propa- 
gator in  closed  form  by  Fourier-inverting  the  function  G.  This  no  one  knows  how  to  do 
as  yet. 
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Abstract.  An  axially  symmetric  exact  solution  of  the  Einstein-Maxwell  equations  is  obtained 
and  is  interpreted  to  give  the  gravitational  and  electromagnetic  fields  of  a  charged  tachyon. 
Switching  off  the  charge  parameter  yields  the  solution  for  the  uncharged  tachyon  which  was 
earlier  obtained  by  Vaidya.  The  null  surfaces  for  the  charged  tachyon  are  discussed. 
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1.  Introduction 

The  tardyons,  the  luxons,  and  the  tachyons  are  the  names  given  to  particles  which 
move,  respectively,  with  velocities  smaller  than,  equal  to,  and  greater  than  the  speed  of 
light  in  vacuum  [1-2].  Among  these  three  classes  of  objects,  tachyons  have  remained 
undetected.  However,  the  subject  of  the  superluminary  objects  have  been  fascinating 
many  physicists'  minds.  Lucretius  (50  B.C.)  was  probably  the  first  scientist  who 
mentioned  such  objects  [3].  Few  years  before  the  special  theory  of  relativity  was  given, 
Thomson,  Heavyside,  and  Sommerfeld  had  investigated  the  questions  arising  from  the 
assumptions  that  the  superluminary  objects  exist  in  the  nature  [4].  However,  with  the 
outset  of  the  special  theory  of  relativity  (STR),  due  to  a  misinterpretation  by  Einstein 
himself,  a  wrong  idea  prevailed  that  the  non-existence  of  tachyons  is  a  direct  conse- 
quence of  the  STR  [1,4].  To  this  end,  Bilaumik  et  al  (BDS)  [4]  reexamined  this  subject 
and  found  that  it  is  rather  the  special  theory  of  relativity  that  suggests  a  possibility  for 
existence  of  superluminary  objects.  In  order  to  have  the  energy  and  momentum  (which 
are  measurable  quantities)  real,  BDS  [4]  hypothesized  the  tachyons  to  have  their 
proper  masses  imaginary.  Likewise,  the  proper  lengths  and  the  proper  times  of 
tachyons  are  also  imaginary  quantities.  The  velocity  of  a  tachyon  increases  on  the  loss 
of  its  energy. 

The  causality  problems  (raised  by  Tolman,  Schmidt,  and  Terletski)  due  to  the 
assumption  that  the  tachyons  exist  were  resolved  by  Sudarshan  and  his  co-workers 
[2, -3].  BDS  [4]  suggested  that  the  Cerenkov  effect  is  likely  to  be  an  avenue  for  the 
detection  of  the  faster- than-light  objects  if  they  carry  electric  charge.  Many  experimen- 
talists [5-8]  have  put  considerable  effort  to  detect  tachyons.  Their  experiments  could 
not  confirm  the  existence  of  tachyons.  However,  inspired  by  the  Gell-Mann's  totalitar- 
ian principle  (which  states  that  in  physics  anything  which  is  not  forbidden  is  compul- 
sory) [2, 3],  the  subject  of  tachyons  continued  to  be  of  interest  to  many  researchers. 
Recami  [3]  presented  a  review  on  this  subject. 
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Nariikar  and  Sudarshan  [9]  studied  the  propagation  of  tachyons  in  an  expanding 
universe.  They  showed  that  the  pre-mordial  tachyons  in  a  big-bang  universe  could  not 
survive  unless  it  had  very  large  energy  in  the  beginning.  The  trajectories  of  tachyons  in 
the  Schwarzschild  background  were  studied  by  Nariikar  and  Dhurandhar  [10].  These 
investigations  were  further  extended  by  Dhurandhar  [11]  in  the  Kerr  geometry. 
Vaidya  [12]  obtained  a  static,  axially  symmetric  vacuum  solution  of  the  Einstein 
equations  and  interpreted  it  as  describing  the  gravitational  field  of  a  tachyon.  He  found 
that  there  is  a  gravitational  repulsion  between  a  tachyon  and  a  tardyon.  Raychaudhuri 
[13]  showed  that  there  is  an  attraction  between  tachyons  themselves.  We  [14]  obtained 
the  gravitational  field  of  a  tachyon  (uncharged)  in  a  de  Sitter  universe.  The  electromag- 
netic interactions  between  a  charged  tachyon  and  a  charged  tardyon,  and  between 
charged  tachyons  themselves  have  not  been  investigated.  Therefore,  it  is  of  interest  to 
obtain  the  gravitational  as  well  as  the  electromagnetic  fields  of  a  charged  tachyon.  In 
the  present  paper,  we  obtain  an  axially  symmetric  exact  solution  of  the  Einstein- 
Maxwell  equations  and  interpret  it  to  give  the  gravitational  and  electromagnetic  fields 
of  a  charged  tachyon.  We  use  the  geometrized  units  (G  =  l,c  =  1)  and  follow  the 
convention  that  the  Latin  indices  run  from  0  to  3. 

2.  Einstein-Maxwell  equations 

The  Einstein-Maxwell  equations  are 

where 

1 
£fc=  —  [-F.  Fkm  +  ±qkF    Fm"l  (2) 

i        A  fff  fw  4-  *y  i     mti          — i  *  ^    J 


(3) 

(4) 

Rk  is  the  Ricci  tensor.  Tk  and  Ek  are  the  energy-momentum  tensors  due  to  the  matter 
and  the  electromagnetic  field,  respectively.  J'  stands  for  the  electric  current  density 
vector. 

3.  Solution  for  charged  tachyon 

By  transforming  the  Reissner-Nordstrom  solution,  we  have  obtained  a  static  and 
axially  symmetric  exact  solution  of  the  Einstein-Maxwell  equations  which  is  given  by 
the  line  element, 

dr2  =  Bdt2  - B~ldp2  -- 9- ^(d02  +  sin20d</>2),  (5) 

(1  —  - 

where 


The  non-vanishing  components  of  the  energy-momentum  tensor  of  the  electromagnetic 
field  are 


2\2 


The  current  density  vector  and  the  energy-momentum  tensor  of  matter  are  given  by 
j'  =  0,     T?  =  0.  (9) 

We  now  proceed  to  show  that  the  above  solution  gives  the  gravitational  and  elec- 
tromagnetic fields  of  a  charged  tachyon. 

Using  the  retarded  time  coordinate  u  given  by 


u  =  t-     lB(p)Yldp  (10) 

J 

in  place  of  t,  the  above  line  element  can  be  written  as 


dr2  =  Bdu2  +  Idudp  --  -  -  ^  (d92  +  sin2  9  d^2  )  (11) 

(1  —  vcosB) 

and  the  non-vanishing  component  of  the  electromagnetic  field  tensor  is  given  by 


Now  by  a  coordinate  transformation  p  going  to  r,  where  p  =  r(l  —vcos  9),  the  line 
element  is 


dt2  =  Ddw2  +  2dud[r(l  -  ucos  0)]  -  r2(d02  +  sin20  d$2),  (13) 

where 


22 


r(l-ucos0)     r2(l-t;cos0)2 
and  the  surviving  components  of  the  electromagnetic  field  tensor  are 


r2(l-ucos0) 

r  Q(l-^2)^sin0 

9u  ' 


For  |  u|  <  1,  the  solution  can  be  transformed  to  the  Reissner-Nordstrom  solution  given 
by  the  line  element, 

dr2  =  B'du'2  +  2du'dr'  -  r'2(d6'2  +  sin26'd(f)2\  (16) 
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where 
and 


2 
and  the  non-vanishing  component  of  the  electromagnetic  field  tensor 


by  the  coordinate  transformations 
u'  = 


Y     — 
' 


(20) 

We  obtained  the  axially  symmetric  solution  by  following  the  reverse  process  given  here, 
i.e.,  we  began  with  the  Reissner-Nordstrom  solution  and  obtained  the  axially  symmet- 
ric solution  by  a  complex  transformation  (|y|>l).  However, .for  convenience  in 
presentation  we  have  first  written  the  axially  symmetric  solution  (the  line  element  in 
diagonal  form)  and  then  have  shown  that  it  transforms  to  the  Reissner-Nordstrom 
solution  for  |u|  <  1.  The  coordinate  transformations  expressed  by  (20)  show  that  the 
origin  of  the  frame  S'  moves  with  respect  to  the  frame  5  with  a  uniform  velocity  v  along 
the  common  Z-axis  in  the  flat  spacetime  background  [12].  The  solution  of  the 
Einstein-Maxwell  equations  obtained  by  us  has  the  following  characteristics:  (a)  the 
gravitational  field  given  by  the  line  element  (5)  has  all  the  geometrical  characteristics  of 
the  Reissner-Nordstrom  field,  (b)  the  solution  is  axially  symmetric  which  is  required  as 
there  cannot  be  a  tardyonic  observer  which  can  find  a  tachyon  at  rest,  and  (c)  it  contains 
a  velocity  parameter  u,  which  is  not  possible  to  be  transformed  away  for  |  v  \  >  1 .  However, 
for  |y|  <  1,  the  solution  transforms  to  the  Reissner-Nordstrom  solution.  Therefore,  we 
interpret  the  solution  to  give  the  gravitational  and  the  electromagnetic  fields  of  a  non- 
rotating  charged  tachyon.  Switching  off  the  charge  parameter  Q,  one  gets  the  solution 
for  the  uncharged  (non-rotating)  tachyon  which  was  obtained  by  Vaidya  [12]. 

4.  Discussion 

The  solution  given  by  us  is  singular  at  r  =  0  and  sec  B  =  v.  The  latter  gives  a  right 
circular  cone  of  semi- vertical  angle  arcsec(t;)  as  the  singularity  surface.  Again  the  null 
surfaces  are  given  by  the  equation 

(v2  -  \}p2  -mp-  Q2(v2  -  I)2  =  0.  (21) 

This  equation  has  two  roots  given  by 

P  =  P±,  (22) 
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where 

_  m  +  ^/m2  +  4(v2  -  1)3<22 
P+ =  ^772 r\ ' 


m  — 


Using  p  =  r(l-ucos0),  r  =  (x2  +  y2  +  z2)1/2,  and  rcos^  =  z,  the  equations  of  null 
surfaces  given  by  (22)  can  be  written  as 

/    2  VP±       V  2  2\_       P± 

z\/v      1  "f     /          I      (^  +  y  ) =  2  ~  1  (24) 


which  give  two  hyperboloids  of  revolution  of  two  sheets  in  the  three  (spatial)  space 
corresponding  to  p  equals  to  p  +  and  p  equals  to  p  _ .  It  is  clear  from  (22)-(23)  that  the 
null  surfaces  for  the  charged  tachyon  are  different  from  that  of  the  uncharged  tachyon. 
It  is  of  interest  to  study  the  trajectories  of  the  charged  tardyons  in  the  field  of  a  charged 
tachyon. 

After  the  completion  of  this  work,  a  paper  [15]  came  to  our  notice  wherein  the 
authors,  using  the  complex  tetrads,  obtained  a  general  Kerr-Schild  class  of  solutions. 
They  mentioned  that  a  particular  solution  gives  the  charged  rotating  tachyon.  The 
solution  given  by  us  has  a  clear  physical  interpretation.  Dadhich  brought  to  our  notice 
his  paper  on  charged  tachyon  [16].  Our  method  of  obtaining  the  gravitational  and 
electromagnetic  fields  of  a  charged  tachyon  is  different  from  his. 
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Abstract.  We  give  a  non-static  exact  solution  of  the  Einstein-Maxwell  equations  (with  null 
fluid),  which  is  a  non-static  magnetic  charge  generalization  to  the  Bonnor-Vaidya  solution  and 
describes  the  gravitational  and  electromagnetic  fields  of  a  non-rotating  massive  radiating  dyon. 
In  addition,  using  the  energy-momentum  pseudotensors  of  Einstein  and  Landau  and  Lifshitz  we 
obtain  the  energy,  momentum,  and  power  output  of  the  radiating  dyon  and  find  that  both 
prescriptions  give  the  same  result. 

Keywords.     Dyon;  Einstein-Maxwell  equations;  energy-momentum  pseudotensor. 
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There  has  been  considerable  interest  in  obtaining  non-static  solutions  of  Einstein's 
equations  describing  the  gravitational  field  of  a  star  radiating  null  fluid  [1-3].  In  1953, 
Vaidya  [2]  obtained  a  nice  form  of  non-static  generalization  to  the  Schwarzschild 
solution  and  it  became  well-known  in  the  literature  after  the  discovery  of  quasars.  The 
quasars  are  high  energy  sources  and  therefore  their  gravitational  field  cannot  be 
described  by  the  Schwarzschild  metric  while  the  Vaidya  metric  is  relevant  to  the  study 
of  such  objects.  Vaidya  and  others  gave  non-static  generalizations  to  the  Kernsolution 
[3].  Bonnor  and  Vaidya  [4]  obtained  a  non-static  generalization  to  the  Reissner- 
Nordstrom  (RN)  solution  describing  the  emission  of  charged  null  fluid  from  a  spheri- 
cally symmetric  charged  radiating  body.  Mallett  [5]  gave  an  exact  solution  describing 
the  radiating  Vaidya  metric  [1]  in  a  de  Sitter  universe.  Patino  and  Rago  [6]  obtained 
a  solution  of  the  Einstein-Maxwell  (EM)  equations  with  null  fluid  for  a  spherically 
symmetric  radiating  massive  charged  (electric)  object  in  a  de  Sitter  universe. 

The  existence  of  magnetic  monopoles  is  not  yet  confirmed,  but  it  has  been  a  subject  of 
interest  of  many  physicists  (see  in  ref.  7).  The  Bonnor-Vaidya  solution  [4]  is  not 
enriched  with  a  magnetic  charge  parameter.  Therefore,  it  is  of  interest  to  obtain 
a  non-static  magnetic  charge  generalization  to  the  Bonnor-Vaidya  solution,  character- 
ized by  three  time-dependent  parameters:  mass,  electric  charge,  and  magnetic  charge. 
Further  we  calculate  the  energy,  momentum  and  power  output  of  the  radiating  dyon  in 
prescriptions  of  Einstein  as  well  as  Landau  and  Lifshitz  (LL).  Throughout  this  paper  we 
use  geometrized  units  where  the  gravitational  constant  G  =  1  and  the  speed  of  lighUn 
vacuum  c  =  1.  We  follow  the  convention  that  Latin  indices  take  values  from  0  to  3  (x°  is 
the  time  coordinate)  and  Greek  indices  take  values  from  1  to  3. 

The  EM  equations  with  null  fluid  present  are  [4] 


-9 

where 

1   r  1  1 

(4) 


_nkp      pm» 

4  '   m" 


is  the  energy-momentum  tensor  of  the  electromagnetic  field  and 

A^=F{Kfc  (5) 

is  the  energy-momentum  tensor  of  the  null  fluid.  Vk  is  the  null  fluid  current  vector 
satisfying 

gikVlVk  =  0.  (6) 

*Ffc,  the  dual  of  the  electromagnetic  field  tensor  Fik,  is  given  by 

*F*  =  —  L=  £*'"•*•..  (7) 

21  ifn 

- 


gtfcim  -IS  tke  Levi_civita  tensor  density.  R*  is  the  Ricci  tensor.  Jie)(J('m))  stands  for  the 
electric  (magnetic)  current  density  vector. 

An  exact  solution  of  the  above  equations  describing  the  gravitational  and  elec- 
tromagnetic fields  of  a  non-rotating  radiating  dyon  is  given,  in  coordinates 
x°  =  u,  x1  =  r,  x2  =  Q,  x3  =  0,  by  the  metric, 

ds2  =  Bdu2  +  2dwdr  -  r2(d02  +  sin20d<£2),  (8) 

where 

qe(2  2 

+ 


and  the  non-zero  components  of  the  electromagnetic  field  tensor, 
_>(") 


(10) 

e("X  and  qm(u)  are  mass,  electric  and  magnetic  charge  parameters,  respectively. 
These  parameters  depend  on  the  retarded  time  coordinate  u. 

The  surviving  .components  of  the  Einstein  tensor,  Gf  =  jRf-£0fjR,  the  energy- 
momentum  tensor  of  the  electromagnetic  field,  Ek{,  and  the  energy-momentum  tensor  of 
the  null  fluid,  Nf  ,  are 


G  =  K,  (11) 
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where 

X2  =  2(gege  +  ym-    ^  (U) 

The  dot  denotes  the  derivative  with  respect  to  the  retarded  time  coordinate  u.  The  null 
fluid  current  vector,  the  electric  and  magnetic  current  density  vectors  are 

•^ 


A-  (16) 

The  radiating  dyon  solution  given  by  us  yields:  (a)  the  Bonnor-Vaidya  solution  when 
qm  =  0,  (b)  the  Vaidya  radiating  star  solution  [2]  when  qe  =  qm  =  0,  (c)  the  Reissner- 
Nordstrom  solution  when  qm  =  0  and  qe  and  M  are  constants,  (d)  the  Schwarzschild 
solution  when  q6  =  qm  =  0  and  M  is  constant  and  (e)  the  static  dyon  solution  [8]  when 
M,  <je,  and  qm  are  constants. 

Using  the  Tolman  pseudotensor,  Vaidya  [9]  calculated  the  total  energy  of  a  spheri- 
cally symmetric  radiating  star  and  got  E  =  M.  Further  Lindquist  et  al  (LSM)  [10], 
using  the  LL  pseudotensor,  obtained  the  energy,  momentum,  and  power  output  for  the 
Vaidya  spacetime  and  found  that  the  total  energy  and  momentum  components  are 
pl  —  M;  0, 0, 0  and  the  power  output  is  —  dM/dw.  One  of  the  present  authors  [1 1],  using 
several  energy-momentum  pseudotensors,  calculated  the  energy  and  momentum 
components  for  the  Vaidya  metric  and  found  the  same  result  as  obtained  by  LSM.  Now 
we  obtain  the  energy,  momentum  and  power  output  for  the  radiating  dyon  in 
prescriptions  of  Einstein  as  well  as  LL  and  show  that  both  give  the  same  result.  The 
energy-momentum  pseudotensors  of  Einstein  [12]  and  LL  [13]  are 

1    ..,,  (1?) 


where 
and 
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They  satisfy  the  local  conservation  laws: 


dx 
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,  (21) 


=  0.  (22) 


©o  is  the  energy  density.  0°  and  0£  are  the  momentum  and  energy  current  density 
components,  respectively.  L°°  and  La°  give,  respectively,  the  energy  density  and 
momentum  (energy  current)  density  components  in  LL  prescription.  The  energy  and 
momentum  components  in  the  prescription  of  Einstein  are 


0°dx1dx2dx3,  (23) 

whereas  in  the  LL  prescription  are 

Pl=  f  f  I  L^dx'dxMx3.  (24) 


i  =  0  gives  the  energy  and  i  =  1, 2, 3  give  the  momentum  components.  One  knows  that 
the  energy-momentum  pseudotensors  give  the  correct  result  if  calculations  are  carried 
out  in  quasi-cartesian  coordinates  [9-14].  The  quasi-cartesian  coordinates  are  those  in 
which  the  metric  gik  approaches  the  Minkowski  metric  77^  at  great  distances.  Therefore, 
one  transforms  the  line  element  (8),  given  in  u,  r,  6, 0  coordinates,  to  quasi-cartesian 
coordinates  t,  x,  y,  z  according  to 

u  =  t  —  r, 

.x  =  r  sin  9  cos  </>, 

(25) 
y  =  r  sin  9  sin  </>, 

z 
and  gets 


\     r  r2 

xdx  +  ydy  +  zdz~\2 
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where, 


We  are  interested  in  calculating  the  energy-momentum  components  and  the  power 
output  of  the  radiating  dyon.  Therefore,  the  required  components  of  Hf  are 
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where 

A  =  2Mr-<22, 


<22  =  3e2  +  e  (30) 

We  use  (17)  and  (28)  in  (23),  and  (19)  and  (29)  in  (24),  apply  the  Gauss  theorem  and 
evaluate  the  integrals  over  the  surface  of  two-sphere  of  radius  r0  .  The  energy  and 
momentum  components  in  both  prescriptions  (Einstein  as  well  as  LL)  are 

(31) 


2r0 
and 

Px  =  Py  =  P2  =  Q.  (32) 

Thus  the  total  energy  and  momentum  of  the  radiating  dyon  is  P'  =  M;  0,  0,  0  as 
expected.  Now  using  (17)-(20)  and  (28)-(30)  we  calculate  the  energy  current  density 
components  in  both  prescriptions  and  get 

0i=L01=xA,  (33) 

©3=L02=.yA,  (34) 

03=L03  =  zA,  (35) 


where 

A         Me  +  tfmtfin-i 


(     } 


Again  we  get  the  same  result  in  both  prescriptions.  Switching  off  the  electric  and 
magnetic  charge  parameters  we  get  the  result  for  the  Vaidya  metric  which  were  earlier 
obtained  by  one  of  the  present  authors  [11].  Using  the  energy  current  density 
components,  given  by  (33-36),  we  calculate  the  power  output  across  a  2-sphere  of 
radius  r0  and  get, 


M- 


2r0 


(37) 


4e  =  qm  =  0  in  (37)  gives  the  power  output  for  the  Vaidya  spacetime  [10].  The  total 
power  output  (r0  approaching  infinity  in  (37))  is  -dM/du.  Without  using  any 
energy-momentum  pseudotensor,  Bonnor  and  Vaidya  [3]  obtained  the  power  output 
for  the  metric  given  by  them.  They  got  the  same  result  as  obtained  by  us. 
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Abstract.  Exact  solutions  to  Einstein's  equations  for  a  cloud  of  massive  strings  with  a  general 
static  metric  representing  spherical  plane  and  hyperbolic  symmetries  are  derived.  Some  proper- 
ties of  massive  strings  for  different  cases  are  also  discussed. 
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1.  Introduction 

Cosmic  strings  have  received  considerable  attention  as  they  are  believed  to  have  served 
in  the  structure  formation  in  the  early  stages  of  the  universe.  Cosmic  strings  may  have 
been  created  during  phase  transitions  in  the  early  universe  [1]  and  they  could  act  as 
a  source  of  gravitational  field  [2].  The  existence  of  strings  in  the  early  universe  can  be 
used  to  introduce  density  fluctuations  leading  to  the  formation  of  galaxies  [3-5]. 
A  gauge-invariant  model  of  a  cloud  of  geometric  strings  with  spherical,  plane  and 
cylindrical  symmetries  has  been  studied  by  Letelier  [6].  He  has  suggested  that  the 
massive  strings  are  being  formed  by  geometric  strings  with  particle  attached  to  them 
along  their  extension.  The  general  relativistic  treatment  of  strings  has  been  originally 
given  by  Letelier  [6]  and  Stachel  [7].  Relativistic  models  of  cosmic  strings  in  Bianchi 
type  II,  VLj ,  VIII  and  IX  space  times  have  been  studied  by  Kr ori  et  al  [8] .  Banerjee  et  al 
[10]  have  obtained  Bianchi  type  I  string  cosmological  models  with  magnetic  field. 
Chakraborty  [9]  and  Tikekar  and  Patel  [11]  have  studied  different  cases  of  Bianchi 
type  VI0  string  cosmology  with  and  without  magnetic  field.  Recently,  Tikekar  et  al  [1 2] 
have  presented  a  new  class  of  singularity  free  cylindrically  symmetric  models  in  string 
cosmology. 

The  study  of  static  models  of  the  universe  is  appealing  not  only  for  theoretical 
reasons,  but  also  because,  if  there  exists  an  explanation  for  the  observed  red  shift  other 
than  the  expansion  of  the  universe,  the  static  models  could  gain  importance  [13].  Gott 
[14]  and  Linet  [15]  had  considered  static  cylindrically  symmetric  interior  and  exterior 
space-times  of  an  extended  cosmic  strings  with  uniform  and  non-uniform  energy 
densities.  The  metric  of  the  static  space-time  describing  infinite  straight  cosmic  strings 
of  linear  mass  density  has  been  obtained  by  Letelier  [16].  Recently,  Krori  et  al  have 
studied  the  field  of  a  stationary  cosmic  string. 

In  this  paper  we  are  considering  a  more  general  static  metric  with  spherical,  plane 
and  hyperbolic  symmetries  to  study  a  cloud  of  massive  strings  in  different  cases. 
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2.  Field  equations 

We  consider  the  most  general  static  line  element  for  matter  distribution  given  by 

ds2  =  g2 (x)dt2  -  dx2  -  f2 (x)[_dy2  +  F(y)dz2]  (1) 

which  represents  spherical  \_F(y)  =  sin2;;],  plane  [F(y)  =  1]  or  hyperbolic  [F(y)  =  e2y] 
symmetry. 
The  non-zero  components  of  the  Ricci  curvature  tensor  are 
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1     for  F  =  sin2  y 

0  forF=l 

1  for  F  =  e2y 


and  Ricci  scalar 


n"  f" 


f    ' 


The  Einstein's  equation  for  a  cloud  of  strings  is 


where  the  energy-momentum  tensor  of  a  cloud  of  massive  strings  has  been  given  by 

Tij  =  puiuj-).xiXj  (5) 

with 

ulut  =  —  xlxt  =  1,    M'X,-  =  0. 

Here  p  denotes  the  rest  energy  density  of  the  strings  with  particles  attached  to  them 
(P-strings)  and  /I  is  the  string  tension  density.  If  we  denote  particle  energy-density  by  pp, 
then  we  have 

P  =  PP  +  ^.  (6) 

The  vector  ul  describes  the  cloud  four-velocity  and  xl  is  a  unit  space-like  vector 
representing  the  direction  of  the  string. 

If  we  assume  that  the  space-like  vector  xl  is  parallel  to  d/dx  (xl  must  be  taken  along 
any  one  of  the  three  directions  d/dx,  d/dy,  d/dz)  so  that  the  energy-momentum  tensor 
has  the  form 

Tg  =  p;     T«  =  A.  (7) 
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For  the  metric  (1),  the  field  (4)  with  the  help  of  (2),  (3)  and  (7),  may  be  written  as 
f"          '2 


f     9     fg 
By  combining  (8)-(10),  we  have 


P  9  f 

3.  Solutions  of  the  equations 

To  solve  the  Einstein's  equations  we  require  one  more  relation  between  the  variables 
because  there  are  four  variables  in  three  independent  equations. 

Case  I:  Nambu  strings  "" 

Consider  the  equation  of  state  for  a  cloud  of  strings  as 

p  =  A    (pp  =  0)  (12) 

then  (11)  gives  the  solution 

.      D 


where  D  is  a  constant. 

Using  (12)  and  (13)  in  (8)  and  multiplying  by/2/'  and  integrating  over  x,  we  have 

(14) 


where  E  is  an  integration  constant. 
Substracting  (9)  from  (8)  and  using  (12),  and  then  integrating  over  x,  we  get 

g  =  K  (constant).  (15) 

From  (12)-(14)  we  see  that  the  rest  energy  density  of  the  cloud  of  strings  and  the 
string  tension  density  are  decreasing  as  the  field  is  moving  along  the  x-axis  in  all  the 
three  cases  viz.,  spherical,  plane  and  hyperbolic  symmetries. 

Case  II. 

In  this  case  we  consider  a  relation  between  the  metric  coefficients  as  given  by 

f=gn  (16) 


G  P  Singh 
Substituting  this  value  of/  in  (10),  we  get 

(n+l}2-  +  n2(9-  )   =0  (17) 

9          \9j 

for  which  the  solution  is 

g  =  [Ax  +  B~](n+im"*+n+i)  (18) 

where  A  and  B  are  arbitrary  constants. 
Using  the  value  of  /  and  g  from  (16)  and  (18)  in  (8)  and  (9),  we  have 


8  ___  _ 

P        2  2  +2++  (    ] 


(n  +  n+\)2(Ax  +  B)2     (Ax 
From  (6),  (19)  and  (20)  we  get 


Substituting  the  value  of  s  =  0,  1,  -  1  we  can  obtain  the  solutions  of  massive  cosmic 
strings  in  plane,  spherical  and  hyperbolic  symmetric  space-times  respectively. 

From  (19)-(21)  one  can  see  that  the  rest  energy  density,  tension  density  and  particle 
density  of  the  strings  are  decreasing  as  the  field  is  moving  along  the  x-axis  in  all  the 
three  symmetries  and  all  the  three  densities  vanish  when  x->  oo. 

Case  III:  Uniform  density 

If  we  consider  the  rest  energy  density  of  the  cloud  of  strings  to  be  uniform  i.e.,  p  =  p0 
(constant),  then  (8)  reduces  to 


(22) 

J 

Multiplying  (22)  by/2/'  and  integrating,  we  obtain 

(23) 


Now,  we  consider  a  plane  symmetric  space-time  (e  =  0).  Substituting  e  =  0  in  (23)  yields 
the  solution 

"2  (24) 


where  b  is  an  integration  constant. 
Using  (24)  in  (10),  one  can  obtain 


g"  +  (STrpo/S^Y  +  (8rcp0/3)0  =  0  (25) 
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which  has  the  solution 

^  =  (rfcos(27rp0)1/2x)  +  csin((27tp0)1/2x))exp[-(27cp0)1/2x]  (26) 

where  c  and  d  are  arbitrary  constants. 
From  (9),  (24)  and  (26),  we  have 

1/2x  +  ccos27r)1^x)) 

1J2        '  (27) 


Equations  (6)  and  (27)  give 


c 
From  (27)  it  is  clear  that  when  x  approaches  (27ip0)   1/2tan    l[-\  the  string  tension 

W 
density  vanishes  and  the  cloud  of  particles  would  attain  uniform  energy  density.  Thus 

in  this  model  a  cloud  of  strings  with  particles  attached  to  them  asymptotically  tends  to 
a  cloud  of  particles. 
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Abstract.  We  evaluate  the  emissivity  rates  for  d-decay  and  s-decay  by  exactly  solving  the 
angular  integrals  involved  and  without  assuming  the  degeneracy  of  electrons.  We  have  also 
studied  the  effects  of  QCD  coupling  constant  as  well  as  the  s-quark  mass  on  the  emissivity  rates. 
We  find  that  these  parameters  are  important  in  determining  the  threshold  and  extinction 
densities  for  d-  and  s-decays. 

Keywords.    Quark  stars;  bag  model;  emissivity  rate;  mean  free  path. 
PACS  Nos    12-38;  95-30;  97-60 

1.  Introduction 

The  possibility  of  dense  stars  having  a  core  of  essentially  free  quarks  rather  than 
nucleons  has  been  studied  for  a  long  time  now  [1,  2].  If  such  stars  do  exist  they  have  to 
be  isolated  observationally  from  other  types  of  neutron  stars.  One  distinguishing 
feature,  as  is  well-known,  is  the  surface  emission  of  radiation  which  leads  to  stringent 
conditions  on  the  temperature  of  the  quark  matter  stars.  Again,  as  is  well-known,  the 
neutrino  emission  plays  an  important  role  in  the  cooling  of  quark  stars  in  the  early 
stages  of  star  evolution.  The  formalism  for  the  neutrino  emissivity  rate  by  quark  matter 
on  the  basis  of  ^-equilibrium  and  charge  neutrality  condition  has  been  given  by 
Iwamoto  [3].  The  basic  jS-decay  processes  are 


d^u  +  e~  +  ve  (1.1) 

along  with  their  inverse  reactions 


(1.2) 

Iwamoto  has  given  a  simple  formula  for  neutrino  emissivity  rate  from  /3-decay  of 
quarks  (see  eq.  (3.12)  of  ref.  [3])  viz. 

E4))  (1.3) 


where;  =  1  to  4  refers  to  s(or  d),  ve,  u,  e~  respectively  and  i  =  s  or  d  as  the  case  may  be, 
n(Ej)  is  the  distribution  function  of  jth  particle  species  and  Wti  is  the  weak  decay 
transition  rate.  Using  the  Weinberg-Salaam  model,  as  extended  to  semi-leptonic 
processes  to  describe  weak  interactions,  the  transition  rate  for  jS-decay  of  s-  or  d-quark 


is  given  by 

Wn  =  64G2(27r)4<5*(p1  -  p2  -  p3  -p4)Qvp2)(jVP4)    A  (2£j  ^)         (1-4) 


where  G2  =  G2  sin2  9C  for  i  =  s-quark  and  G?  =  G2cos20c  for  i  =  d-quark.  Neglecting 
the  neutrino  momentum  p2  in  the  <5-function  and  by  replacing  the  particle  energies  and 
momenta  by  their  respective  chemical  potentials  and  fermi-momenta,  Iwamoto  has 
given  the  following  formulas  for  neutrino  emissivity  rate  (see  eq.  (3.21)  and  (4.1  1)  of  ref. 
[3]) 

914 
Evd  =  —  G2cos20cacpf(d)pf(u)pf(e)  T6  (1.5) 

spf(u)pf(e)(l  -cos6>34)  T6.  (1.6) 


Equation  (1.5)  is  the  emissivity  rate  for  the  massless  d-quark  decay  while  (1.6)  is  for  the 
s-quark  decay.  In  deriving  these  results,  Iwamoto  also  assumes  that  electrons  are 
completely  degenerate  and  so  their  energies  and  momenta,  like  those  of  the  quarks,  are 
replaced  by  their  chemical  potentials  and  fermi-momenta.  Burrows  [4]  improved  upon 
this  calculation  by  including  the  effects  of  neutrino  momentum  and  obtained  a  T7 
behaviour  for  the  emissivity  rate  instead  of  T6.  Duncan  et  al  [5]  (1983,  paper  I;  1984, 
paper  II)  studied  in  detail  the  dependence  of  emissivity  on  the  QCD  coupling  ac.  They 
also  show  that  the  electron  fraction  in  quark  matter  vanishes  at  a  finite  baryon  density, 
nB  =  nex  (extinction  density)  and  so  the  emissivity  becomes  zero  at  nB  ^  nex  .  Also, 
recently  Ghosh  et  al  [6]  have  reported  a  calculation  of  the  emissivity  rate  without 
making  the  assumptions  of  Iwamoto.  They  could,  however,  reduce  (1.3)  to  a  five- 
dimensional  integral  which  they  had  to  do  numerically. 

We,  in  this  paper,  have  reanalyzed  the  emissivity  rate  and  mean  free  path.  Our 
starting  point  is,  of  course,  again  the  same  as  (1.3).  We  have  demonstrated  that  the 
angular  integrals  can  be  done  exactly  without  requiring  any  assumptions  to  be  made 
and  leading  to  an  expression  for  emissivity  rate  in  terms  of  a  three-dimensional  integral. 
Further  simplification  requires  one  to  assume  electrons  to  be  degenerate.  However 
such  an  assumption  is  not  valid  at  high  temperatures,  therefore  we  perform  our 
calculations  both  with  and  without  this  assumption.  For  the  thermodynamics  of  quark 
matter  we  retain  terms  to  first  order  in  ac  and  include  the  effects  of  finite  temperature 
and  finite  mass  of  the  s-quark. 

In  §  2  we  indicate  the  main  steps  involved  in  the  derivation  of  emissivity  rate  and 
mean  free  path.  The  technical  details  are  Delegated  to  the  Appendix.  In  §  3,  we  describe 
our  results  for  emissivity  rate  and  mean  free  path  as  functions  of  ac,  T,  ms  and  nB. 
Finally,  we  compare  these  results  with  those  of  Iwamoto  and  Duncan  et  al  and  discuss 
their  significance. 

2.  Formalism 

We  consider  three-component  quark  matter  (u,d,s)  in  ^-equilibrium  with  a  small 
fraction  of  electrons.  The  thermal  equilibrium  condition  leads  to  the  following  relation 
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(2-1) 

/i 

If  one  assumes  that  neutrinos  escape  freely  from  quark  matter  and  hence  do  not  take 
part  in  thermal  equilibrium,  the  neutrino  chemical  potential,  fiv,  can  be  put  equal  to 
zero.  Charge  neutrality  condition  sets  an  additional  constraint  on  the  number  densities 
n((i  —  u,  d,  s,  e)  viz. 

-  =  ^u--nd--ns-ne  =  0  (2.2) 


The  total  baryon  density  of  quark  matter  is  given  by 


(2.3) 


The  above  equations  imply  that  only  one  parameter,  say  //u,  is  left  free.  Now  consider 
the  thermodynamics  of  the  system.  The  pressure  and  energy  are  given  by  [2,  7] 

P=-£Q,-B  (2.4a) 

i 

£=-P+S^-TS^  (2.4b) 

t  i      "  * 

where  the  thermodynamic  potentials  are  given  by 

-2ac/n)  +  27L2(T/^)2(\  -2ac/37i)]  (2.4c) 

-  2-5P)(l  -  A2)0'5  +  l-514ln(l  +  (1  -  A2)°'5M) 
+  2;t2(l  -  A2)°'5(  T/fis)2  -  (2ac/7c)(3((l  -  A2)0'5  -  A2ln(l 
-  2(1  -  A2)2  +  3A4(ln(A))2  +  7r2(T/^s)2((l  -  2/3A2)  -  12 
+  (1  -  iTWa  -  A2f5)  -  6(A2(1  -  A2)0'5 
+(1  -P)°- 


(2.4d) 
Number  densities  of  the  particle  species  are  obtained  as 

nu  =  (/*>2)(1  -  2ac/7r)  +  Mu  T2(l  -  2ac/37r)  (2.4e) 

nd  =  nu(u~d)  (2.4f) 

-  m2)1'5  +  2;r2  T2(2^2  -  m2)/(^2  -  m2)0'5  -  (8a»M>2  -  m2) 
(2ac/7c)(-  12msV  -  m2)0'5  ln[(^  +  (^  -  m2)°-5)/ms] 
6(m4/MjlnK/^))  +  (4rc2/3)  T2^  -  6(m2(^2  -  m2)0'5 
(7t2/3)  T2(m2/(^2  -  m2f5)  -  K2/M,)ln[(M»  +  (^  -  ms2)°-5)/ms]) 
(M2  -  m2)0'5  -  (7r2/3)m2  T2/(^2  -  m2)1'5) 
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-m2)°-5/ms)])](l/47r2)  (2.4g) 

ne  =  (£/3n2)(l  +  n2T2/n2e)  (2.4h) 

where  i  =  u,  d,  s  in  (2.4a),  (2.4b)  and  i  =  u,  d  in  (2.4c),  ac  =  #2/47r  and  A  =  ms/fis.  Note  that 
Iwamoto  and  Duncan  et  al  define  QCD  coupling  constant  through  ac  =  g  /I6n.  In  the 
above  expressions  the  terms  up  to  first  order  in  ac  and  to  second  order  in  temperature 
T  have  been  retained. 
The  relation  between  fermi-momenta  and  the  particle  densities  is  given  by 


for  i  =  u,d,s,e~.  For  eqs  (2.4e-2.4h)  and  (2.5)  we  then  obtain  relations  between  the 
chemical  potentials  and  fermi-momenta  of  zth  particle  species.  In  the  T  =  0  limit  these 
equations  reduce  to 

ft  -  flPn  '  (2-6) 

where  a  =  1  +  (2ac/37i)  and  i  =  u,  d.  For  the  s-quarks,  we  obtain 

x2)0-5)/(l+x'2)0-5}])     (2.7) 


where  x  =  pfjms.  Equations  (2.6)  and  (2.7)  are  identical  to  (8,  9)  and  (17a)  of  Duncan  et  al 
[5]. 

The  stability  of  quark  matter  at  finite  temperature  is  determined  by  the  value  of  energy 
per  baryon,  E/nB.  We  consider  the  quark  matter  as  stable  so  long  as  E/nB  stays  below 
930  MeV  at  zero  pressure.  For  choosing  the  various  parameters  viz.  ac,  B,  fi0  and  ms,  the 
criterion  is  to  determine  the  stability  region  discussed  above.  Once  these  values  are  chosen 
we  are  then  left  with  one  parameter,  say  jUd  =  fj.s  —  /j..  This  can  be  determined  by  fixing  the 
baryon  density  nB. 

Substituting  expression  (1.4)  in  (1.3),  the  emissivity  rate  for  the  ith  quark  is  given  by 

24        /  4    f         \ 

2"(£i)(l  -  n(E3))(l  -  n(JE4)) 


p2-p3-p4).    (2.8) 

The  distribution  function  «(£,)  of  theyth  particle  species  is  given  by  (for;  =  u,  d,  s,  e~  ) 

n(£;)  =  (exp[(£y-^VT]  +  I)'1.  (2.9) 

Defining  the  angular  integrals  in  (2.8)  by 


The  angular  integrations  involved  in  the  above  expression  can  be  performed  without 
recourse  to  any  approximations.  The  rather  complicated  expression  for  /(£1,£2,£3,£4) 
and  some  of  the  steps  involved  in  the  evaluation  of  angular  integrals  are  given  in  the 
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Appendix.  We  can  rewrite  (2.8)  as 
12G      00 


d£t        d£2       d£3 

m2  J  m3 


E.^.fiJ  (2.10) 

At  this  stage  this  eq.  (2.10)  can  be  compared  with  (3.13),  (3.15)  and  (3:18)  of  ref.  [3].  Instead 
of  the  factor  pf(d)pf(u)pf(g)£|  of  Iwamoto,  we  obtain  the  function  /,  which  is  a  function  of 
all  the  four  variables  E1,E2,E2  and  £4.  The  most  crucial  difference  is  that  the  function  /  is 
non-zero  only  for  a  certain  range  of  values  of  these  variables.  We  now  introduce  the 
following  dimensionless  parameters  in  expressions  (2.9)  and  (2.10) 


=  E2/T, 


On  substituting  for  the  distribution  function  «(£,-)  from  (2.9)  into  (2.10),  the  emissivity  rate 
can  be  rewritten  as 

19  /*°°          C°°         r-oo 

Ev(0  =  -6<j?  T4       xdx       dXi          dx3 

31  Jo          J;s          Ji2 


x  /(£lf£2,£3,JE4)/(l  +  exp(x1))(l  +exp(x3))(l  +exp(jc4))        (2.11) 
where 


'3  =  0*4  ~  m4)/^- 

Carrying  out  a  similar  analysis  for  the  mean  free  path,  we  arrive  at  the  following  formula 
24G? 


x  I'(EltE2tE3,EJ/(l  +  exp(xl))(l  +  exp(x3))(l  +  exp(x4))        (2.12) 

where  l^  ,  12,  13  are  the  same  as  defined  before  for  (2.1  1).  The  expression  for  /',  which  differs 
from  that  for  /  only  in  the  sign  of  certain  terms,  is  given  in  the  Appendix. 

Let  us  now  look  at  the  temperature  dependence  of  emissivity  rate.  Since  the  chemical 
potentials  of  the  quarks  are  a  few  hundred  MeV  for  temperatures  up  to  a  few  MeV,  we  can 
treat  u,d  and  s  quarks  as  completely  degenerate.  The  limits  of  xl  and  x3  integrals  now  go 
from  (—  oo,  oo  )  and  two  of  the  integrations  can  be  performed  exactly.  The  results  are 

xdx 

o  -oo 

x  (x  -  x4)/(l  +  exp(x4))(exp(x  -  x4)  -  1)  (2.13) 
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In  figure  3a  we  show  the  effect  of  a  on  the  emissivitv  ™t*  P  ^  «      r 


Figure  3a.  Emissivity  rate  Ev(d)  (in  erg/cm3  sec),  for  d-quark  as  a  function  of 
baryon  density  nB(mfm~3)  at  T  =  0-lMeV  for  different  values  of  ac.  Curves  1,2,3 
refer  to  ac  =  0-05, 0-6  and  0-065  respectively.  Curves  la,  2a,  3a  refer  to  the  Duncan 
et  al  results  for  the  same  values  of  <*„. 


ij; /; -_ 


Figure  3b.  Same  as  in  figure  3a  but  for  ms  =  300  MeV.  Curves  1, 2, 3  correspond  to 
our  results  at  ac  -  0-065, 0-08  and  0-1  respectively.  Curves  la,  2a,  3a  refer  to  Duncan 
et  al  results  for  the  same  values  of  «„. 


density  remains  much  lower  than  that  of  Duncan  et  al  as  we  increase  ac.  This  difference  is 
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integral  results  for  ac  =  0-025  and  0-06  respectively 

the  prison  of  1-dimensional  (electrons  degenerate  case)  and 
(electrons  non-degenerate  case)  results  at  T=  0-1  and  4MeV  for  d-decay 
We  observe  a  fairly  close  agreement  of  1-dim.with  the  2-dirn  .  results  for  a  =0-025 
v  ™=    ?65  dlffefnCeS  Start  aPPearinS  only  at  high  densities  for  both  7  =  0-1 

y  has  Slgmficant  effect  only  at  densities  very  close  to 


Finally,  in  figure  5  we  show  the  mean  free  path  A,  as  a  function  of  the  baryon  density  nE 
at  T=  1  and  4MeV  and  for  neutrino  energies,  £2  =  T.  We  find  that  A  drops  below  the 
10  km  limit  (approximate  radius  of  a  neutron  star)  at  T  >  4  MeV 

To  summarize,  in  this  paper  we  have  calculated  the  emissivity  and  mean  free  path  for  the 
u,  d,  s-quark  matter.  We  have  followed  the  basic  formalism  of  Iwamoto  and  others  but  we 
have  done  the  various  angular  integrals  involved  exactly.  As  a  consequence,  the  results 
obtained  are  not  in  terms  of  analytic  expressions  but  in  terms  of  certain  integrals  which  are 
evaluated  numerically.  For  d-decay,  we  find  a  reasonably  close  agreement  with  Duncan  ei  al 
at  higher  temperatures  while  at  lower  temperatures  there  is  a  difference  of  up  to  one  order  of 
magnitude  in  the  emissivity  rate.  For  s-decay,  our  results  agree  broadly  with  those  of  Duncan 
et  al  particularly  at  higher  densities.  However  in  our  case  there  is  no  threshold  density  below 
which  the  emissivity  vanishes,  unlike  in  their  case  where  such  threshold  exists.  The  effect  of 
QCD  coupling  constant  ac,  and  the  strange  quark  mass  ms,  on  the  emissivity  rate  has  been 
sudied  in  detail.  Apart  from  the  magnitude  of  emissivity  rate,  we  find  that  the  extinction  density 
also  depends  critically  on  these  parameters.  As  for  the  mean  free  path  we  have  shown  that  it  is 
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Figure  5.  Mean  free  path,  A  (in  km),  as  a  function  of  baryon  density  nB  (in  fm  ~  3  ),  at 
different  temperatures  and  neutrino  energies.  Curves  1,  2  refer  to  T  =  1  and  4  MeV 
respectively  and  ev  =  T.  Curves  designated  a  refer  to  the  results  obtained  using  the 
Iwamoto  expression  for  the  same  temperatures  and  neutrino  energies. 


chemical  equilibrium  is  valid  only  for  temperatures,  T<4  MeV.  Though  we  have 
shown  results  for  one  choice  of  the  parameters,  quite  similar  results  follow  for  any  other 
suitable  choice.  The  basic  formalism  for  many  other  physical  processes  like  bulk 
viscosity  and  cooling  rates  is  similar.  Calculations  in  this  direction  are  in  progress. 

Appendix 

In  this  we  describe  some  of  the  steps  involved  in  the  evaluation  of  the  angular  integral 
/(#!  ,E2,E3,  E4),  which  is  defined  as 


(Al) 
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where 


X%1-P2-P3-P4)-  (A2) 

Separating  the  three-dimensional  <5-function  into  a  delta  function  over  the  magnitude 
and  one  over  the  solid  angles: 


Q0)/r2  (A3) 

and  using  the  latter  to  get  rid  of  the  integration  over  Q2,  we  obtain 


(A4) 
Next,  putting 


and  using  the  remaining  delta  functions  to  perform  the  integration  over  x,  we  obtain 

^3dQ40(l-)(p2  +  P2-p2)/2p1P|) 

x  [£3£4  +  0-5(p2  +  p2)  -  0-5P2]  [E! E2  -  0-5(p2  +  p2)  +  0-5P2]/P. 

(A5) 

The  theta  function  has  made  its  appearance  because  the  integration  over  x  is  performed 
only  from  —  1  to  +  1.  The  remaining  integrations  over  dQ3  dQ4  can  now  be  performed 
easily  because  only  the  angle  in  p3-p4  is  involved;  one  only  has  to  be  careful  about  the 
limits  of  integration  which  are  determined  by  the  theta  function.  This  leads  to  the  result 


where 
and 


/  =  (jc/S)!*1.2*!4/^!,  2;  3,  4)  +  (l/6)(/c3+4  -  kl2)I2(l,  2;  3,  4)  -  (1/20) 

x/3(l,2;3,4)]  (A6) 


n(i,jl  k,  1)  =  {Py  l  -  Pfr  1  )0(PU  -  Py)0(py  -  PU) 
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where 

Pij  =  Pt  +  Pjl    PU  =  \Pi-Ps\- 

Similarly  by  considering  expression  (Al)  for  the  mean  free  path  and  performing  the 
angular  integrations  in  the  same  way  as  for  emissivity  (except  for  taking  care  that  in  this 
case  the  integration  over  neutrino  momentum  is  absent)  we  obtain  (2-12)  for  the  mean 
free  path  with  the  function  /'(£1,jE2,£3,£4)  given  by 


x  (/c3+4  +  fc1_2)/2(l,2;  3,4)  +  (l/20)/3(l,2;  3,4)].    (A7) 
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Abstract.  The  efforts  of  Roy-Sridhar-Close-Cho-Wise-Trivedi  to  resolve  the  CDF  $'  anom- 
aly with  cascades  from  above-threshold  x'c  states  require  well  defined  signatures  [a  small  total 
width  and  a  large  branching  fraction  for  x'cj^>y  + 1/*']  for  the  solution  to  be  viable.  Here  we 
estimate  the  production  of  such  states  from  BR(J3 -»•;£,  +  X)BR(x'cJ->y{j/')  and  yy  production  of 
X'c2  at  CLEO  II,  and  comment  on  the  feasibility  of  testing  the  hypothesis  in  terms  of  current 
experimental  capabilities. 

Keywords.    CDF  \l/'  anomaly;  fa  search. 
PACS  Nos     13-20;  13-65;  14-40 

1.  Introduction 

The  CDF  measurement  [1]  of  J/if/  and  \f/'  production  at  large  transverse  momentum 
(PT)  in  1-8  TeV  pp  collisions  at  the  Tevatron  has  produced  one  of  the  most  intriguing 
experimental  results  in  recent  times.  At  low  transverse  momentum  the  production  of 
j/r's  in  pp  in  collisions  is  expected  to  proceed  via  the  production  of  %  states  followed  by 
their  decay  [2],  i.e.  fusion  process  g +  g-+%-+\l/ +  X.  The  analogous  process  at  larger 
transverse  momentum  is  gg  ->  gift.  This  process  generates  a  cross-section  that  falls  off  at 
large  PT  much  faster  than,  say,  the  jet  rate.  This  observation  led  to  an  expectation  that 
i/f's  produced  at  large  PT  came  almost  exclusively  from  fr-quark  decay.  By  detecting 
whether  or  not  the  ^'s  come  from  the  primary  event  vertex,  CDF  has  tested  this 
expectation.  The  fraction  of  ^'s  produced  directly  is  almost  independent  of  PT  and  the 
rate  of  direct  \j/  production  at  large  PT  is  (substantially)  larger  than  expected.  Thus,  this 
expectation  is  now  known  to  be  false. 

The  dominant  production  mechanism  of  J/i/^'s  at  large  transverse  momentum  is  now 
believed  to  be  the  fragmentation  of  light  (and  charm)  quark  and  gluon  jets  into  %c 
mesons  that  subsequently  decay  to  J/\j/  [2]  via  the  radiative  mode  as  stressed  by  Cho 
et  al  [3]  where  the  gluon  fragmentation  is  found  to  be  particularly  important.  When 
this  J/\j/  source  is  included,  the  theoretical  prediction  for  dcr(pp-*  J/\j/  +  X)/dPT  at 
^/s  =  1-8  TeV  agrees  within  a  factor  of  2  with  recent  CDF  data  [1]. 

While  the  rate  for  J/\j/  production  is  in  agreement  with  expectations,  given  the 
inherent  theoretical  uncertainties,  the  rate  for  \l/'  production  at  CDF  is  at  least  a  factor 
of  20  above  theoretical  expectations  [4].  The  calculation  does  not  include  the  possibi- 
lity of  \l/'  production  from  the  decay  of  j^,  2P  states.  These  2P  charmonium  states  are 
above  DD  threshold,  however  a  branching  ratio  of  a  few  %  to  if/'  could  be  sufficient  to 
explain  the  deficit.  Roy  and  Sridhar  as  well  as  others  investigated  this  possibility 
quantitatively  [3]. 
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The  basic  premise  is  to  recall  that  if/'  is  the  heaviest  cc  bound  state  which  lies  below 
the  DD  threshold.  Therefore,  n  —  1  %cj  states  cannot  radiatively  decay  to  \j/'  but  their 
n  —  2  counterparts  can.  None  of  these  %'cj  (or  %cj  (2P))  states  which  lie  above  the  DD 
threshold  have  been  observed.  Estimates  of  their  masses  yield  M(%c0(2P))  =  3920  MeV, 
M(xcl(2P)  =  3950  MeV,  and  M(xc2(2P))  =  3980MeV  [5].  These  mass  values  taken 
literally  would  kinematically  allow  the  S-wave  transitions  xe0(2P)  ->-DD  and  #cl(2P)  -> 
D*D  to  occur.  We  therefore  expect  that  the  J  =  0  and  perhaps  the  J  =  1  excited  %C/(2P) 
states  will  be  broad  and  have  negligible  branching  fractions  to  lower  cc  bound  states. 
However,  angular  momentum  ajnd  parity  considerations  require  the  analogous  decay 
Xc2  (2P)  ->  DD  and  xc2  (2P)  -»>  D*  D  for  the  J  =  2  state  to  proceed  via  L  =  2  partial  waves. 
Although  we  cannot  readily  compute  by  how  much  these  D-wave  decays  will  be 
suppressed,  it  impossible  that  the  branching  fractions  for  #c2(2P)  states  to  charmonium 
states  below  DD  threshold  could  be  significant.  Close  [3]  suggests  that  the  %cl(2P)  may 
also  have  suppressed  hadronic  widths  due  to  quantum  numbers  or  nodes  in  form  factors 
manifested  in  decays  near  threshold,  e.g.  since  3PX  ->DD*  is  near  threshold  the  S-  and 
D-waves  present  are  affected  by  radial  wave  function  nodes  which  can  conspire  to  reduce 
the  width.  Hence  in  §  2  below  we  will  consider  the  search  for  both  %cl(2P)  and  xc2(2P). 
Although  the  1D2  and  3D2  charmonium  states  may  be  present  in  the  CDF  data  and 
detectable,  they  are  unlikely  to  explain  the  \l/'  (3685)  enhancement.  For  instance  [3]  1Z)2 
production  is  suppressed,  and  it  is  expected  to  have  a  very  small  branching  ratio  to  \j/'y. 
On  a  quantitative  basis  Page  [6]  has  calculated  the  total  widths  of  radial  Xcj  states 
and  found  that  they  could  be  as  small  as  1-5  MeV.  We  need  an  estimate  of  the  branching 
ratio  BR(^7  -»y  +  \l/').  A  very  rough  estimate  [7]  can  be  obtained  using  the  known  experi- 
mentally measured  branching  ratio  BR(x62(2P)  -»  Y(2S)  +  y)  «  16%  [8]  as  follows: 

/n\2/k\3  p    (      (IP}) 

y^')=  -£     BR(yb2(2P)->yY(2S))   "b2  (1) 


Here  (Qc/Qb)2  =  (4/9)/(l/9)  =  4,  and  (/cc//eb)3  is  the  modification  due  to  El  phase  space; 
there  will  be  some  changes  due  to  the  actual  size  of  bb  relative  to  that  of  cc  but  this 
will  be  a  factor  of  2  or  3  and  the  estimate  (1)  may  not  be  accurate  to  better  than  a 
factor  of  5  to  10  anyway.  For  instance,  taking  M(%c2(2P))  =  3980  MeV  and 
rtot(&2(2P))«rtot(x£2(2P))  gives  BR(xc2(2P)^y  +  ^')~100%!  Thus  the  value 
BRGtc2(2P)->y  +  J!r')«10%  suggested  by  Roy  and  Sridhar  [3]  is  by  no  means 
unreasonable.  Another  back  of  the  envelope  ansatz  [7]  is  to  take  the  1P->1S 
charmonium  data  and  assume  the  2P-+2S  overlaps  will  have  the  same  order  of 
magnitude,  then  B(%c2(2P)  ->y  +  \j/')  «  B(*c2(lP)  ->y  +  J/\l/)  =  13-5%,  with  a  measured 
rc2(lP)  full  width  =2  MeV  [8].  Hence  if  one  of  the  %CJ(2P)  states  is  calculated  to 
have  a  total  width  in  the  range  1  MeV  to  5  MeV  [6],  a  branching  ratio  5(%CJ(2P)-» 
y  +  1//)  >  5%  can  be  expected  (a  value  in  the  range  5-10%  would  be  needed  to  explain 
the  CDF  \li'  anomaly  [1,3]).  To  summarize,  the  result  is  that,  in  order  of  magnitude, 
one  expects  the  radiative  transition  to  be  0(100  KeV)  and  hence  the  BR  is  0(1-10%)  if 
the  hadronic  width  is  O(  10-1  MeV).  It  would  be  surprising  if  the  branching  ratio  is  less 
than  1%  or  much  greater  than  10%.  To  give  an  adequate  spread  for  illustration,  we 
shall  take  in  §  2, 

=  1,5,  10%.  (2) 


the  inclusive  decays  of  J5  mesons  to  charmonium.  Hence  we  seek  to  estimate 

BJ  =  BR[B^J  +  *]BR[x;,-+y<//].  (3) 

The  branching  ratio  BR[J5->Xc/  +  %]  *s  given  by  Bodwin  et  al  [9]  as 

Rfrcj)  x  10-7%  x  \R'x,(0)/R'tc(Q)\2  (4) 

where  ^(^J)  =  r(5-»-xej  +  ^)/r(fc->'e~  ve  +  X),  10-7%  is  the  observed  semileptonic 
branching  ratio  for  the  B  meson,  and  multiplicative  last  term  |  jR'x.  (0)/R'Xc(0)  |  2  is  Braaten's 
correction  factor  [10]  for  estimating  BR(5  ^x'c  +  X)  from  BR(B  ->  xc  +  AT).  The  deriva- 
tives of  the  wave  functions  at  the  origin  can  be  obtained  from  potential  models.  This 
procedure  is  certainly  correct  for  the  color-singlet  matrix  element  (the  cc  contribution) 
and  it  may  also  be  correct  for  the  color-octet  matrix  element  (the  ccg  contribution)  if 
the  latter  is  dominated  by  the  radiation  of  soft  gluons  from  the  cc  state.  Whether  or  not 
this  is  the  case  remains  to  be  seen  [10]. 

The  expression  R(xcj)  in  (4)  is  given  in  terms  of  the  non-perturbative  parameters 
H1  and  H'8  (proportional  to  the  probabilities  for  a  cc  pair  in  a  color-singlet  P-wave  and 
a  color-octet  S-wave  state,  respectively,  to  fragment  into  a  color-singlet  P-wave  bound 
state)  and  takes  the  following  form  [9} 

£  12-4(2C+  -C_)2H1/M6  +  9-3(C+  +  C.)2H'B(Mb}/Mb 

(5) 


Here  C+  and  C_  are  Wilson  coefficients  that  arise  from  evolving  the  effective 
4-quark  interactions  mediated  by  the  W  boson  from  the  scale  M  w  down  to  the  scale 
Mb.  Numerically  C+(Mb)  s  0-87,  CL(Mt)^  1-34,  H1  *  15  MeV,  H'8(Mb)  s  2-5  MeV, 
andM6  =  5-3GeV. 

For  the  value  |/?^(0)/R'Xc(0)|2,  we  use  a  recent  quark  potential  model  [11]  which 
takes  into  account  that  the  value  of  R'(G)  for  P-state  charmonium  is  sensitive  to  the 
short  distance  behaviour  of  the  potential,  so  that  it  is  better  to  use  values  obtained  from 
potentials  whose  short  distance  behaviour  is  more  reliable.  The  potential  [11]  (an 
improved  version  of  the  Buchmiiller-Grunberg-Tye  potential)  approaches  the  2-loop 
QCD  result  at  short  distance,  leads  to  energy  spectra  and  leptonic  widths  in  very  good 
agreement  with  experiment.  The  values  for  R'(0)  for  the  P-wave  charmonium  states 
from  this  potential  model  Program  [12]  are 

State    K'(0)  in  GeV5/2 

&  0-20  (6) 

X'c  0-23 

For  comparison,  the  value  of  R'(G)  for  xc[10]  obtained  directly  from  the  measured 
widths  of  xci  and  %C2  was  about  0-15  GeV5/2,  hence  one  should  ascribe  an  error  of  not 
less  than  30%  to  any  of  these  values.  It  is  nevertheless  reassuring  that  a  recent 
compilation  [13]  of  first  non-vanishing  derivative  at  zero  cc  separation  for  radial 
Schrodinger  wave  function  of  earlier  potential  models,  give  for  \Rfrc(0)/R'Xc(0)\2  values 
1-36  (Buchmiiller-Tye),  1-05  (Power-law),  0-97  (Logarithmic),  and  1-42  (Cornell),  quite 
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of  the  recent  potential  model  [11]. 
Assembling  the  pieces  together  from  (3),  (4),  (5)  and  (7),  we  have  for  (3)  and  J  =  1. 

B1=2-89x  l(T5,l-45x  10~4,2-89x  10~4  (8) 

for  the  respective  values  of  BRj^  -  y\l/'\  given  in  (2).  The  corresponding  values  for 
J  =  2  are 

B2  =  3-77x  KT5,l-89x  10~4,3-77  x  KT4.  (9) 

The  world  average  (WA)  and  CLEO  II  measurement  [14]  for  BR(B-»^C  +  X)  are 
BR(B-+xcl  +X)  =  0-42  ±  0-07%  (WA) 
BR(B->xc2  +  AT)  =  0-25  ±  0-10%  (CLEO).  (10) 

For  central  values  of  (10),  multiplying  the  above  experimental  numbers  by  correction 
factor  (7),  we  have 

5-54x  10-  3 

3-30x  lO'3.  (11) 

This  compares  with  the  values  obtained  by  scaling  the  predictions  of  Bodwin  et  al 
[9]  for  BR(B  -»xcj  +  *]  usin§  the  correlation  factor  (7)  of 

2-8x  KT3 

3-77x  10-3.  (12) 


The  agreement  seems  good  for  the  %'c2  case. 

The  final  step  is  to  estimate  the  number  of  observed  events,  N}bs .  First  we  note  that  at 
CLEO  II,  the  number  of  produced  B  mesons  is  given  by 

/  f         \ 
Deduced  =        £>dt\(r(e+e-  ->Y(4S)-+BB)2  (13) 


where  on  a  good  year  the  integrated  luminosity  J^dt  is  2fb~l  of  data  on  tape,  the 
a(e+e~  -»Y(4S)->J3B)  cross-section  is  about  1-07  nb,  and  the  factor  of  2  takes  into 
account  productions  of  pairs  of  B  mesons  in  Y(4S)  decays.  The  number  of  observed 
events  NJbs  is  then  given  by 

(14) 

where  B7  is  denned  in  (3),  and  e  is  the  efficiency  for  detecting  \f/'  ^  J '/\l/K*  n~  in  the 
dilepton  mode  (about  20%  in  the  CLEO  II  detector  [14]).  Hence  using  theory  (8),  (9)  or 
(12),  we  have 

N°bs  =  0-95,  4-77,  9-51  events 

Afobs  _  i  .-)A    £.<•><•)     -I  ^.Af\ 


%'c  Charmonium  states  as  solution  to  the  CDF  \l/'  puzzle 

for  the  respective  values  of  BR(%'CJ-+y\l/')  given  in  (2).  If  we  take  advantage  of  the 
experimentally  known  branching  ratios  (10)  in  deducing  (11),  we  have 

Wfs  =  1-82,  9-11, 18-23  events 

iV°bs  =  1-09,  5-43, 10-86  events  (16) 

for  the  respective  values  of  BR(x'CJ^y\{/').  The  theoretical  mass  estimates  [5] 
M(&i(2P))  =  3950  MeV  and  M(xc2(2P))  =  3980  MeV  are  also  useful.  The  approximate 
locations  of  these  resonances  are  needed  to  conduct  the  experimental  search  and  reduce 
background. 

At  CLEO  II,  the  %'c2  state  can  also  be  searched  for  via  the  two  photon  production  of 
this  J  -2  state.  For  instance,  the  number  of  events  N^  from  e+  e~  ->•  e+  e~ yy  -> 
e+ e~  %'c2  is  estimated  to  be 


-.         (17) 

In  a  recent  paper  on  measurement  of  two-photon  production  of  the  #C2,  Dominick 
et  al  [15],  using  1-5  fb  ~ 1  of  data  taken  with  beam  energies  near  the  Y(4S),  25-4  +  6-9  N^ 
events  were  obtained,  with  efficiency  s(l+l~y)  =  0-187  +  0-003.  Taking  into  account 
that  data  accumulation  is  now  2fb~1  on  Y(4S)  and  Ifb"1  in  the  continuum,  the 
number  NX(2  can  be  doubled  and  we  assume  that  £(l+ 1~  n  + rc~  y)  ^  a(l+ 1~  y)/2.  For 
BR(Xc2  ~*  ^'y}  =  100//0  and  taking  central  values  for  experimental  numbers  to  illustrate, 
we  have 

^  =  6  x  ffJl~**c\ events.  (18) 


The  cross-section  ratio  is  equal  to  the  T(x'c2  -*yy)/r(x,c2-*yy)  and  is  not  yet  known 
since  the  total  width  F(^2)  has  not  yet  been  measured. 

3.  Conclusions 

We  have  presented  above  in  §  2  event  rates  for  observing  the  ^cJ(J  —  1, 2)  in  B  decays  at 
CLEO  II.  Though  the  eventuates  AT°bs  given  by  (15)  and  (16)  are  not  large  even  with 
a  year's  accumulation  of  BB,  they  can  be  steadily  increased  by  extending  the  BB 
accumulation  over  a  period  of  several  years.  Furthermore  we  have  been  surprised  by 
how  large  the  branching  ratios  BR(%^  -»  y\\i'}  can  be  in  §  1,  given  the  known  and  sizable 
[8]  BR^  ->y  +  T(2S)) « (21  ±  4%)  and  BR(&,2  ->y  +  Y(2S))  S  (16-2  ±  2-4)%.  Hence 
optimistically  we  can  expect  Njbs  to  be  in  the  range  of  10-20  events  for  an  integrated 
luminosity  of  2fb~~1,  as  needed  to  explain  the  CDF  ^'  anomaly  [1].  Estimates  for  2y 
production  of  x'c2  are  given  in  (17)  and  (18).  At  CDF  the  invariant  mass  spectrum  of  yij/' 
combination  can  be  studied.  This  possibility  should  be  explored  in  parallel  with  the 
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Abstract.  Recently  Rosen  and  the  present  author  obtained  the  energy  and  momen  turn  densities 
of  cylindrical  gravitational  waves  in  Einstein's  prescription  and  found  them  to  be  finite  and 
reasonable.  In  the  present  paper  we  calculate  the  same  in  the  prescriptions  of  Tolman  as  well  as 
Landau  and  Lifshitz  and  discuss  the  results. 

Keywords.    Einstein-Rosen  metric;  energy-momentum  pseudotensors. 
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1.  Introduction 

Long  ago  Scheidegger  [1]  raised  doubts  whether  gravitational  radiation  has  well- 
defined  existence.  To  this  end,  Rosen  [2]  investigated  whether  or  not  the  cylindrical 
gravitational  waves  have  energy  and  momentum.  He  used  the  energy-momentum 
pseudotensors  of  Einstein  and  Landau  and  Lifshitz  (LL)  and  carried  out  calculations  in 
cylindrical  polar  coordinates  and  found  that  the  energy  and  momentum  density 
components  vanish.  The  results  obtained  by  him  fit  in  with  the  conjecture  of  Scheideg- 
ger that  a  physical  system  cannot  radiate  gravitational  energy.  Two  years  later,  Rosen 
[3]  realized  the  mistake  and  carried  out  the  calculations  in  "cartesian  coordinates"  and 
reported  that  the  energy  and  momentum  density  are  nonvanishing  and  reasonable. 
Though  in  the  letter  he  remarked  that  he  would  publish  the  details  elsewhere,  it  was  not 
done  until  recently  he  and  the  present  author  [4]  re-calculated  the  energy  and 
momentum  density  components  in  "cartesian  coordinates"  in  Einstein's  prescription. 
They  found  finite  and  reasonable  results. 

The  physical  interpretation  of  Einstein's  energy-momentum  complex  has  been 
questioned  by  several  physicists,  notably  by  Weyl,  Pauli  and  Eddington  [5].  Their 
objections  were  that  the  energy-momentum  complex  of  Einstein  was  neither  a  tensor 
nor  was  it  symmetric.  However,  Palmer  [6]  discussed  the  importance  of  Einstein's 
energy-momentum  complex  in  detail.  Since  the  first  energy-momentum  complex  of 
Einstein  was  given,  a  large  number  of  prescriptions  to  obtain  energy  and  momentum  in 
a  general  relativistic  system  have  been  proposed  by  many  authors  [7].  Since  there  is  no 
unique  way  of  defining  energy  and  momentum  in  a  curved  spacetime,  and  also  the 
various  energy-momentum  complexes  are  not  tensors,  some  researchers  do  not  take 
them  seriously.  They  even  guess  that  the  different  energy-momentum  complexes  could 
give  different  and  therefore  unacceptable  energy  distribution  in  a  given  spacetime. 
However,  we  have  shown  that  the  energy-momentum  pseudotensors  of  Einstein, 
Tolman  and  LL  give  the  same  result  for  energy  distribution  in  the  Kerr-Newman  as 
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well  as  in  the  Vaidya  spacetimes  when  calculations  are  carried  out  in  "cartesian 
coordinates"  [8-10].  We  have  already  mentioned  that  Rosen  and  the  present  author 
obtained  energy  and  momentum  density  for  the  cylindrical  gravitational  waves  in 
"cartesian  coordinates"  in  Einstein's  prescription  and  found  reasonable  result.  There- 
fore, it  is  of  interest  to  investigate  whether  or  not  other  energy-momentum  complexes 
give  the  same  result.  This  is  the  aim  of  this  paper.  We  consider  here  the  energy- 
momentum  complexes  of  Tolman  and  LL.  This  paper  is  organized  as  follows.  Section  2 
gives  the  Einstein-Rosen  metric  describing  the  cylindrical  gravitational  waves.  In  §  3 
and  §4,  we  obtain  energy  and  momentum  density  of  cylindrical  gravitational  waves  in 
prescriptions  of  Tolman  and  LL.  In  §  5  we  discuss  the  results  obtained.  We  use  the 
geometrized  units  such  that  G  =  1,  c  =  1  and  follow  the  convention  that  the  Latin  and 
Greek  indices  take  values  0  to  3  and  1  to  3,  respectively.  x°  is  the  time  coordinate. 

2.  Einstein-Rosen  metric 

The  Einstein-Rosen  metric  is  a  non-static  vacuum  solution  of  Einstein's  field  equations 
and  it  describes  the  gravitational  field  of  cylindrical  gravitational  waves.  It  is  given  by 
the  line  element  [1 1]: 

ds2  =  e2»-2*(dt2  -dp2)- e-2*p2d(f)2 - e2*dz2,  (1) 

where  y  =  y(p,  t),  \f/  =  \j/(p,  t),  and 

\h 

\l/    -\ — -  —  i//n  =  0,  (2) 

(3) 
(4) 

The  subscripts  on  \j/  and  y  denote  partial  derivatives  with  respect  to  the  subscripts.  It 
is  known  that  the  energy-momentum  complexes  of  Tolman  and.  LL,  like  that  of 
Einstein,  give  correct  result  if  calculations  are  carried  out  in  "cartesian  coordinates" 
[12-17].  Therefore,  one  transforms  the  line  element,  given  by  (1),  according  to 

x  =  p  cos  0, 

y  =  p  sin  0,  (5) 

and  gets  the  line  element  in  t,x,y,  z  coordinates  [4], 

<i> 

(6) 


3.  Energy  and  momentum  of  cylindrical  gravitational  waves  in  prescription  of  Tolman 

The  energy-momentum  complex  of  Tolman  is  [13-14] 
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+±gtnr°M  (8) 

is  the  energy  density,  ^~°  are  the  momentum  density  components,  and  y\  are  the 
nponents  of  energy  current  density.  The  required  nonvanishing  components  of  C/J/ 


rroi.- 
0   ~      2p2      ' 


2p 


" 


22 

u°  = 


l(p^-l).  (9) 

ig  (9)  in  (7)  we  obtain 


°  87T 

X^ 

~ 


-3  =  ^ro  =  0.  (10) 
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4.  Energy  and  momentum  of  cylindrical  gravitational  waves  in  prescription  of  LL 

The  energy-momentum  complex  of  LL  is  [15] 

j  mn  _  cmjnk  (11) 

L    ~16ff        Jfc' 
where 

gnjnk  _  _  g(gmngJk  —  gmkgj").  ( * ^) 

L""1  is  symmetric  in  its  indices.  L°°  is  the  energy  density  and  L0ot  are  the  momentum 
(energy  current)  density  components.  Smjnk  has  symmetries  of  the  Reimann  curvature 
tensor.  The  required  nonvanishing  components  of  Smjnk  are 


n.n, 


-  1) 


00202  _        V^      ^      '    y    ' 

O  — 2  '    ' 

O0303  __  .,2(y-2^) 

Making  use  of  (13)  in  (11)  we  obtain  energy  and  momentum  (energy  current)  density 
components  for  the  cylindrical  gravitational  waves  in  LL  prescription 


8?r 
L01=-~"° 


4np 


L03  =  0.  (14) 

5.  Discussion 

One  can  see  from  (10)  and  (14)  that  the  energy-momentum  complexes  of  Tolman  and 
LL  give  the  same  energy  and  energy  current  densities  as  given  by  Einstein's  prescription 
[4].  The  momentum  density  components  in  the  Tolman  prescription  are  the  same  as  in 
the  Einstein  prescription.  The  momentum  density  components  in  Tolman  and  LL 
prescriptions  differ  by  a  sign,  as  expected.  The  energy  density  of  the  cylindrical 
gravitational  waves  is  finite  and  positive  definite,  and  the  momentum  density  compo- 
nents reflect  the  symmetry  of  the  spacetime.  Thus  our  investigations  do  not  support  the 
conjecture  of  Scheidegger  [1]. 

Due  to  non-tensorial  nature  of  energy-momentum  complexes,  some  physicists  do 
not  take  them  seriously.  They  even  conjecture  that  different  energy-momentum 
complexes  could  give  different  energy  distributions  in  a  given  spacetime.  To  this  end  we 
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ve  shown  that  the  well-known  energy-momentum  complexes  of  Einstein,  Tolman 
d  LL  give  the  same  and  reasonable  result  for  the  Kerr-Newman,  Vaidya,  and 
nstein-Rosen  spacetimes  (see  also  ref.  4,  7-10). 
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Abstract.  Acoustic  non-linearity  parameter  B/A  is  calculated  for  five  binary  liquid  mixtures 
using  Tong  and  Dong  equation  along  with  the  Flory's  statistical  theory.  Similar  to  other  excess 
thermodynamical  quantities  an  excess  non-linearity  parameter  (B/A)E  is  defined  for  binary  liquid 
mixtures.  The  interactions  in  the  liquid  mixtures  are  explained  on  the  basis  of  the  excess 
non-linearity  parameter. 

Keywords.  Non-linearity  parameter;  excess  non-linearity  parameter;  binary  liquid  mixture; 
ultrasonic  velocity;  isentropic  compressibility. 

PACS  Nos    43-00;  43-25 

1.  Introduction 

The  non-linearity  parameter  B/A  plays  a  significant  role  in  non-linear  acoustics  and  its 
determination  is  of  increasing  interest  in  a  number  of  areas  ranging  from  underwater 
acoustics  to  medicine.  B/A  of  liquids  can  be  obtained  from  the  variation  of  sound 
velocity  with  temperature  and  pressure.  A  number  of  experimental  and  theoretical 
studies  have  been  performed  on  the  non-linearity  parameter  of  liquids  making  use  of 
phenomenological  [1-3]  or  thermodynamic  methods  [4, 5].  Also  some  studies  on  B/A 
have  been  performed  from  the  view  point  of  theory  of  liquids  [6].  B/A  values  for 
organic  liquids  [2,3,7],  liquid  mixtures  [6,8,9]  and  biological  samples  [10, 11]  have 
been  reported.  Tong  lie  et  al  [12]  calculated  the  B/A  values  for  pure  liquids  making  use 
of  Schaaff  s  equation  [13]  for  sound  velocity.  Although  detailed  studies  of  the  non- 
linearity  parameter  have  been  made  for  many  pure  liquids,  such  studies  are  sparse  in 
the  case  of  liquid  mixtures. 

In  the  present  study,  we  have  calculated  the  non-linearity  parameter  of  five  binary 
liquid  mixtures  of  benzene,  toluene,  chlorobenzene,  bromobenzene,  nitrobenzene  and 
methyl  ethyl  ketone  (MEK),  with  MEK  as  a  common  component.  We  make  use  of  the 
Tong  and  Dong  equation  [12]  for  pure  liquids  and  Flory's  statistical  theory  [14, 15]  for 
calculating  the  B/A  values.  The  thermodynamical  parameters  in  Tong's  equation  are 
calculated  for  binary  mixtures  using  Flory's  theory.  The  variations  of  B/A  with  mole 
fraction  of  MEK  are  discussed.  Also  similar  to  other  thermodynamical  parameters,  we 
define  an  excess  non-linearity  parameter  (B/A)E  for  liquid  mixtures. 

2.  Theory 

Tong  et  al  [12]  applied  Schaaff  s  equation  for  sound  velocity  in  the  basic  equation  for 


J(o)  =  (1-1/7)  (2) 

and 

-2  (3) 


3(x-l)(6-5x) 

Here  y  is  the  ratio  of  specific  heat,  p  is  the  density,  KT  is  the  isothermal  compressibility, 
T  is  the  temperature,  /?  is  the  expansibility  and  x  is  the  real  volume  of  a  molecule,  .x  is  the 
ratio  of  molecular  weight  to  pb,  where  b  is  the  van  der  Waals'  constant  given  by 
b  =  (\6/3)nr03N,r0  being  the  molecular  radius  and  N  the  Avogadro  number. 

In  the  present  study  the  thermodynamical  parameters  containing  in  (2)  and  (3)  and 
which  are  required  to  calculate  B/A  values  of  binary  mixtures  are  obtained  as  follows. 

The  characteristic  and  reduced  parameters  for  pure  components  are  obtained  from 
Flory's  statistical  theory  [14, 15]  and  those  of  mixtures  are  calculated  by  the  procedure 
adopted  by  Pandey  [16, 17].  The  thermodynamical  parameters  /?,  KT,  CP  and  y  for  the 
mixtures  which  are  necessary  to  calculate  B/A  for  the  mixtures  are  obtained  by  the 
relation  suggested  by  Khanwalkar  [18].  The  van  der  Waals'  constant  b  and  x  for  the 
mixtures  are  calculated  as 

b  =  ylbi-+yzbz  (4) 

and 

ylM1+y2M2  ,,, 


where  yi  and  y2  are  the  mole  fractions  and  M^  and  M2  are  the  molecular  weights  of  first 
and  second  component  respectively  and  p12  is  the  density  of  the  mixture. 

The  excess  properties  of  the  mixtures  are  defined  as  the  difference  between  the  thermo- 
dynamic  functions  of  mixing  for  a  real  solution  in  excess  to  those  for  an  ideal  solution.  In 
studying  liquid  mixtures,  excess  properties  are  more  important  than  thermodynamical 
properties  [19].  Similar  to  excess  therrnodynamic  functions  for  binary  liquid  mixtures  we 
define  an  excess  non-linearity  parameter  (B/A)E  for  binary  mixtures  as  follows: 

(B/A)E  =  (B/A)mix-(B/A)id  (6) 

where  (B/A)mix  is  the  theoretically  calculated  values  and  (B/A)ld  is  that  obtained  from 
ideal  components  defined  as 

(7) 


Here  1  and  2  refer  to  first  and  second  components. 

The  sound  velocity,  density,  adiabatic  compressibility  and  molecular  radius  required 
for  the  present  calculations  are  taken  from  the  literature  [20,  21]. 

3.  Results  and  discussion 

Acoustic  non-linearity  parameter  B/A  calculated  for  five  binary  mixtures  at  two 
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Table  1.     Mole  fraction  (yl }  of  methyl  ethyl  ketone,  sound  velocity  (c),  isentropic 
compressibility  (Ks),  and  non-linearity  parameter  (B/A)  of  binary  mixtures. 


303-15  K 

313-15K 

Vi 

c(m/s) 

K^TPa'1) 

B/A 

c(ra/s)   K, 

,<TPa-') 

B/A 

Methyl  ethyl 

ketone  +  Benzene 

o-oooo 

1276 

707 

10-76 

1230 

770 

9-79 

0-1126 

1269 

722 

10-34 

1226 

783 

9-52 

0-2249 

1260 

739 

9-97 

1217 

802 

9-29 

0-3113 

1253 

753 

9-72 

1210 

817 

9-14 

0-4163 

1244 

771 

9-46 

1201 

837 

8-99 

0-5232 

1234 

791 

9-23 

1191 

859 

8-88 

0-6045 

1225 

809 

9-08 

1182 

878 

8-83 

0-7148 

1212 

835 

8-92 

1169 

908 

8-81 

0-8265 

1197 

865 

8-83 

1154 

941 

8-88 

1-0000 

1173 

915 

8-81 

1128 

1002 

9-20 

Methyl  ethyl 

ketone  +  Toluene 

o-oooo 

1291 

699 

11-62 

1248 

757 

10-44 

0-1367 

1281 

716 

10-98 

1240 

773 

9-99 

0-2214 

1274 

728 

10-61 

1233 

786 

9-72 

0-3435 

1263 

746 

10-13 

1222 

807 

9-39 

0-4318 

1255 

761 

9-81 

1214 

822 

9-19 

0-5406 

1243 

782 

9-47 

1202 

846 

8-98 

0-6267 

1232 

801 

8-85 

1191 

867 

8-85 

0-7122 

1219 

824 

9-03 

1178 

893 

8-78 

0-8178 

1203 

855 

8-86 

1162 

927 

8-78 

1-0000 

1173 

915 

8-81 

1128 

1002 

9-20 

Methyl  ethyl  ketone  +  Chlorobenzene 

0-0000 

1245 

589 

12-02 

1206 

634 

10-85 

0-1068 

1238 

611 

11-40 

1197 

660 

10-41 

0-1972 

1235 

629 

10-97 

1194 

679 

10-09 

0-3202 

1229 

656 

10-41 

1188 

709 

9-68 

0-4174 

1224 

680 

10-01 

1183 

736 

9-39 

0-5576 

1218 

718 

9-55 

1177 

776 

9-06 

0-6242 

1213 

740 

9-37 

1172 

801 

8-94 

0-7288 

1201 

782 

9-06 

1160 

848 

8-81 

0-8531 

1187 

840 

8-84 

1146 

911 

8-80 

1-0000 

1173 

915 

8-81 

1128  • 

1002 

9-20 

Methyl  ethyl  ketone  +  Bromobenzene 

0-0000 

1146 

514 

11-70 

1109 

554 

10-80 

0-1221 

1144 

542 

11-10 

1104 

587 

10-35 

0-1917 

1141 

562 

10-80 

1101 

609 

10-09 

0-2946 

1139 

591 

10-41 

1099 

641 

9-77 

0-4069 

1142 

623 

10-02 

1102 

676 

9-46 

0-5335 

1146 

665 

9-59 

1106 

721 

9-15 

0-6748 

1152 

721 

9-19 

1112 

782 

8-89 

0-8126 

1155 

796 

8-90 

1115 

863 

8-80 

1-0000 

1173 

915 

8-81 

>1  128 

1002 

9-20 

(Continued) 


Table  1.    (Continued) 


303-15K  313-15K 


* 

c(m/s) 

K/TPa'1) 

B/A 

c(m/s)   Ks 

(TPa-1) 

B/A 

Methyl  ethyl 

ketone  +  Nitrobenzene 

0-0000 

1456 

395 

14-20 

1408 

426 

12-71 

0-0986 

1426 

424 

13-15 

1386 

452 

12-05 

0-2098 

1388 

463 

12-27 

1348 

494 

11-31 

0-3268 

1154 

506 

11-42 

1314 

542 

10-58 

0-4182 

1236 

537 

10-84 

1296 

575 

10-08 

0-5212 

1312 

579 

10-24 

1272' 

621 

9-64 

0-6458 

1277 

642 

9-64 

1239 

691 

9-1-5 

0-7512 

1241 

715 

9-19 

1201 

771 

8-87 

0-8119 

1223 

759 

8-99 

1183 

819 

8-78 

1-0000 

1173 

915 

8-81 

1128 

1002 

9-20 

different  temperatures  are  given  in  table  1.  The  table  also  contains  B/A  of  pure  liquids. 
It  is  found  that  B/A  values  decrease  as  the  mole  fractions  of  MEK  increases.  The  B/A 
for  liquids  has  been  interpreted  as  the  quantity  representing  the  magnitude  of  the 
hardness  of  the  liquid  [6]  which  may  be  considered  to  be  true  for  the  liquid  mixtures  as 
well.  For  all  the  five  mixtures,  as  the  mole  fraction  of  MEK  increases,  the  density  and 
ultrasonic  velocity  decrease  and  hence  the  compressibility  increases.  Therefore  as  the 
mole  fraction  of  MEK  increases  the  b^lk  modulus,  hence  B/A  must  decrease. 

Subramanyam  Reddy  [20]  reported  that  the  excess  isentropic  compressibility  for  all 
the  five  binary  mixtures  considered  in  the  present  study  are  negative.  A  plot  of  the 
calculated  excess  B/A,  (B/A)E,  against  mole  fraction  of  MEK  in  the  mixtures  is  shown  in 
figure  1.  It  is  interesting  to  note  that  (B/A)E  values  are  all  negative  and  show  the  same 
trend  as  that  of  the  reported  excess  isentropic  compressibility  [20].  A  negative 
compressibility  is  an  indication  of  strong  interaction  in  the  liquid  mixtures  [22],  which 
has  been  attributed  to  charge  transfer,  dipole-induced  dipole  and  dipole-dipole  interac- 
tion while  a  positive  sign  indicating  weak  interaction  has  been  interpreted  as  due  to 
dispersion  forces  [23].  In  MEK-benzene  system  there  is  a  dipole  interaction  between 
n  electrons  in  the  benzene  ring  and  the  carbonyl  group  >Co  of  MEK.  Also  in 
MEK-toluene  there  is  dipole  interaction  between  >CO  and  CH2  group.  In  the  other 
mixtures  the  same  type  of  interactions  are  present  between  >  CO  and  -Cl,  -Br  or 
-NO2 .  The  difference  in  the  magnitude  of  (B/A)E  for  different  mixtures  is  attributable 
to  the  difference  in  the  extent  of  interactions  present  in  them.  It  may  be  inferred  that 
a  more  negative  value  of  (B/A)E  indicates  a  strong  interaction  between  the  different 
groups  in  the  system.  This  is  in  accordance  with  the  results  of  the  study  of  excess 
isentropic  compressibility  values  [20]. 

4.  Conclusion 

Acoustics  non-linearity  parameter  B/A  is  calculated  for  five  binary  mixtures  with 
methyl  ethyl  ketone  as  common  component  at  two  different  temperatures.  An  excess 
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•*•  MEK  4-  BENZENE  *  MEK  +  TOLUENE 

-•-MEK  +  CHLOROBENZENE  •*  MEK  +  NITROBENZENE 


-1.2 


0       0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9       1 
MOLEFRACTION 

Figure  1.    Variation  of  excess  B/A  with  mole  fraction  of  methyl  ethyl  ketone  a' 
303- 1 5  K. 

non-linearity  parameter  (B/A)E  which  shows  the  same  type  of  variation  as  isentropk 
compressibility  with  mole  fraction  has  been  defined.  It  is  concluded  that  the  (B/A)E  i{ 
also  an  important  quantity  similar  to  other  thermodynamic  parameters  useful  tc 
explaining  the  interaction  in  binary  liquid  mixtures. 


References 

[1]  M  Paul  Hagelberg,  J.  Acoust.  Soc.  Am.  41,  564  (1967) 

[2]  A  B  Coppens  and  Robert  T  Beyer,  J.  Acoust.  Soc.  Am.  38,  797  (1965) 


j  j  uyun  KL  ui 

[3]  Charles  A  Cain,  J.  Acoust.  Soc.  Am.  80,  685  (1986) 

[4]  Harumi  Endo,  J.  Acoust.  Soc.  Am.  71,  330  (1986) 

[5]  B  K  Sharma,  J.  Acoust.  Soc.  Am.  73,  106  (1983) 

[6]  Harumi  Endo,  J.  Acoust.  Soc.  Am.  83,  2043  (1988) 

[7]  Wesley  N  Cobb,  J.  Acoust.  Soc.  Am.  73,  1525  (1983) 

[8]  Chandra  M  Sehgal  and  Burdett  R  Porter,  J.  Acoust.  Soc.  Am.  79,  566  (1986) 

[9]  Laszlo  Alter  and  E  A  Hiedemann,  J.  Acoust.  Soc.  Am.  34, 410  (1962) 
[10]  A  P  Sarvazyan  and  T  V  Gnalikian,  J.  Acoust.  Soc.  Am.  88, 1555  (1990) 
[11]  W  K  Law  and  L  A  Frizzell,  J.  Acoust.  Soc.  Am.  74, 1295  (1983) 
[12]  Tong  lie  and  Dong  Yanwu,  Kexue  Togbao  33,  1511  (1988) 
[13]  W  Schaaffs,  Z.  Phys.  115,  69  (1940) 
[14]  P  J  Flory,  J.  Am.  Chem.  Soc.  87, 1833  (1965) 
[15]  A  Abe  and  P  J  Flory,  J.  Am.  Chem.  Soc.  87,  1838  (1965) 
[16]  J  D  Pandey,  J.  Chem.  Soc.  Faraday  Trans.  I  75,  2160  (1979) 
[17]  J  D  Pandey,  J.  Chem.  Soc.  Faraday  Trans.  II 76, 1215  (1980) 
[18]  M  S  Khanwalkar,  Acoust.  Lett.  13,  121  (1990) 
[19]  Shao-muMa  and  Henry  Eyring,  J.  Chem.  Phys.  42, 1920  (1965) 
[20]  K  Subramanyam  Reddy,  J.  Chem.  Eng.  Data  31,  238  (1986) 
[21]  S  Ernst  and  J  Glinski,  Acustica  48,  109  (1981) 
[22]  R  J  Fort  and  W  R  Moore,  Trans  Faraday  Soc.  61,  2102  (1965) 
[23]  K  Sivanarayanan  and  S  Prakash,  Acustica  50,  286  (1982) 


226  Pramana  -  J.  Phys.,  Vol.  45,  No.  3,  September  1995 


Gyrotropic  and  piezoelectric  coefficients  and  attenuation  of  elastic 
waves  in  BaTiO3 
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Abstract.  Using  Landau  theory  of  phase  transition,  expressions  for  gyrotropic  coefficients,  and 
piezoelectric  coefficients  are  obtained  for  barium  titanate  in  the  tetragonal  phase.  Both  coeffi- 
cients vanish  at  the  ferroelectric  phase  transition  temperature.  The  piezoelectric  coefficients 
tallied  with  the  literature  values.  The  attenuation  coefficients  for  elastic  waves  propagating  along 
the  principal  directions  in  tetragonal,  orthorhombic  and  rhombohedral  phases  are  derived  based 
on  Landau  theory.  It  is  predicted  that  there  will  be  slight  amplification  for  both  longitudinal  and 
transverse  modes  in  the  rhombohedral  phase  at  a  temperature  close  to  the  rhombohedral  phase 
transition  temperature. 

Keywords.  Gyrotropic  and  piezoelectric  coefficients;  attenuation;  elastic  waves;  phase  transition. 
PACS  Nos  64-70;  62-20;  64-60 

1.  Introduction 

Barium  titanate  is  a  classical  example  of  a  substance  undergoing  a  first  order  phase 
transition.  It  is  paraelectric  above  Curie  temperature  (Tc),  120  °C,  and  has  a  cubic 
perovskite  structure.  At  the  Curie  point  the  structure  changes  from  cubic  m3m  (Oh)  to 
tetragonal  4  mm  (C4K)  with  spontaneous  polarization  (SP)  along  (001)  direction.  Below 
5  °C  the  SP  points  in  the  (01 1)  direction  and  the  crystal  system  is  orthorhombic  2  mm 
(C2K).  On  cooling  below  —  70  °C  it  undergoes  a  further  phase  transition  to  rhom- 
bohedral 3  m  (C3V)  with  SP  along  (111)  direction. 

Considering  the  tetragonal,  orthorhombic  and  rhombohedral  phases  as  strained 
forms  of  cubic  phase  the  author  obtained  expressions  for  elastic  anomalies  [1-3]  using 
Landau  theory  of  phase  transitions.  Using  the  second  order  elastic  anomalies  already 
published  [1-3]  expressions  for  the  gyrotropic  and  piezoelectric  coefficients  for  the 
barium  titanate  in  the  tetragonal  phase  are  derived.  Expressions  are  also  obtained 
for  the  attenuation  of  elastic  waves  travelling  along  the  principal  directions  (100),  (110) 
and  (111). 

2.  Gyrotropic  coefficients 

When  spatial  dispersion  is  present  in  a  system  the  elastic  constant  matrix  [4]  acquires 
the  structure. 

Cu(ak)=C0.(Q)  +  ^ji(Q,k)k1  (1) 

where  k  denotes  the  wave  vector  of  the  acoustic  wave,  Cl  its  circular  frequency.  dtjl  are 
gyrotropic  tensor  components  (5th  rank).  Their  number  depends  on  the  crystal  system 
and  these  have  been  worked  out  by  Kumaraswamy  and  Krishnamurthy  [4]. 
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The  free  energy  F  of  the  crystal  can  be  written  as  the  sum  of  (i)  elastic  energy  Fela  (ii) 
the  Landau  energy  FL  (iii)  the  energy  due  to  coupling  between  order  parameter  (in  this 
case  SP)  and  strains  Fc  (iv)  the  gyrotropic  energy  involving  SP  and  elastic  strains  and 
also  the  spatial  derivatives  of  SP  and  those  of  strains  Fg. 

F  =  Fela  +  FL  +  Fc  +  Fg  (2) 

Since  the  term  Fg  is  usually  much  smaller  than  Fc  it  can  be  neglected  in  the  calculation 
of  elastic  anomalies  and  Fg  should  be  taken  into  account  while  estimating  gyrotropic 
coefficients.  The  expressions  for  Fela,  FL  and  Fc  are  given  in  earlier  publications  on 
barium  titanate  [1-3]. 
Fg  is  chosen  as  equal  to 

*i  C(Px(^!/3x)  +  Py(dr,2/dy))  - 

l(Px(dr}2/dx)  +  Py(dnJdy}}  -  (n2(dPx/dx) 
3  L(Px(dr,5/dz)  +  Py(dnjdz}}  -  (n5(dPx/dz) 
£(Px(dri3/dx)  +  Py(dri3/dy))  -  (>l3(dP 
+  B5  l(Px(d 


+  B8i(Pz(dr,s/dx)  +  Pz(drjjdy))  -  (r,5(dPJdx)  +  ^(dPJdy)}}  (3) 

where  Bt(i  =  l  to  8),  rit(i=l  to  6)  and  PK(K  =  x,y,z)  are  constants,  strains  and 
components  of  SP  respectively.  Fg  is  chosen  so  that  it  is  invariant  under  the  transform- 
ation of  the  generating  matrices  of  the  tetragonal  group  4  mm  (C4K).  Each  term  in  the 
square  brackets  in  the  expression  for  Fg  given  above  is  a  difference  of  two  terms  which 
are  individually  invariant  under  the  symmetry  operations  of  the  tetragonal  groups.  It  is 
also  necessary  to  choose  equal  but  opposite  coefficients  (Si's)  for  the  two  terms.  This 
was  warranted  by  the  fact  that  the  gyrotropic  coefficients  dijt  must  change  sign  [4]  for 
any  interchange  of  indices  i  and;.  The  folio  wing  relations  for  the  equilibrium  values  of 
the  components  of  SP  and  their  fluctuations  for  the  tetragonal  phase  are  taken  from  the 
earlier  publications  [1,2], 

^o  =  ^2o=0  (4) 

PJ0  =-aP  =  0-284  x  101  2  a.  (5) 

P*  =  Art*  (6) 

P*y=An*  (7) 

Pt^Zwf  (8) 

i=l 

where  a  =  a'(  T  —  Tc)  with  a'  a  positive  constant.  P  is  a  constant  and  qf  (i  =  1  to  6)  are 
fluctuations  in  strains  about  their  equilibrium  values. 

0-502  Xl06 

(9) 


+  iO.ii) 
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_  0-0227  x  106 

a1=a2=  -(2r3012Pz0)/P3=— -=- — -  (10) 

v/a(l  +  iQi) 

(11) 

(12) 
(13) 
(14) 
The  relaxation  time  T  is  defined  as 

;  =  1  to  3    and    K  =  x,y,z  (15) 


The  numerical  values  for  P,  A,  o^  =  a2,  and  a3  are  calculated  using  the  values  of  elastic 
constants,  gn,g12  an<*  044  taken  from  Devonshire  [5]. 

The  cubic  phase  has  inversion  symmetry,  hence  all  the  dfjl  are  zero  in  this  phase.  In 
the  tetragonal  phase  dtjl  arises  due  to  the  absence  of  the  inversion  symmetry.  By 
substituting  PK  =  PKO  +  P%  and  rj(  =  nio  +  n.?  in  the  expression  for  Fg  the  gyrotropic 
coefficients  are  given  by  [6] 

rfij.,=_g_.  (16) 

The  following  gyrotropic  coefficients  can  be  obtained  with  the  relations  among  them. 

106  (17) 


B  0'0227 

K>'  (18) 


'°-32°     xlO«  (20) 


D     A.  CAT 


All  the  above  five  independent  gyrotropic  coefficients  appropriate  to  C4V  are  listed  by 
Kumaraswamy  and  Krishnamurthy  [4],  The  coefficients  are  complex  and  propor- 
tional to  [(T—  Tc)1/2(l  +  j'Qt)]"1.  The  temperature  variation  of  i  is  given  by 
Lemanov  [7] 


3.  Piezoelectric  coefficients 

When  the  free  energy  is  expressed  in  terms  of  strains  and  electric  field,  one  o 
piezoelectric  stress  coefficients  emi  [8].  If  free  energy  is  expressed  in  terms  of  stress 
electric  field,  piezoelectric  strain  coefficients  dmi  will  result  [8].  Using  polari 
theory  and  by  replacing  electric  field  with  polarization  in  the  expression  for  free  < 
one  can  also  get  the  a  and  b  coefficients  (ami  and  bmi)  as  referred  to  by  Cady  [fr 
The  expression  for  the  first  thermodynamic  potential  is  given  by 


where  C?.  are  elastic  constants  at  constant  polarization,  iskm  are  inverse  suscepti 
at  constant  strain  (clamped  constants  or  high  frequency  values)  PK  are  compom 
polarization  and  rj{  are  strains.  By  introducing  PK  =  PKQ  4-  P£  and  *?,-  =  ^0  +  *?* 
expression  for  free  energy  F  =  /pela  +  FL  +  Fc  and  comparing  it  with  (23)  the  pie. 
trie  coefficients  ami  are  the  coefficients  of  P%nf.  For  the  tetragonal  phase  of  t 
titanate  they  are  listed  with  their  numerical  values  as 


o  =  -11-94V/V(T'-  Tc) 


-  Tc)  x 


(T-  Tc)  x  106. 

Devonshire  [9]  derived  the  following  relations  between  C£,  ami,  bmi  and  rj^n  (s 
tibilities  at  constant  stress)  as 


The  values  of  dmi  are  of  the  order  10  ~6  C£'s  are  of  the  order  1012,bmi  are  of  the 
10"  7  and  r\^n  are  of  the  order  10  to  100.  The  piezoelectric  constants  calculated 
author  from  Landau  theory  are  of  the  order  106  and  tally  with  the  reported  value 
coefficients  a33,  a32  and  ais  vanish  at  T  =  Tc  and  they  vary  as  \/T—  Tc. 

4.  Attenuation 

Lemanov  obtained  an  expression  for  the  attenuation  coefficient  (a)  in  terms  of  re 
imaginary  parts  of  the  elastic  constants  [7]  as 

a  =  (Q/2)ImaCy/ReC/y 

The  elastic  anomalies  reported  by  the  author  in  different  phases  of  barium  titan; 
complex  [1-3].  Using  (29)  expressions  for  the  attenuation  coefficients  (a's)  < 
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written  in  terms  of  Qi  =  x  for  the  longitudinal  and  transverse  waves  along  the  principal 
directions  in  tetragonal,  orthorhombic  and  rhombohedral  phases  of  barium  titanate 
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The  variation  of  attenuation  coefficient"!  with  nT     v.   i       ^ 
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Figure  1.    (Continued) 
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Figures  1  (a)-(c).  Represent  variation  of  attenuation  coefficients  with  the  product 
of  circular  frequency  and  relaxation  time  (fit)  in  tetragonal,  orthorhombic  and 
rhombohedral  phases  respectively.  In  each  figure  curve  (a)  an  (b)  alu  and  (c) 
aj  (  represent  attenuation  coefficients  of  longitudinal  mode  along  (100),  (1 10)  and 
(li'i)  directions  respectively.  Curve  (d)  afi,  (e)  a/u  and  (f)  arm  represent  attenuation 
coefficients  of  transverse  mode  along  (100),  (1 10)  and  (1 1 1)  directions  respectively. 

maximum  amplification  for  QT  =  0-1  which  corresponds  to  ( T  —  Tc)  ^  0-1  K.  Unfortu- 
nately, there  are  no  experimental  data  available  for  comparison.  It  is  hoped  that  the 
present  work  will  stimulate  experimental  investigations  to  prove  the  validity  of  the 
theoretical  predictions. 
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Abstract.  Poly(L-leucine)  is  one  of  the  polyamino  acids  having  a  bulky  hydrophobic  side- 
chain.  For  want  of  full  phonon  dispersion  curves  and  density-of-states  on  this  biopolymer  Roles 
et  al  have  interpreted  their  specific  heat  data  in  a  limited  way.  In  the  present  communication  we 
report  an  analysis  of  the  normal  modes  and  their  dispersion  for  poly(L-leucine)  which  leads  to 
a  very  good  agreement  of  the  specific  heat  calculations  with  experimental  measurements.  It  is 
observed  that  the  main  contribution  to  specific  heat  comes  from  the  coupling  of  the  back-bone 
skeletal  and  side-chain  modes.  Several  other  assignments  have  been  revised.  A  special  feature  of 
some  dispersion  curves  is  their  tendency  to  bunch  in  the  neighbourhood  of  helix  angle.  It  has 
been  attributed  to  the  presence  of  strong  intramolecular  interactions.  Repulsion  between  the 
dispersion  curves  is  also  observed. 

Keywords.  Conformation;  phonon  dispersion;  a-helix;  normal  modes;  poly(L-leucine);  density- 
of-states;  specific  heat. 

PACSNo.    34'80 


1,  Introduction 

In  recent  publication  Wunderlich  and  Bu  [1],  Bu  et  al  [2],  Roles  and  Wunderlich  [3] 
and  Roles  et  al  [4]  have  reported  experimental  and  theoretical  studies  of  heat  capacities 
of  a  variety  of  polymeric  systems,  synthetic  as  well  as  biopolymeric.  In  most  of  the  cases, 
their  analysis  is  based  on  separation  of  the  vibrational  spectrum  into  group  and  skeletal 
vibrations.  The  former  are  taken  from  computations  fitted  to  IR  and  Raman  data  and 
the  latter  by  using  the  two  parameter  Tarasov  model  [1]  and  fitting  to  low  temperature 
heat  capacities.  However  in  a.  few  cases,  where  detailed  dispersion  curves  of  the 
vibrational  spectrum  are  available  [5-9],  they  have  been  used  for  obtaining  group  and 
skeletal  vibrations  and  number  of  vibrators  of  each  type.  In  some  cases,  dispersion 
curves  for  one  polymeric  system  have  been  used  to  obtain  the  number  of  vibrators  and 
frequencies  of  box  oscillators  for  polymers  with  identical  back-bone.  This  approach  is 
OK  when  full  dispersion  curves  are  not  available.  However  it  has  its  own  limitations, 
specially  when  the  side-chain  and  back-bone  modes  are  strongly  coupled.  In  the 
present  communication,  we  report  for  poly(L-leucine)  (PLL),  full  dispersion  curves, 
density-of-states  and  specific  heat  data  which  is  found  to  be  in  good  agreement  with  the 
experimental  measurements.  Vibrational  spectroscopy  is  an  important  tool  for  probing 
conformation  and  conformationally  sensitive  modes  of  a  polymer.  In  general,  the 
infrared  absorption,  Raman  spectra  and  inelastic  neutron  scattering  from  polymeric 
systems  are  very  complex  and  cannot  be  unravelled  without  the  full  knowledge  of 
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dispersion  curves.  One  cannot  appreciate  without  it  the  origin  of  both  syrnrru 
dependent  and  symmetry  independent  spectral  features.  Further  the  presence 
regions  of  high  density  of  states  which  appear  in  all  these  techniques  and  play 
important  role  in  thermodynamical  behaviour  are  also  dependent  on  the  profile  of 
dispersion  curves.  The  lack  of  this  information  in  many  polymeric  systems  has  b< 
responsible  for  incomplete  understanding  of  polymeric  spectra.  The  advent  of  las 
and  fast  computers  has  eased  these  problems  to  a  large  extent.  Dispersion  curves  a 
provide  information  on  the  extent  of  coupling  along  the  chain  together  with 
understanding  of  the  dependence  of  the  frequency  of  a  given  mode  upon  the  sequel 
length  of  the  ordered  conformation.  Thus  the  study  of  phonon  dispersion  in  polyme 
systems  continues  to  be  an  important  one.  This  group  has  earlier  reported  vibratioi 
analyses  and  phonon  dispersion  for  a  variety  of  biopolymeric  systems  having  a,  0, 
and  310  helical  conformation  [5-15].  As  a  part  of  this  continuing  programme,  i 
report  here  a  study  of  normal  modes  of  vibrations  and  their  dispersions  for  PLL. 

Poly(L-leucine)  (figure  1)  belongs  to  the  class  of  polyamino  acids  having  bull 
hydrophobic  side-chains.  In  solid  state,  PLL  adopts  an  a-helical  conformation  ar 
molecular  models  demonstrate  that  the  formation  of  a-helix  reduces  the  side-chain 
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side-chain  steric  interference  [16].  Infrared  and  Raman  spectroscopic  studies  of  PLL 
have  been  reported  by  Koenig  and  Sutton  [17]  and  Itoh  et  al  [18].  Their  assignments 
are  both  incomplete  and  are  based  on  qualitative  considerations.  It  is,  therefore,  all  the 
more  important  to  carry  out  a  complete  normal  mode  analysis  and  their  dispersion  for 
PLL.  It  is  through  the  dispersion  curves  that  one  can  correlate  microscopic  behaviour 
of  a  crystal  with  its  macroscopic  properties  such  as  specific  heat. 

2.  Theory  and  experiment 

2.  1  Normal  mode  analysis 

The  calculation  of  normal  mode  frequencies  has  been  carried  out  according  to  Wilson's 
G  F  matrix  method  [19]  as  modified  by  Higgs  [20]  for  an  infinite  chain.  The  Wilson 
GF  matrix  method  consists  of  writing  the  inverse  kinetic  energy  matrix  G  and  the 
potential  energy  matrix  F  in  internal  coordinates  R.  In  the  case  of  infinite  isolated 
helical  polymer,  there  are  an  infinite  number  of  internal  coordinates  which  lead  to 
G  and  F  matrices  of  infinite  order.  Due  to  the  screw  symmetry  of  the  polymer 
a  transformation  similar  to  that  given  by  Born  and  Von  Karman  can  be  performed 
which  reduces  the  infinite  problem  to  finite  dimensions.  The  transformation  consists  of 
defining  a  set  of  symmetry  coordinates. 

S(d)=     £    K"exp(J5(5) 

s=  -  oo 

where  d  is  the  vibrational  phase  difference  between  the  corresponding  modes  of  the 
adjacent  residue  units. 
The  elements  of  the  G(<5)  and  F(d)  matrices  have  the  form: 

GM=     Z    G^exp(i^) 

s=  —  oo 

*"»(*)=     Z    ^exp(^) 

s=  -  oo 

The  vibrational  secular  equation  which  gives  normal  mode  frequencies  and  their 
dispersion  as  a  function  of  phase  angles  has  the  form: 


The  vibrational  frequencies  v(5)  (in  cm'1)  are  related  to  eigenvalues  A(d)  by  the 
following  relation, 

I(6)  =  4n2c2v2(6)  (1) 

For  any  given  phase  difference  6  (other  than  0  or  n),  the  G(<5)  and  F(d]  matrices  are 
complex.  In  order  to  avoid  the  difficulties  involved  in  handling  complex  numbers, 
methods  have  been  devised  to  transform  the  complex  matrices  into  equivalent  real 
matrices  by  constructing  suitable  linear  combinations  of  coordinates.  One  method  of 


M 

N 


-N 
M 


In  the  present  case,  we  can  write  G(<5)  =  GR(6)  +  iGI(8)  and  F(8)  =  FR(8)  +  L 
where  GR(6),  F*(<5),  G'(<5),  FJ(<5)  are  the  real  and  imaginary  parts  of  G(<5)  and  F(8) 
product  H(8)  =  G(<5)F(<5)  becomes 


FK(«5)     - 


FR(6) 


HR(6)     ~Hl(d 
H'(5)      HR(8) 

=  GR(5)FR(5)-G 


where 


The  matrix  H(d)  now  has  dimensions  2N  x  2JV.  The  eigenvalues,  therefore,  oc< 
pairs  of  equal  values.  The  difficulty  of  dealing  with  complex  numbers  is  thus  avo 

2.2  Force  constant  evaluation 

The  force  constants  have  been  obtained  by  the  least  square  fitting.  In  order  to  o 
the  'best  fit'  with  the  observed  frequencies  the  following  procedure  is  adopted.  Init 
approximate  force  constants  for  back-bone  are  transferred  from  a  Poly(L-alanine 
Thus  starting  with  the  approximate  F  matrix  F0  and  the  observed  freque 
Aobs  (related  through  a  constant),  one  can  solve  the  secular  matrix  equation 

GF0L0=L0A0 

Let  A/U  =  A/obs-/U'0  in  the  above  equation.  It  can  be  shown  that  in  the  first  < 
approximation 


where  J  is  computed  from  L0.  We  wish  to  compute  the  corrections  to  F0  so  tha 
errors  AA  are  minimized.  We  use  the  theory  of  least  squares  and  calculate 


where  P  is  a  weighting  matrix  and  7'  is  the  transpose  of  J.  The  solution  to  this  equ; 
is  obtained  by  inverting  (  J'PJ)  to  give 


If  the  number  of  frequencies  is  greater  than  the  number  of  F  matrix  elements,  the  m 
J'PJ  should  be  non-singular  and  we  obtain  the  corrections  AF  which  will  minimiz 
sum  of  the  weighted  squares  of  the  residuals.  If  the  corrections  AF  are  fairly  large,  the  1 
relation  between  force  constant  and  frequency  term  in  the  matrix  eq.  (2)  breaks  d 
In  such  a  situation  ,  further  refinement  using  higher  order  terms  in  the  Taylor's  s 
expansion  of  A/L;  is  needed.  This  procedure  has  been  developed  by  King  et  al  [2. 
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The  poly(L-leucine)  (Lot  No.  81H5557,  DP(vis)242,  Mw  27,4000)  was  purchased 
from  the  Sigma  Chemicals,  USA.  The  FTIR  spectra  (3000-150  cm" l )  were  recorded  in 
CSI  on  Perkin-Elmer-1800  spectrophotometer  and  are  shown  in  figures  2  (a)  and  2(b). 
Before  running  the  spectra  the  equipment  was  well  purged  with  dry  nitrogen. 

2.3  Calculation  of  heat  capacity 

One  of  the  important  uses  of  dispersion  curves  is  that  the  microscopic  behaviour  of 
a  crystal  can  be  correlated  with  its  macroscopic  properties  such  as  heat  capacity.  For 
a  one  dimensional  system  the  density-of-state  function  or  the  frequency  distribution 
function,  which  expresses  the  way  energy  is  distributed  among  the  various  branches  of 
normal  modes  in  the  crystal,  is  calculated  from  the  relation, 

j  vj(<5)  =  v 

The  sum  is  over  all  branches  j.  Considering  a  solid  as  an  assembly  of  harmonic 
oscillators,  the  frequency  distribution  g(v)  is  equivalent  to  a  partition  function.  It  can  be 
used  to  compute  thermodynamic  quantities  such  as  free  energy,  entropy,  heat  capacity 
and  enthalpy  [22].  The  constant  volume  heat  capacity  is  obtained  using  Debye's 
relation 


with 


The  constant  volume  heat  capacity  Cy,  given  by  (4)  is  converted  into  constant 
pressure  heat  capacity  CP  using  the  Nernst-Lindemann  approximation  [23] 

CP-C.  =  3RA0(C2fT/C.T°m)  (5) 

where  A0  is  a  constant  often  of  a  universal  value  [3-9  x  10~3(°Kmol)/J]  and  T°m  is  the 
estimated  equilibrium  melting  temperature,  which  is  taken  to  be  573  °K[4]. 

3.  Results  and  discussion 

In  poly(L-leucine)  there  are  19  atoms  per  residue  unit  which  give  rise  to  57  dispersion 
curves  of  vibrations.  The  vibrational  frequencies  were  calculated  for  the  values  of 
d  ranging  from  0  to  n  in  steps  of  0-057T.  Assuming  that  \j/  is  the  angle  of  rotation  about 
the  helix  axis  which  separates  the  adjacent  units,  the  modes  corresponding  to  6  =  0  and 
\l/  are  infrared  as  well  as  Raman  active  and  5  —  2\{/  gives  only  Raman  active  modes.  Here 
if/  is  the  angle  of  rotation  about  the  helix  axis  which  separates  the  adjacent  units.  For 
a-helical  PLL  ^  =  5?t/9.  The  calculated  frequencies  at  6  =  0  and  5rc/9  are  compared 
with  the  observed  ones.  The  force  constants  which  gave  best  fit  to  the  experimental  data 
are  given  in  table  1.  Since  the  modes  above  1350cm"1  (except  amide  II)  are  non- 
dispersive,  only  the  modes  below  this  are  shown  in  figure  3.  Two  lowest  lying  branches 
(6  =  0  and  d  =  5n/9,  v  =  0)  are  the  four  acoustic  modes  which  correspond  to  the  rotation 
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v(N-Ccc)  2-850  0(Ca-Q3-Cy)  0-550(0-18) 

v(Ca-Ha)  4-250  ^(Cjg-Cy-C^)  0-515(0-18) 

v(Ca-C)  2-700  c£(C/J-Cy-G52)  0-515(0-18) 

v(C=O)  8-300  <j)(Cp-Cy-Hy)  0-523(0-22) 

v(C=N)  6-300  ^(Cd^-Cy-Cd,)  0-515(0-18) 

v(H-N)  5-390  ^(Cd^Cy-VLy)  0-500(0-22) 

v(Ca-C£)  2-550  0(G52-Cy-Hy)  0-500(0-22) 

4-250  WCy-CSt-Hdi)  0-445(0-21) 

4-250  <j(>(Cy-C(52-H«$2)  0-445(0-21) 

v(C0-Cy)  2-300  (f)(Hd  t-Cd  ,-HS  ^  0-421(0-24) 

v(Cy-C52)          2-120  (j>(H62-C62-HS2)  0-421(0-24) 

vfCy-C^)          2-120  ^(Ca-CjS-HjSa)  0-360(0-20) 

v(Cy-Hy)  4-340  <j>(Cy-Cp-Hpp)  0-360(0-20) 

v(CVH<$i)        4-330  0(Cy-CjS-H/?a)  0-360(0-20) 

v(C<52-H52)        4-330  WCa-Cp-Hpp)  0-360(0-20) 

^(H^-Cj8-H^a)  0-422(0-25) 

co(CO)  0-540  i>(N-Ca-Ha)  0-320(0-80) 

co(N-H)  0-120  <i>(N-Ca-Cj3)  0-450(0-50) 

i>(Ha-Ca-C)  0-410(0-20) 

i(C-Ca)  0-010  0(Ha-Ca-Cjff)  0-410(0-20) 

r(Ca-C^)  0-050  0(C-Ca-C/?)  0-520(0-18) 

t(C)?-Cy)  0-080  0(N-Ca-C)  0-130(0-50) 

tfCy-C^i)          0-016  </>(C=N-Ca)  0-530(0-35) 

z(Cy-C62)          0-016  0(N=C=O)  0-600(0-90) 

r(C=N)  0-030  0(C=N-H)  0-200(0-65) 

t(N-Ca)  0-010  0(H-N-Ca)  0-427(0-60) 

•C=O)  0-230(0-60) 

ON)  0-210(0-60) 


Note:  v,  <f),  to,  T  denote  stretch,  angle  bend,  wag  and  torsion  respec- 
tively. Non-bonded  force  constants  are  given  in  parentheses. 


about  the  helix  axis  and  and  the  translations  ||  and  two  _L  to  the  helix  axis.  The 
assignments  of  modes  are  made  on  the  basis  of  potential  energy  distribution,  line 
intensity,  line  profile,  second  derivative  spectra  and  the  presence/absence  of  modes  in 
molecules  having  atoms  placed  in  similar  environment.  For  the  sake  of  simplicity  the 
modes  are  discussed  under  two  separate  heads  viz.  backbone  modes  and  side-chain 
modes. 

3.1  Backbone  modes 

Main-chain  of  PLL  consists  of  amide  groups  joined  together  by  Ca  atoms.  Modes 

involving  the  motions  of  main-chain  atoms  (-C-Ca-N-)  are  termed  as  backbone 

modes.  Pure  back-bone  modes  are  given  in  table  2  and  pure  side-chain  modes  in 
table  3.  The  modes  involving  the  coupling  of  back-bone  and  side-chain  are  given  in 
table  4.  All  the  amide  modes  except  amide  A  and  amide  I  are  dispersive.  A  comparison 
of  various  amide  modes  of  a  PLL  and  a  poly(L-alanine)  [6]  is  given  in  table  5.  Amide  I, 
II  and  III  modes  fall  nearly  in  the  same  region.  It  is  clear  that  the  frequency  of  amide 

242  Pramana  -  J.  Phys.,  Vol.  45,  No.  3,  September  1995 


1400 


1350 


1300  - 


1250  - 


1200  1 


1150  -- 


u 

3 
CT" 


1100 


1050  - 


1000  - 


950 


900  - 


(a) 


(b) 


C 


0    0.2   0.4   0.6   0.8 

Phase  Factor  (5/7t)  — 


0.1  0.2  0.3 

Density-of-States  g(v)  — 


0.4 


Figure  3.    (a)  Dispersion  curves  of  poly(L-leucine)  (1400-900  cm    l );  (b)  Density- 
of-states  g(v)  ( 1 400-900  cm  ~ 1). 


V  mode  does  not  depend  solely  on  main-chain  conformation  but  side-chain  structure  in 
question  also  plays  an  important  role  in  determining  the  frequency  of  this  mode.  The 
frequency  of  CO-in-plane-bend  (amide  IV),  CO-out-of-planebend  (amide  VI)  and  C-N 
torsion  (amide  VII)  are  also  conformation  dependent  [24].  These,  modes  mix  strongly 
and  very  differently  with  the  side-chain  coordinates  depending  on  the  structure  of  the 
main  and  side-chains.  In  the  spectra  of  PLL  three  bands  have  been  observed  at  614, 656 
and  703  cm'1.  According  to  Koenig  and  Sutton  [17]  these  bands  are  due  to  amide  V  in 
a,  disordered  and  ft  sheet  conformations  respectively.  The  presence  of  all  these  modes  in 
the  spectra  of  PLL  is  said  to  be  indicative  of  the  coexistence  of  different  conformational 
states.  This  does  not  appear  to  be  correct.  In  a  poly(L-alanine)  [6]  656  cm" 1  band  has 
been  assigned  to  amide  VI  which  is  calculated  in  PLL  at  647cm""1.  Amide  V  is 
calculated  at  592  cm" l  at  6  =  0.  On  increasing  the  values  of  5  contribution  of  C=O  wag 
increases  and  at  zone  boundary  it  becomes  a  mixture  of  amide  V  and  amide  VI.  The 
703  cm" 1  band  is  a  mixture  of  side-chain  stretches  and  a>(N-H).  Our  assignments  are 
further  supported  by  the  fact  that  these  three  bands  are  observed  in  poly(L-a-amino-n- 
butyric  acid),  poly(L-norvaline)  and  poly(L-norleucine)  in  a  form  and  on  N-deuter- 
ation  the  703  and  656cm"1  band  do  not  change  appreciably  in  intensities  and 


Pramana  -  J.  Phys.,  Vol.  45,  No.  3,  September  1995 


243 


<f 

cr 

/**—  • 

rs1  S 

1) 

4> 

d--  a) 

T3 

"X3 

's 

'£ 

Z  '2 

II        ^ 

S 

S 

SiS 

+ 

+• 

S 

1r 

^ 

S" 

u 

ffi 
i 

u  5 

03 

PT 

^ 

E    ^ 

Q 

X 

U 

¥  S, 

W 
OH' 

^ 

+ 

J  ± 

"e 

£ 

c^ 

u 

'S"  f7 

i-ri     v^ 
*       1 

I 

X 

^_ 

' 

,**    U 

c 

w^t 

^*) 

£-* 

\J          1 

-SP 

I 

II 

1 

1        ?J 

"cfl 
en 

i 

u 

5- 

5-  £ 

< 

~ 

tr 

-e-  • 

"S-  "©- 

^ 

f> 

r- 

oo 

00 

.0 

O 

m 

S 

in 

« 

u 

S 

i 

S 

5 

S* 

•8 

^2 

|  = 

-f   U  « 

o 

II 

's 

s 

U   'g 

2   Z'g 

II 

& 

~~^ 

s.   <; 

5li. 

i     "T 

rt 

S 

»    So" 

CO 
U 

•o 

Q 
tu 

R 

U^—  ^ 

^  o" 

+    ii 

.X  u 

O 

o 
1 

15 

u 

S 

r-j 

f 

tS^L 

u  -~- 

^  u 

1 

6"  -^ 

1 

E 
o 

0 

.2? 

+ 

X    * 

+   X 

.s 

03 
JS 

co 

G? 

Q 

r~ 

10 

'S" 

u 

1 

—  ,     8 
O    Q 

<u 

Iw 

3 

^T 

^, 

*~jr       .S  f 

Oi) 

ft 

1^ 

1 

£->          | 

(U 

E 

II 
O 

5. 

II        M 

u  E. 

V-i 

« 

fi 

^^ 

^" 

-s^ 

^    -Q. 

oo 

4> 

'5 

IS 

CO 

rrj 

fv. 

VO 

o\ 

c 

« 

0 

m 

VO 

^•^ 

2 

o 

(N 

3 
CT 

(U 

VO 

<N 

cs 

1 

u 

s 

VO 

>n 

§ 

O 

2 

244 


Pramana  -  J.  Phys.,  VoL  45,  No.  3,  September  1995 


Table  3.    Pure  side-chain  modes. 


Cal. 


Obs.      Assignments  (%  P.E.D.  at  <5  =  0-0) 


2963 

2957 
2938 
2918 
2917 
2915 
2910 
2910 
2906 
2876 
1466 
1465 
1452 
1452 
1444 

1393 
1380 
1351 
1333 
1038 

1014 

977 


931 

842 

830 

328 
282 


2960   v(Ca-Ha)(99) 

v(C<51-H(51)(99) 
2938  v(G52-H<52)(98) 
2916 


(£(H52-C52-H52)(93) 


2913 

v(G52-H<52)(98) 

v(G52-H<52X96) 
2904   v(H/Ja-Q3)(47)  +  v( 
2874  v(HjS)5-Cj3)(49)  +  v(Hj3a  -  CjS)(50) 
1471 


1454 

1441 
1390 
1370 
1350 
1334 
1040 

1020 
980 


925 

833 


323 
289 


<£(Cy-C52-H52)(22)  + 


v(Cy-C52)(5) 


v(Cy- 


v(Cy-C52)(7) 


v(Cy-C62)(30)  +  v(C7-C51)(20) 

Sa-Cj?-Cy)(7)  +  4>(Cy-C<52-H<52)(7)  +  t(Ca-C/5)(6) 


Note:  All  frequencies  are  in  cm"1. 

frequencies  [18].  Only  the  band  near  610  cm  ~  l  shifts  to  about  450cm'1  because  it  is 
due  to  amide  V.  One  of  the  interesting  features  observed  in  the  dispersion  curves  is  the 
tendency  of  some  curves  to  crowd  or  close  in  near  the  helix  angle,  which  is  indicative  of 
the  coupling  between  various  modes.  This  phenomenon  is  observed  in  amide  V  and 
amide  VI.  Amide  VI  mode  is  most  dispersive  (23  cm~  x  ).  It  has  25%  contribution  from 
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Table  5.    Comparison  of  amide  modes  of  poly-L-leucine  and  poly- 
L-alanine. 

Poly-L-leucine  Poly-L-alanine 


=  0-0        5  =  5n/9        <5  =  0-0         6  =  5n/9 


Amide  A 

3313 

3313 

3293 

3293 

Amide  I 

1657 

1657 

1659 

1659 

Amide  II 

1546 

1518 

1515 

1540 

Amide  HI 

1299 

1318 

1270 

1274 

Amide  IV 

587 

633 

525 

440 

Amide  V 

587 

617 

595 

610 

Amicle  VI 

656 

633 

685 

656 

Amide  VII 

227 

171 

238 

190 

Note:  All  frequencies  are  given  in  cm~ l. 

oj(C=O).  With  increase  in  8,  the  energy  of  this  mode  and  the  contribution  of  a>(C=O) 
decrease  whereas  ^(O=C=N)  and  co(N-H)  start  mixing.  At  d  =  i/>  it  becomes  a  mixture 
of  amide  IV  and  V.  This  characteristic  feature  has  also  been  noticed  in  a  poly(L- 
alanine)  [£j  and  poly(a-aminoisobutyric  acid)  [15]  (3 10  helix)  and  can  be  attributed  to 
strong  intramolecular  interactions  stabilizing  the  helical  structure.  The  dispersion 
curves  of  polyglycine  II  and  other  /?  sheet  samples  do  not  show  this  feature  [7, 8].  The 
observed  peak  at  227  cm  ~ l  has  been  assigned  to  amide  VII  and  this  mode  has  a  lower 
value  at  d  =s  $n/9  in  comparison  to  d  =  0.  The  P.E.D.  of  this  mode  shows  considerable 
mixing  with  Ca-N  and  Ca-C  torsions.  As  in  case  of  a  poly(L-alanine)  [6],  in  a  PLL 
also,  there  is  certain  amount  of  mixing  of  the  torsional  modes  with  the  angle  bending 
modes,  The  peak  at  543  cm  ~ l  has  been  assigned  to  the  mixture  of  (N-Ca-C/0  bend 
(junction  mode)  and  main-chain  deformation.  This  mode  shows  maximum  dispersion 
(64  cm""1), 

3,2  Side*chain  modes 

The  side-chain  [-CH2CH(CH3)2]  of  PLL  consists  of  a  CH2  attached  to  a  gerri 
dimethyl  group.  Pure  side-chain  modes  are  given  in  table  3.  The  CH2  scissoring  mode 
calculated  at  1444  cm  ~ l  corresponds  to  the  observed  band  at  1441  cm  ~~ *  in  PLL  and  at 
1453  cm~l  in  poly(y-benzyl-L-glutamate)  [17]. 

The  peaks  at  1334,  1318  and  1299 cm'1  have  been  assigned  to  (Cy-Hy)  bend, 
(Ca-Ha)  bend  and  amide  III  (back-bone  mode)  respectively.  With  increase  in  6  the 
contribution  of  (Cy-Hy)  and  (Ca-Ha)  bend  decrease  in  1334cm"1  and  1318cm"1 
mode  respectively  and  the  latter  one  starts  mixing  with  the  amide  III.  A  repulsion  takes 
place  between  1334  and  13 18  cm  ~ l  modes  in  the  neighbourhood  of  helix  angle  and  at 
the  point  of  inflexion  there  is  a  sudden  jump  in  the  density-of-states. 

The  observed  band  at  11 72 cm"1  has  been  tentatively  assigned  to  rocking  of  CH3 
group  by  Koenig  and  Sutton  [17]  but  according  to  our  normal  mode  calculations  this 
mode  is  a  mixture  of  (Ca-Ha)  bend  (40%)  and  CH2  wag  (25%).  This  is  further 
supported  by  the  fact  that  in  poly(y-benzyl-L-glutamate)  which  does  not  have  any 
methyl  group,  the  observed  Raman  band  at  1 183  cm  ~ :  has  been  assigned  to  CH2  wag 
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Figure  4.    (a)  Dispersion  curves  of  poJy(L-leucine)  (950-400  cm   * );  (b)  Density- 
of-states  g(v)  (950-400  cm  ~ * ). 


[17].  This  is  consistent  with  our  assignment.  The  CH2  rocking  mode  is  assigned  to  the 
peak  at  875cm'1  which  is  in  accordance  with  the  assignment  of  Koenig  and  Sutton 
[17].  All  pure  side  chain  modes  are  non-dispersive. 

The  modes  calculated  at  1130  and  1093cm"1  ((5  =  0)  correspond  to  the  observed 
peaks  at  1129  and  1099cm"1.  They  contain  a  mixture  of  side-chain  CH2  twist  and 
skeletal  (N-Ca)  stretch.  The  1130cm"1  mode  is  non-dispersive.  At  <5  =  0  it  has  34% 
contribution  from  CH2  twist  and  as  6  increases  the  contribution  of  (N-Ca)  stretch 
decreases  and  that  of  CH2  twist  increases.  The  other  one  (1099cm"1)  has  13% 
contribution  from  CH2  twist  (at  6  =  0)  and  shows  a  reverse  behaviour  later  on.  This 
mode  is  dispersive  and  with  increase  in  <5  the  contribution  of  (N-Ca)  stretch  increases 
and  that  of  CH2  twist  decreases.  The  modes  at  948  and  1038cm"1  (at  6  =  0)  are 
mixtures  of  CH3  rock  and  side-chain  (C-C)  stretch.  The  1038cm"1  mode  is  non- 
dispersive  and  has  very  little  contribution  from  side-chain  (C/f-Cy)  stretch.  The  other 
one  (948  cm  ~ 1 )  has  47%  contribution  from  side-chain  (Ca-C/f)  stretch  (at  <5  =  0).  In  the 
948 cm" l  mode  the  contribution  of  (Ca-C/J)  stretch  decreases  and  that  of  CH3  rock, 
(Cy-C<52)  and  (Cy-Cd^  stretches  increase  with  increase  in  5  and  the  frequency  of  this 
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mode  increases  and  in  the  neighbourhood  of  helix  angle  (<5  =  \l/}  it  exchanges  its 
character  with  the  977  cm  ~ l  mode  (CH3  rock  at  5  =  0).  After  this,  the  948  cm  ~  *  mode 
becomes  non-dispersive  and  977cm"1  mode  is  repelled  upwards.  They  belong  to  the 
same  symmetry  species  [25]. 

3.3  Specific  heat 

Recently,  "heat  capacity  measurements  for  a  series  of  polyamino  acids  have  been 
reported  by  Roles  et  al  [3-4].  Their  approach  basically  involves  separation  of  vibra- 
tional  spectra  into  group  and  skeletal  spectrum  and  obtaining  the  number  of  vibrators 
for  each  case.  These  are  obtained  from  the  dispersion  curves  and  spectra  of  polymers 
having  the  same  skeletal  structure  (e.g.  poly(L-glycine)  and  a  poly(L-alanine)  for 
poly(L-leucine)).  As  remarked  earlier  this  approach  has  its  own  limitations  specially 
when  the  back-bone  and  side-chain  modes  are  heavily  mixed  up.  This  is  very  true  in 
case  of  poly(L-leucine)  below  1200cm"1.  The  density-of-states  are  shown  in  figures 
3(b),  4(b)  and  5(b).  Modes  which  are  purely  skeletal,  purely  side-chain  and  mixture  of 
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Figure  5.    (a)  Dispersion  curves  of  poly(L-leucine)  below  500cm    *,  (b)  Density- 
of-states  g(v)  below  500 cm" ! . 
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these  two  are  given  in  tables  2,  3  and  4  respectively.  Their  contributions  to  the  heat 
capacities  are  shown  in  figure  6A,  6B  and  6C  respectively  in  the  temperature  range 
220-390  K.  Total  heat  capacity  is  shown  in  figure  6D  and  A's  represent  the  experimen- 
tal data  of  Roles  et  al  [4].  The  calculations  are  in  very  good  agreement  with  the 
experimental  measurements  except  in  the  region  300-330  K  wherein  experimental  data 
show  a  bump.  Although  not  explicitly  mentioned,  this  bump  could  be  due  to  some 
artifacts  in  experiments,  such  as  presence  of  moisture.  It  should  disappear  on  repeating 
measurements.  Presence  of  any  glass  type  transition  is  also  not  reported.  Our  method- 
ology gives  better  agreement  with  experimental  data  substantiated  by  the  r.m.s. 
deviation  value  which  is  0-83%  as  compared  with  1-09%  of  Roles  et  al  [4]. 

The  contribution  from  the  lattice  modes  is  bound  to  make  an  appreciable  difference 
to  the  specific  heat  because  of  its  sensitivity  to  these  modes.  At  the  moment,  the 
calculation  for  dispersion  curves  for  a  unit  cell  are  extremely  difficult  because  even  if  we 
assume  a  minimum  of  two  chains  in  unit  cell  then  there  would  be  180  atoms  leading  to 
a  matrix  of  540  x  540.  It  would  bring  in  an  enormous  number  of  interactions  and  make 
the  problem  almost  intractable.  Thus  in-spite-of  several  limitations  involved  in  the 
calculation  of  specific  heat,  the  present  work  does  provide  a  good  starting  point  for 
further  basic  studies  on  thermodynamical  behaviour  of  polypeptides  and  proteins 
which  go  into  well-defined  conformations. 
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Abstract.  We  study  the  effect  of  the  mass  and  charge  dynamics  on  the  collective  behaviour  of 
a  dusty  plasma.  It  is  shown  that  the  finite  non-zero  streaming  velocity  of  the  dust  grains  leads  to 
a  novel  coupling  of  the  dust  mass  fluctuation  with  other  dynamic  variables  of  the  plasma  and  the 
grains.  The.rnass  fluctuations  causes  a  collisionless  dissipation  and  provides  an  alternate  channel 
for  the  beam  mode  instability  to  occur.  Physically  the  negative  energy  wave  associated  with  the 
beam  mode  couples  to  the  mass  fluctuation  induced  dissipative  medium  to  produce  the 
instability.  We  conclude  that  the  higher  value  of  the  ion  mass  density  to  the  dust  mass  density 
ratio  reduces  the  threshold  value  for  the  onset  of  the  instability.  Its  application  in  the  astrophysi- 
cal  context  is  discussed. 

Keywords.    Dusty  plasma;  mass  fluctuation;  charge  fluctuation. 
PACSNos    52-25;  52-35 

1.  Introduction 

Dusty  plasma  is  ubiquitous  in  the  universe.  Interstellar  media,  cometary  tails,  planetary 
rings,  ionosphere,  plasma  processing,  etc.  are  a  few  examples  of  the  space  and  the 
laboratory  plasmas  where  the  dust  plays  an  important  role  in  determining  the 
dynamical  behaviour  of  the  plasmas  [1-4].  Dusty  plasma  is  a  three  component  plasma 
system  consisting  of  the  electrons,  ions  and  the  charged  dust  grains.  The  typical  size  of 
the  dust  grains  is  a  few  microns  and  these  can  carry  ten  to  million  times  the  electronic 
charge.  In  general,  the  size,  mass,  and  charge  on  the  grains  may  vary;  for  example,  the 
mass  may  vary  between  10  ~2  g  to  a  few  a.m.u.  per  dust  charge  [5, 6],  the  size  may  vary 
between  a  few  micrometer  to  the  molecular  size  i.e.  there  exists  a  smooth  transition 
from  dusty  to  a  multi  component  plasma. 

The  dust  grains  in  a  plasma  may  be  charged  through  the  collisions,  photoemission 
etc.  We  assume  that  the  collisional  process  is  primarily  responsible  for  the  grain 
charging  (in  this  case  the  grains  are  negatively  charged).  As  a  result  of  the  collisions,  the 
ions  stick  to  the  grain  surface  and  thus  causes  the  variation  of  the  grain  charge,  mass 
and  radius.  Generally,  the  effect  of  the  mass  variation  of  the  grain  is  ignored  since 
wi/md«  1.  In  the  light  of  the  recent  data  of  ASPERA  for  the  matter  ejected  from 
Phobos  [6],  we  carefully  observe  that  even  in  the  above  mass  limit,  ion  mass  density 
may  be  comparable  to  the  dust  mass  density  leading  thereby  to  an  interesting 
consequence  of  a  new  wave  instability. 

Collective  behaviour  (waves,  instabilities,  etc.)  of  a  dusty  plasma  in  the  absence  of 
charge  fluctuation  has  been  studied  by  several  authors  [1-4].  Effect  of  the  charge 
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mass  on  the  grains  has  been  assumed  to  be  constant.  However,  in  general,  it  is  not  true 
[1-5]  Actually  the  size  (radius)  of  the  grains  should  also  be  treated  as  a  dynamical 
variable  But,  for  the  simplicity,  it  is  ignored  as  the  time  scale  over  which  the  radius 
varies  is  larger  than  that  of  the  mass  variation  due  to  accretion  [12].  Under  these 
assumptions  we  find  that  when  vjc,  >  ^/2(1  +  8),  where  r\m  =  (md<,«do)/(ml  //iio)  is  the 
dust  mass  density  to  ion  mass  density  ratio  and  5  =  ne0/nio  is  the  equilibrium  electron 
to  ion  number  density  ratio,  there  exists  a  novel  mode  of  instability.  The  paper  is 
organized  in  the  following  fashion.  In  §2,  basic  governing  equations  are  given. 
Dispersion  relation  is  derived  in  §  3.  Section  4  includes  the  results  and  discussions. 

2.  Basic  equations 

We  consider  a  three  component  plasma  consisting  of  the  electrons,  ions  and  the 
electrically  charged  dust  grains.  It  is  assumed  that  the  mass  variation  is  solely  due  to  the 
sticking  of  the  ions  on  the  grain  surface  and  the  other  processes  like  photoionization, 
sputtering,  etc.  are  ignored.  The  dust  mass  fluctuation  equation  is  derived  from  the 
mass  conservation  relation.  The  basic  linearized  equations  for  the  number  conserva- 
tion of  the  electrons,  ions  and  the  dust  fluids  are: 

a,«el+V(«elve0  +  ne0vel)=-/?e0«el-/?el«e0,  (1) 

ctnil+V(?7ilvi0  +  ni0vil)=  -  Ao«n  ~  Aiwio>  (2) 

£>di+V(ndlvd0  +  nd0vdl)  =  0,  (3) 

where  the  subscripts  0  and  1  represent  the  equilibrium  and  the  perturbed  quantities 
respectively;  na,  va  (a  =  electron,  ion  and  dust)  are  the  number  density  and  velocity  of 
ath  species.  /?e  and  ft  denote  the  attachment  frequencies  of  the  electrons  and  the  ions 
respectively. 
Using  (2)  and  (3)  along  with  the  mass  conservation  equation 

5,p  +  V-J  =  0,  (4) 

where  p  =  Zmana  and  J  =  Imanava,  one  obtains  the  required  equation  for  the  mass 
fluctuation  as 


'V)mdl  =(md0/nd0)(jSion!1  +  j851w,0).  (5) 

In  writing  the  above  equation,  the  term  proportional  to  the  electron  mass  has  been 
neglected.  Using  similar  arguments,  the  dust  charge  fluctuation  equation  can  be 


derived  as 


'dO 


Here  the  attachment  frequencies  for  the  electrons  and  the  ions  have  been  denned  as 
Pa  -  /tnd/g  n.  with  'a'  representing  for  the  electron  and  ion.  The  expressions  for  the 
electron  and  ion  currents  are  given  in  the  refs  [12,  13].  Using  the  above  definition,  and 
the  equilibrium  constraints  Je0  +  JJO  =  0,  one  finds 
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and  thus  the  charge  and  mass  fluctuation  equations  can  be  written  as: 

(8) 


where  Jt  =  at  +  (vd0-V),  n  =  e\Ie0\C-l(T;1  +  w"1)  and  r)  =  e\Ic0\C-lw^  with  C  as 
the  capacitance  of  the  grain  and  w0  =  T,  -  e<j>[0,  T{  being  the  ion  temperature  and  0f0 
the  equilibrium  floating  potential  of  the  grain  surface.  Equilibrium  quasi  neutrality 
<7i"io  +  2doMdo  ~~  <?e"eo  =  0  n^s  Deen  assumed. 
Equation  of  motion  for  the  electron,  ion  and  dust  grain  fluids  are 

mc«edtve  =  -  VPe  +  <?eneE  -  meneve(ve  -  vd).  (10) 

m,n{dtv,  =  -  VPi  +  q{n{E  -  m,n, v,(v,  -  vd). 

^Kvd)  =  GdE-  (11) 

In  the  equation  of  the  motion  of  the  dust,  collisional  terms  are  not  included  and  the  dust 
grains  are  assumed  to  be  cold.  The  Poisson's  equation  closes  the  set  of  basic  equations 
and  is  given  as 

V2</>=  -4ne(ni-ne)~4nQdnd.  (12) 

Now  (8-12)  form  the  basic  set  of  equations  for  the  study  of  the  various  wave 
phenomena  in  dusty  plasma  with  dynamical  behaviour  of  the  dust  mass  and  charge 
fluctuations. 

3.  Dispersion  relation 

We  are  interested  in  the  low  frequency  response  of  the  plasma  i.e.  CD  «kvti,kvte.  The 
equilibrium  flow  of  the  dust  grains  is  non-zero  i.e.  vd0  ^  0.  Electrons  and  ions  are 
assumed  to  follow  the  Boltzmann  distribution; 

(  -~- 

\  e 

(13) 

where  ne0,«io  are  tne  equilibrium  electron  and  ion  number  densities  respectively. 
Considering  the  fluctuating  quantities  to  vary  as  exp(—  icot  +  ik-r),  (8-12)  can  be 
Fourier  analyzed  to  yield  the  desired  dispersion  relation  as 

1     /          iff  1     /         ifid 


^  =  0,  (14, 


Pramana  -  J.  Phys.,  Vol.  45,  No.  3,  September  1995  257 


JB  P  Pandey  and  C  B  Dwivedi 
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where  o>2d  =  -  -,  cf=—  ,  A*,  =  -  -  ^-^,  <5  =  -^,  t  =  -i,  co2A  = 


d     -  -,          —  ,       ,     -  -  -^,         -,        -,       A 

™d0  Wi  47™eO,iOe  «iO  re  do 

co  =  co  —  /cudo.  Except  for  the  4th  term  in  (14),  rest  of  the  terms  are  identical  to  the 
dispersion  relation  of  Bhatt  and  Pandey  [11].  The  4th  term  is  caused  by  the  mass 
fluctuation  dynamics  of  the  grains.  We  note  that  this  term  vanishes  when  the  dust  drift 
is  zero.  It  is  quite  natural  because  in  the  absence  of  the  finite  flow  velocity  of  the  dust 
grains,  the  mass  fluctuation  equation  becomes  redundant  and  it  does  not  get  coupled 
with  the  other  dynamic  variables  of  the  plasma  and  dust  fluids.  Defining  Cds  =  A2ffa)pd 
where  A~{?  =  A~2  +  Af2;  assuming  Te  «  T{  %  T,  ne0  w  ni0  «  n0  (a  situation  relevant  to 
the  planetary  rings  [14]),  eq.  (14)  in  the  limit  of  k2  A2ff  «  1  can  be  written  as 

(o>2  -  k2  C2J(d)  +  irj)  +  i£o>2(l  +  8)  =  afi  -  ips(cb  +  ir\\  (15) 

where 


4.  Results  and  discussion 

We  are  interested  in  a  physical  situation  where  mass  variation,  charging  and  collision 
frequency  satisfy  the  condition  //~ //,/?<  to.  Since  in  this  limit,  charge  and  mass 
dynamics  can  have  appreciable  effect  and  the  damping  effect  on  the  wave  is  not  very 
severe,  the  root  of  (15)  can  be  calculated  perturbatively.  Accordingly  the  solution  can  be 
given  as 

+  i&kCA  ~ik2C2( 


Cds/  '  ""'-'  /c2C 


ds 


If  one  assumes  that  £  >  fcCds(l  +  5)  i.e.  f  —  1  >  rj^2(l  +  <5),  an  instability  sets  in.  Thus 

\  ci  J 
we  see  that  the  threshold  condition  for  the  onset  of  the  beam  driven  instability  is 

modified  over  the  earlier  condition  vd0/ci  >  1  as  derived  by  Rosenberg  [14].  Obviously 
the  multiplicative  factor  r]1^2  in  our  threshold  condition  originates  from  the  dust  mass 
fluctuation  dynamics. 

(n    m    Y/2 
In  many  astrophysical  plasmas  r]1'2  =      d°    d°        »  1  owing  to  the  large  ratio  of 

V  nlomi0J 

the  dust  to  ion  mass.  Thus  the  mass  fluctuation  dynamics  of  the  dust  grains  has  a  major 
role  to  play  in  deciding  the  onset  condition  for  the  beam  driven  mode  instability  in 
dusty  plasmas.  As  we  will  see  in  the  following  discussions,  the  Mass  Fluctuation  Driven 
Streaming  Instability  (MFDSI)  may  have  favourable  conditions  in  many  of  the 
astrophysical  systems. 

(a)  Mars:  The  formation  of  the  dust  belt  around  the  mars  is  according  to  the  present 
belief  caused  by  the  material  ejected  from  its  moon  Phobos.  Assuming  that  the  grains 
initially  move  along  the  circular  Keplerian  orbit  with  ud0  %  yPhobos  ~  2-64  km/sec;  for 
typical  plasma  parameters  of  the  Mars,  nio  ~  0-1  cm" 3,  T  ~  0-1  eV,  m{  =  10~24  g  [15], 
one  finds  that  ud0/Ci~0-2.  Taking  «do  ~  1(T9  -  lCr10cm-3,  mdo  ~  103  -  107 a.m.u. 
[6],  one  finds  the  value  of  ^/2  ~  10~ 5  - 10~  6.  Thus  the  onset  condition  for  the  MIDIS 

is  Satisfied   flnH  its  PYict<=>r»Pf»  in   X/fartion  rino  io  T^ooiKIa 


(b)  Jovian  ring:  Quasi  periodic  (-28  days)  high  speed  (20-  56  km/sec)  streams  of 
submicron(0-02<fl<01/mi)sizedgrainsofthemass(l-6x  10"16  ^md0^  1-1  x  l(T14g) 
were  detected  by  'Vlysses'  mission  to  Jupiter  [4].  Taking  typical  ring  plasma  para- 
meters: «,0  ~  102  cm  3,  mj-10  22  g  for  a  tenuous  dust  cloud  nd0~  10  ~3  cm'3  [16], 
one  can  calculate  v^/c,  ~  3  and  ri^2  ~  1.  Obviously  the  instability  condition  holds' 
good  in  the  Jovian  ring  and  hence  the  excitation  of  MFDSI  may  not  be  ruled  out  in  the 
Jovian  atmosphere. 


(c)  Saturn  ring:  For  Saturn  ring  vdQ/c{^s~k  ~  Q(i)  [g]  for  different  rings. 

Here  R  is  the  distance  from  the  Saturn  to  ring  'and  Qs,Qfc  represent  rotational  and 
Keplerian  frequencies  respectively.  For  typical  plasma  parameters  ni0~10cm~3, 
T~  50  eV,  m;  ~  10  23g  for  the  example  for  O+  ions)  and  ndomd0  ~  1  g/'cm3  [8],  one 
gets  r\H2  ~  1011.  Very  clearly  the  instability  condition  is  not  satisfied  at  all  and  hence 
the  MFDSI  is  unlikely  in  Saturn  ring. 

(d)  Comets:  Havens  [18]  reports  that  the  streaming  instability  may  play  an  important 
role  in  interaction  between  cometary  dust  and  solar  wind  plasma.  Nappi  [19]  has 
also  pointed  out  the  interesting  consequences  for  a  streaming  instability  if  dust  is 
a  dominant  charge  carrier.  Considering  the  dust-plasma  parameters  inside  the 
ionopause  of  the  Halley's  comet  as  described  by  Mendis  and  Rosenberg  [4]  i.e. 
Hi0~103-104cm~3,  Te~  ri=  T~0-leV,nd0,10-3cm-3,fl~0-l-10/im,z;d0(ofthe 
order  of  the  solar  wind  speed)  ~  400  km/sec,  vdQ/c.t  ~  102.  Now  considering  the  dust 
mass  wd0~  10~16g,  the  value  of  ^/2~  10"  3.  Thus  it  is  concluded  that  the  cometary 
environment  may  be  susceptible  to  the  MFDSI. 

(e)  Interstellar  media:  Most  interesting  region  in  the  interstellar  media  is  the  star 
forming  zones  where  one  observes  the  molecular  clouds  [17].  The  effect  of  the  MHD 
shocks  in  such  clouds  have  been  studied  in  the  recent  past  [3].  Assuming  that  the  shock 
front  moves  with  typical  velocity  Vs  «  ud0;.for  the  typical  parameters  [3]  T  ~  10-20  K, 
m{  ~  10~22g,  one  gets  the  shock  speed  Vs  ~  10  km/sec  leading  thereby  vdQ/c{  ~  104.  For 
Znd0««i0,  one  finds  that  ?7^/2~l.  Thus  the  interstellar  media  especially  the  star 
forming  zones  may  become  important  locations  for  the  existence  of  the  MFDSI. 

It  is  worth  adding  a  few  more  comments  in  favour  of  the  aforesaid  instability.  The 
inclusion  of  the  mass  and  charge  fluctuation  dynamics  requires  that  the  time  scale  of  the 
wave  of  interest  (rw  =  2n/(a}  be  of  the  order  of  the  time  scales  of  the  mass  fluctuation 
(T  =  (a2wini0^l^i)~1)  and  charge  fluctuation  (tc  =  (na2wene0e)~i)  where  na2  is  the 
geometrical  cross-section  of  the  grains;  vv;  is  the  ion  thermal  velocity;  A,  ^  are  the 
atomic  mass  number  and  sticking  co-efficient  of  the  ions  respectively;  we  is  the  electron 
thermal  velocity.  These  requirements  restrict  the  scale  size  of  the  instability  and  is  found 
that  in  the  case  of  the  above  listed  astrophysical  systems  the  scale  sizes  are  of  the  order 
of  the  size  of  the  respective  systems.  Thus  it  is  conjectured  that  the  MFDSI  could  be 
a  possible  physical  mechanism  for  the  large  scale  structure  formation.  Furthermore,  it 
could  also  be  proposed  as  a  triggering  mechanism  for  the  onset  of  the  Jeans  collapse. 

In  the  light  of  the  above  discussions,  it  becomes  clear  that  the  MFDSI  has  a  major 
role  to  play  in  most  of  the  astrophysical  situations.  The  dust  mass  to  ion  mass  density 
ratio  decides  the  threshold  condition  for  the  onset  of  the  instability. 
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PARSHANT  BATHAM1,  S  B  ROY2,  SHAILENDRA  KUMAR2,  A  K  PRADHAN2, 
A  K  GROVER1,  P  CHADDAH2,  RAM  PRASAD3  and  N  C  SONI3 

Centre  of  Advanced  Study,  Deparatment  of  P-hysics,  Panjab  University,  Chandigarh  160014, 
India 

2  Low  Temperature  Physics  Group,  Centre  of  Advanced  Technology,  Indore  452013,  India 

3  Metallurgy  Division,  Bhabha  Atomic  Research  Centre,  Bombay  400  085,  India 

MS  received  22  June  1995 

Abstract  Results  are  presented  for  study  of  nonlinear  magnetization  of  a  sintered 
YBa2Cu3O7_j,  pellet  of  different  thickness  of  a  sample  subjected  to  very  low  magnetic  field.  On 
cooling  the  sample  below  Tc  in  zero  field  a  change,  in  the  oscillatory  structure  of  harmonics  in 
increasing  dc  field  is  observed  in  very  low  ac  magnetic  field.  The  effect  of  finiteness  of  the  sample 
on 'the  oscillatory  structure  and  on  the  hysteresis  of  harmonics  ;s  also  studied.  The  results  are 
explained  qualitatively. 

Keywords.    Nonlinear  magnetization;  critical  current  density;  demagnetization  factor. 
PACSNos    74-30;  74-60 


1.  Introduction 

Nonlinear  magnetization  response  of  hard  type  II  superconductors  (including  the  high 
temperature  superconductors)  is  a  well  studied  phenomenon  [1-7].  According  to  Bean's 
Critical  State  Model  [1]  the  origin  of  the  harmonics  is  hysteresis  in  magnetization  and 
the  nonlinear  relation  between  magnetization  and  applied  field  due  to  pinning  of  the 
flux  lines  in  hard  type  II  superconductors.  Bean's  model  assumes  that  a  hard  type  II 
superconductors  can  carry  a  limiting  macroscopic  superconducting  current  density 
J  =  JC  and  further,  that  any  electromotive  force,  however  small,  will  induce  the 
macroscopic  superconducting  current  of  density  Jc  to  flow  locally.  Given  the  distribu- 
tion of  current  that  are  set  up  and  the  applied  field,  one  can  calculate  the  flux  density 
B(r)  at  various  point  in  the  sample.  The  average  magnetization  induced  by  an  applied 
field  can  be  calculated  by  averaging  the  local  flux  density.  If  we  apply  an  ac  magnetic 
field  [#(£>=  Haccos  wt],  we  have  M(wt)  —  —  M(wt  +  n)  i.e.  magnetization  which  has 
time  reversal  symmetry.  This  is  shown  schematically  in  figure  la  and  one  observes  only 
the  odd  harmonics  in  accordance  with  Bean's  critical  state  model  [1].  In  the  limit  that 
critical  current  density  is  independent  of  applied  field,  this  time  reversal  symmetry  is 
maintained  even  when  a  static  magnetic  field  is  superimposed  as  depicted  in  figure  Ib. 
The  field  dependence  of  critical  current  density  leads  to  generation  of  even  harmonics, 
when  we  superimpose  the  static  magnetic  field  i.e.  when  H(i)  =  HAc '+  Haccos  wt.  It  also 
breaks  the  time  reversal  symmetry  of  the  magnetization  loop  as  indicated  in  figures  Ic- 
le  [8].  Ji  et  al  [2]  and  Navarro  et  al  [3]  reported  observation  of  the  even  harmonics 
when  the  dc  field  was  superimposed  on  time  varying  magnetic  field  in  high  temperature 
superconductors.  The  amplitude  of  harmonics  was  calculated  as  a  function  of  external 
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Figure  1 .  (a)  ac  magnetization  as  function  of  magnetic  field;  (b)  ac  magnetic  field  as 
function  of  magnetic  field  in  presence  of  dc  field  where  the  current  density  does  not 
depend  upon  magnetic  field;  (c)-(e)  ac  magnetization  as  function  of  field  in  increas- 
ing order  of  dc  magnetic  field  where  the  current  density  depends  upon  magnetic 
field. 


dc  field  and  it  Was  found  that  [4]  magnitude  of  nth  harmonics  should  show  oscillatory 
structure  with  (n  —  1)  minima  when  the  superimposed  H      field  varies  between  +  H 


and  -  Hac.  The  history  effect  of  the  static  magnetic  field  [6,7]  on  the  magnitude  of 
harmonics  in  the  case  of  ceramic  samples  of  high  temperature  superconductors  is 
a  much  more  complicated  phenomenon.  The  history  effect  is  explained  using  the  two 
component  behaviour  of  ceramics  ascribed  to  grains  and  intergrain  boundaries. 
Further,  the  observation  of  a  sharp  structure  of  second  harmonic  in  remnant  state  [9] 
can  be  explained  in  terms  of  flux  trapping  in  grain  and  using  this  memory  effect  of 
magnetization  in  second  harmonics,  measurements  can  be  made  to  estimate  the  field 
for  first  flux  penetration  into  grains,  viz.  Hclg. 

In  the  present  investigation  of  harmonics  generation,  we  have  focused  attention 
onto  an  aspect  no  specially  studied  in  any  detail  so  far,  namely  the  effect  of  the 
sample  shape  (i.e.  demagnetization  factor)  on  the  generation  of  harmonics  in  a 
given  ceramic  preparation  of  a  compound.  Magnetization  studies  in  ceramic 
sample  are  routinely  used  to  elicit  informations  of  field  dependence  of  intergrain 
and  intragrain  critical  current  density.  Our  present  studies  demonstrate  that 
for  the  quantitative  information  on  the  current  densities,  the  effect  of  demagnetiza- 
tion factor  is  necessary.  We  report  the  2nd  and  3rd  harmonic  as  a  function  of  static 
magnetic  field  in  zero  field  cooled  state  of  YBa2Cu3O7_y  sintered  pellet  sample 
with  different  demagnetization  factors  with  very  low  ac  driving  field  at  a  temperature 
of  77  K.  These  samples  with  different  demagnetization  factors  were  obtained  by 
progressive  thinning  of  the  same  pellet. 
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2.  Experimental 

The  results  reported  here  are  on  a  sintered  pellet  of  YBa2Cu3O7_y  which  were 
prepared  by  standard  solid  state  sintering  process.  Y2O3,  BaCO3  and  CuO  in 
appropriate  proportion  were  calcinated  in  air  at  900  °C  for  15hr,  interspersed  with 
grinding  and  mixing.  The  powder  was  compacted  (at  lOtons/sq  in)  and  sintered  in 
oxygen  at  940  °C  for  15  h  and  was  held  at  400  °C  during  cooling.  No  second  phase  was 
detected  in  X-ray  diffraction  and  oxygen  was  estimated  by  iodometry  as  6-95.  The 
sample  has  a  resistive  Tc  of  90  K  and  this  was  confirmed  by  magnetization  data  as  well. 
For  measuring  the  ac  magnetization  a  low  field  hysteresis  loop  tracer  [10]  with  driving 
frequency  of  3  17  Hz  and  driving  field  #ac  1  to  2  Gauss  was  used.  The  field  is  applied 
perpendicular  to  the  plane  of  the  pellet  to  study  the  effect  of  the  demagnetization  factor 
of  the  pellet.  A  dc  biasing  field  upto  45  G  can  be  provided  by  a  split  pair  of  Helmholtz 
coils,  coaxial  to  the  coil  of  hysteresis  loop  tracer. 

An  AD3522  FET  analyzer  was  used  to  measure  the  amplitude  of  harmonics  in  the 
AC  magnetization.  The  possibility  of  spurious  contribution  at  secondary  coil  (in 
absence  of  sample)  from  the  driving  frequency  was  checked.  No  contribution  from 
higher  harmonics  signal  was  observed  in  the  absence  of  the  sample,  or  with  the  sample 
at  T>  Tc.  The  noise  level  was  about  15  mV.  For  calculating  the  demagnetization 
factor  we-  measure  the  ac  magnetization  at  a  low  ac  field  value  (1  G)  such  that  no 
hysteresis  was  observable.  With  two  different  orientation  that  is  field  parallel  to  plane 
of  pellet  (of  volume  V)  and  field  perpendicular  to  the  plane  of  the  pellet,  we  obtain 
values  MH  and  Mx  respectively  [11]. 


]VII);    MJV=-H/(1-N±)  (1) 

We  also  not  that  the  trace  of  demagnetization  factor  is  one  i.e. 

(2) 


where  JV.j_  and  NB  are  the  demagnetization  factors  when  applied  field  is  perpendicular 
and  parallel  to  the  plane  of  the  pellet.  Using  (1)  and  (2)  and  the  observed  MB  and  Mj_  ,  we 
find  that  demagnetization  factors  are  Nl  =  0-745,  N2  =  0-792  and  N3  =  0-842  for  three 
different  pellets  of  thickness  are  t1  —  1-61  mm,  t2  =  1-41  mm,  and  t3  =  0-92  mm,  respec- 
tively, where  the  magnetic  field  is  perpendicular  to  the  plane  of  sample.  Each  of  these 
sample  had  a  diameter  of  8-08  mm.  We  are  interested  in  generation  of  harmonics  in  zero 
field  cooled  (ZFC)  state.  The  sample  is  cooled  to  77  K  in  absence  of  any  dc  bias  (Hdc) 
and  then  the  Hdc  is  switched  on.  Measurement  are  made  for  various  H  dc  field  in  small 
value  of  constant  Hac  field. 

3.  Results  and  discussion 

3.1.  Structural  change  in  oscillatory  behaviour  of  harmonics 

In  figure  2,  we  present  the  magnitude  of  third  harmonic  signal  (3f  )  as  a  function  of  Hdc 
with  #ac  fixed  at  1,  1-5,  2  and  3  Gauss  for  sintered  pellet  YBa2Cu3O7_y  with 
demagnetization  factors  Nt  =  0-74.  Figures  3  and  4  show  the  same  data  for  sample  with 
demagnetization  factors  N2  =  0-792  and  N3  =  0-842  respectively.  In  figure  2,  the 
magnitude  of  third  harmonic  shows  a  minimum  as  Hdc  varies  from  zero  to  maximum 
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Figure  2.    Third  harmonic  amplitude  (3f)  as  a  function  of  dc  magnetic  field,  the  ac 
field  being  fixed  at  1, 1-5,  2  and  3  G  for  demagnetization  factor  A^  =  0-745. 


value  (5G)  at  driving  field  #ac  =  3G.  But,  at  driving  fields  //ac=l-5  and  1G, 
the  3f  signal  increases  monotonically  on  varying  the  dc  bias  from  0  to  maximum 
value  5G  which  is  quite  unusual.  The  observation  of  one  minimum  in  3f  signal 
for  H  =  3G  is  consistent  with  theory  given  by  Ji  etal  [2]  in  which  they  have 
calculated  that  the  magnitude  of  nth  harmonic  should  show  oscillatory  behaviour  with 
(n-  1)  minima  on  varying  the  Hdc  from  ~HAC  to  ffac.  This  discussion  is,  however, 
valid  only  for  Hac  >  H*,  where  H*  is  the  field  for  full  penetration  of  the  sample.  In  our 
observation  of  the  M-H  loops  on  this  sample,  the  field  for  full  penetration  is 
about  2-0  G.  The  change  in  the  oscillatory  behaviour  for  demagnetization  factor 
Nl=Q-74  shown  in  figure 2  appears  to  be  at  driving  field  at  Hac  =  2G  =  H*. 
For  driving  field  Hac  <  H*,  we  observe  a  behaviour  different  from  that  predicted 
in  ref.  [2]  in  that  3f  signal  has  only  one  minimum  and  that  is  at  Hdc  =  0.  This  is 
also  seen  from  the  calculation  of  ref.  [4]  when  Hac«#*.  The  change  in  the 
oscillatory  structure  in  3f  signal  for  demagnetization  factors  Af2  =  0-79  and 
N3  =  0-84  is  measured  to  be  at  field  values  H  =  1-5  and  1  G,  as  shown  in  figures  3  and 
4  respectively.  These  results  are  new  and  call  for  further  refinement  in  the  existing 
theoretical  models. 


3.2  Effect  of  geometry  on  the  harmonics 

From  figures  2  to  4,  the  change  in  oscillatory  behaviour  of  3f  vs  Hdc  shifts  to  lower  field 
values  on  increasing  the  demagnetization  factor.  This  confirms  the  fact  that  on 
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Figure  3.    Third  harmonic  amplitude  (3f )  as  a  function  of  dc  magnetic  field,  the  ac 
field  being  fixed  at  1, 1-5,  2  and  3  G  for  demagnetization  factor  N2  =  0-792. 
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Figure  4.    Third  harmonic  amplitude  (3f)  as  a  function  of  dc  magnetic  field,  the  ac 
field  being  fixed  at  1, 1-5,  and  2  G  for  demagnetization  factor  N3  =  0-89. 
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as  function  of  Hdc  raised  to  39  G  and  lowered  to  zero  value  in  H_  =  4  < 
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Figure  7.  Unique  functional  dependence  of  amplitude  of  second  harmonic 
a  function  of  scaled  field  (#scl).  (•)  curve  denote  scaling  of  applied  fi 
demagnetization  factor  JVt  =0-745  and  (•)  curve  signifies  the  scaling  of  fi 
demagnetization  factor  N2  =  0-792  for  the  unidirectional  increase  of  the  ; 
field. 


increasing  the  demagnetization  factor,  the  effective  field  experienced  b 
sample  increases.  The  generation  of  even  harmonics  is  the  consequences 
field  dependence  of  the  critical  current. density  [2-6].  The  irreversible  beha 
of  the  nonlinear  response  of  the  YBa2Cu3O.7_J,  is  attributed  to  the  two 
ponent  network  of  sample  i.e.  integrain  and  intragrain  region.  In  figures 
6,  we  observe  that  2f  signal  shows  a  history  effect  with  the  applied  ma 
field  due  to  its  ceramic  nature  [5,6]  for  demagnetization  factors  N2 
and  N3  =  0-79.  We  have  used  a  large  ac  field  Hac  =  4  G  and  we  observe  a 
ficant  difference  in  the  amplitude  and  in  the  field  dependence  of  2f  for  di 
demagnetization  factors. 

However,  scaling  the  applied  field  with  the  demagnetization  factor  Hs 
{1  — JV(1  —  Vf)}  [12],  where  N  is  demagnetization  factor  and  Vf  is  el 
volume  of  sample  into  which  flux  is  able  to  penetrate.  This  scaling  be 
useful  once  the  flux  starts  penetrating  the  intergrain  region  and  our  eq. 
longer  holds.  We  observe  in  figure .7  2f  signal  show  a  unique  functional  depe: 
on  scaled  field  for  unidirectional  increase  in  applied  field  and  in  figun 
unidirectional  decrease  in  applied  field,  where  following  ref.  [12],  Vf  is 
to  be  0-4.  Therefore,  for  quantitative  measurement  of  critical  current  < 
from  magnetization  studies,  demagnetization  correction  is  very  important.  Our 
show  that  when  this  is  taken  into  account,  a  unique  dependence  on  applied 
obtained. 
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Figure  8.  Unique  functional  dependence  of  amplitude  of  second  harmoi 
a  function  of  scaled  field  (#scl).  (•)  curve  denote  scaling  of  applied 
demagnetization  factor  A^  =  0-745  and  (•)  curve  signifies  the  scaling  o 
demagnetization  factor  N2  =  0-792  for  the  unidirectional  decrease  of  th 
field. 


4.  Conclusion 

In  conclusion,  we  observed  that  the  3f  shows  consistent  behaviour  as  theories  p 
ac  field  /fac  >  //*.  However  at  field  smaller  than  //*,  3f  increases  monotonicall 
field  at  which  the  shift  in  structure  observed  also  decreases  on  thinning  th< 
progressively  i.e.  on  increasing  the  demagnetization  factor.  We  measured  th 
cant  difference  in  the  second  harmonics  at  different  demagnetization  factc 
sample.  But,  scaling  the  applied  field  with  demagnetization  factor  and  the  tola 
experiencing  the  field,  the  2f  shows  a  unique  field  dependence. 
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Abstract.  Two  photon  absorption  spectrum  of  p-dichlorobenzene  has  been  studied  in  the 
spectral  region  497-552  nm  (one  photon  wavelength).  Bands  involving  several  frequencies  of  the 
excited  electronic  state  not  detected  in  the  one  photon  absorption  are  seen.  The  observed  two 
photon  spectrum  indicates  that  the  electronic  transition  retains,  to  a  large  extent,  the  salient 
features  of  the  two  photon  absorption  spectrum  of  benzene  and  thus  confirms  chlorine  as  weak 
perturber  (in  the  sense  of  Goodman  and  Rava  [1]). 

Keywords.  Two  photon  absorption  spectrum;  p-dichlorobenzene;  weak  perturber;  molecular 
spectra. 

PACSNo.     30-10 

1.  Introduction 

Two  photon  absorption  studies  enable  the  observation  of  several  features  not  accessi- 
ble to  direct  observation  in  the  usual  one  photon  absorption  spectrum  [1].  Substituted 
benzenes,  wherein  one  or  more  hydrogen  atoms  are  replaced  by  other  atoms  or  group 
of  atoms,  form  a  related  class  of  molecules  whose  systematic  study  has  led  to  a  better 
understanding-of  many  aspects  of  molecular  spectra  and  hence  such  molecules  have 
continued  to  draw  the  attention  of  molecular  spectroscopist.  Recently,  we  have 
reported  an  analysis  of  a  part  of  the  two  photon  electronic  absorption  spectrum  of 
odichlorobenzene  (ODCB)  in  the  wavelength  region  485-540  nm  (one  photon 
wavelength).  In  the  present  communication  we  report  the  results  obtained  in  a  similar 
study  on  p-dichlorobenzene  (PDCB). 

Earlier  one  photon  absorption  studies  on  PDCB  have  been  carried  out  by  Wollman 
[2]  and  by  Conrad-Billroth  [3]  in  the  liquid  phase  and  by  Sponer  [4],  Joshi  [5]  and 
Anno  and  Matubara  [6]  in  the  vapour  phase  at  low  resolution.  Rai  [7]  reported  the 
one  photon  absorption  spectrum  of  PDCB  at  a  moderately  high  resolution  which 
enabled  him  to  identify  several  hitherto  unobserved  vibrational  features.  Even  though 
two  photon  electronic  absorption  corresponding  1B2u «- 1  Alg  transition  of  benzene  has 
been  investigated  for  several  mono-  and  di-substituted  benzenes  (Goodman  and  his 
colleagues  [1]),  no  work  on  dichlorobenzene  has  been  reported  prior  to  present  study. 

2.  Experimental 

The  experimental  set  up  used  for  recording  the  two  photon  spectrum  of  p-dichloroben- 
zene is  essentially  the  same  as  described  in  our  earlier  paper  [8].  In  brief  it  consists  of 
a  pyrex  cell  4  cm  in  diameter  and  20  cm  in  length.  A  small  bulb  is  attached  in  the  middle 
of  the  tube  to  hold  sample  while  the  ends  are  closed  with  flat  glass  windows.  An  outlet  is 

271 


/  B  Singh  el  al 

also  provided  to  connect  a  vacuum  pump  to  create  a  proper  vacuum  in  the  cell.  The 
electrodes  are  circular  discs  of  stainless  steel  1  cm  in  diameter  and  are  mounted  in  the 
cell  facing  each  other  at  a  mutual  separation  of  1  cm.  The  electrodes  are  at  a  distance 
5cm  from  one  of  the  windows.  The  experimental  set  up  used  is  shown  in  figure  1. 

A  freshly  purified  sample  of  p-dichlorobenzene  (reported  purity  99-9%)  is  kept  in  the 
bulb  of  the  ceil  which  is  dipped  into  a  container  of  liquid  ^nitrogen  while  the  cell  was 
evacuated  to  a  residual  gas  pressure  of  approximately  10~2torr.  The  liquid  nitrogen 
container  is  then  removed  and  the  p-dichlorobenzene  vapour  fills  the  cell  at  a  pressure 
appropriate  to  the  ambient  temperature.  A  DC  voltage  of  11 0-1 50V  was  applied 
across  the  electrodes  from  a  stabilized  power  supply  (Bertan  model  No.  205A-O1R). 
The  laser  radiation  from  a  dye  laser  (Quanta  Ray  model  PDL-2  with  coumarine  500 
dye)  pumped  by  the  third  harmonic  of  a  Nd:  YAG  laser  (Quanta  Ray  model  DCR-2) 
was  focused  at  a  spot  between  the  two  electrodes  with  the  help  of  a  5  cm  focal  length 
convergent  lens.  The  repetition  rate  of  the  laser  was  10  Hz  and  pulse  width  of  the  dye 
laser  was  ~  7  ns.  The  dye  laser  peak  output  power  at  510  nm  was  lOmJ/pulse. 

Under  conditions  of  tight  focusing,  the  photon  density  in  the  region  becomes  large 
enough  to  cause  multiphoton  absorption.  Since  the  wavelength  of  the  dye  laser  is 
appropriate  for  two  photon  absorption  to  the  lowest  lying  singlet  excited  state  of 
p-dichlorobenzene  a  substantial  fraction  of  these  molecules  are  excited  to  the  excited 
state.  Such  excited  molecules  can  be  ionized  by  the  absorption  of  an  additional  photon 
resulting  in  the  creation  of  electron  ion  pairs.  These  ions  and  electrons  are  collected  at 
the  respective  electrodes  and  give  rise  to  an  ionization  signal.  The  A.C.  signal  thus 
generated  was  filtered  from  D.C.  by  using  a  capacitance  resistance  combination.  The 
signal  thus  obtained  is  processed  through  a  boxcar  averager  (EG  and  G  model  164) 
with  a  gate  width  of  5  ns.  The  bias  voltage,  laser  power  and  the  focusing  of  the  lens  were 
adjusted  so  that  the  signal  for  one  of  the  absorption  peaks  (near  the  middle  of  the  tuning 
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Figure  1.    Experimental  set  up  used  for  recording -the  two  photon  absorption 
spectrum  of  p-dichlorobenzene. 


range)  was  optimum.  The  resulting  absorption  specuum  was  men  recorded  wim 
a  double  pen  recorder.  The  resolution  achieved  in  the  experiment  is  1  -2  cm " l.  A  small 
part  of  the  dye  laser  beam  was  sent  into  a  Fe-Ne  hollow  cathode  lamp  and  the 
optogalvanic  signal  of  Ne  was  used  to  provide  standard  lines  for  wavelength  calibra- 
tion of  the  spectrum.  A  part  of  the  spectrum  along  with  noise  level  is  shown  in  figure  2. 
The  measured  wavelengths,  the  frequencies  and  the  relative  intensities  of  the  various 
bands  are  listed  in  table  1.  We  also  monitored  the  signal  by  varying  the  laser  power.  It  is 
marked  that  the  signal  is  varying  linearly  with  the  square  of  the  laser  power.  This  clearly 
indicates  that  the  signal  is  due  to  simultaneous  absorption  of  two  photons. 

3.  Results. 

The  reduced  point  group  symmetry  D2h  of  PDCB  results  in  the  transition  analogous  to 
the  1B2u'*-  JAlg  transition  of  benzene  becoming  an  allowed  transition.  This  results  in 
the  appearance  of  a  strong  (O-O)  band  along  with  intense  bands  involving  the  totally 
symmetric  vibrations,  namely  1,  2,  6a,  8a  and  9a  (in  Wilson  notation).  The  one  photon 
absorption  spectrum  of  PDCB  vapour  analyzed  by  Anno  and  Matubara  [6],  Joshi  [5] 
and  Rai  [7]  clearly  demonstrates  the  correctness  of  this  interpretation.  The  origin  band 
as  identified  by  these  authors  is  at  35742  cm  ~  l.  Since  the  longest  wavelength  available 
with  the  present  dye  is  552-0  nm,  we  could  not  observe  bands  with  energy  less  than 
36240cm"1  and  we  have  taken  the  value  reported  by  Rai  [7]  for  the  position  of  the 
(O-O)  band. 


TWO    PHOTON 
ABSORPTION    SPECTRUM     OF    p-OICHLOROBENZENE 
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Figure  2a.     A  part  of  the  two  photon  absorption  spectrum  of  />-dichloroben/ene. 
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Table  1.    Prominent  two  photon  bands  in  p-dichlorobenzene  and  their  assignment. 


Wavelength  of 
the  bands 

(A) 

2v  in  cm  i 

Av 
in  cm"1 

Intensity 

Assignment 

5512-6 

36280 

538 

W 

6ao 

5499-6 

36366 

624 

W 

17aJ 

5482-7 

36478 

736 

VS 

7aJ 

5469-5 

36566 

824 

MW 

17aJ17bX 

5440-5 

36761 

1019 

MS 

12ao 

5436-9 

36785 

1043 

MS 

15J 

5436-2 

36790 

1048 

VS 

1* 

5433-0 

36803 

1061 

S 

9aJ 

5432-7 

36814 

1072 

VS 

18ao 

5426-2 

36858 

1116 

MS 

6ao17ao 

5414-7 

36936 

1194 

M 

7ao16bo 

5409-7 

36970 

1228 

S 

3o 

5401-9 

37024 

1282 

W 

7ao6bo 

5399-4 

37082 

1340 

W 

iy  o 

5389-8 

37107 

1365 

W 

19a£ 

5381-4 

37165 

1423 

VS 

8ao 

5358-3 

37325 

1583 

VS 

14o 

5309-8 

37666 

1924 

W 

14o18bo 

5254-2 

38064 

2322 

W 

14o7ao 

5224-8 

38278 

2536 

M 

14jl8bJ17aJ 

5174-7 

38651 

2909 

W 

9a27a*18b{ 

5141-5 

38898 

3156 

S 

14J1X 

5128-1 

39000 

3258 

M 

14;.19aJ18b} 

5115-7 

39095 

3353 

M 

14jlj7aJ 

5097-1 

39237 

3495 

MS 

14j7a;i6bJ 

5081-9 

39355 

3613 

W 

14jlJ10b} 

5078-0 

39385 

3643 

W 

14jlj6aJ 

5066-2 

39477 

3735 

S 

14j9a5l8b^ 

5052-1 

39588 

3846 

M 

7a219bj6a2 

5048-7 

39614 

3872 

S 

14jl8aj3j 

5038-2 

39696 

3954 

MS 

1419a217b210b1 

5016-5 

39868 

4127 

MS 

14l9a218bi 

5005-2 

39958 

4216 

M 

141l217ail8b2 

4990-0 

40079 

4337 

MW 

14^9a26a^ 

4977-5 

40180 

4438 

MW 

14;9a;7aJ10b} 
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It  is  well-known  that  in  the  corresponding  transition  in  benzene  vapour  the  longest 
wavelength  band  observed  at  the  lowest  temperature  is  the  one  involving  the  excitation 
of  one  quantum  of  a  C-C-C  in-plane  bending  vibration  of  e2g  symmetry.  The  ground 
state  magnitude  of  this  frequency  is  608  cm  ~ 1  (mode  6a  in  Wilson  notation)  while  the 
excited  (1B2u)  state  magnitude  is  538  cm~ l.  This  vibration  is  not  expected  to  change 
drastically  when  some  of  the  hydrogen  atoms  in  benzene  are  replaced  by  chlorine 
atoms  as  in  PDCB.  Moreover,  though  the  transition  is  now  electronically  allowed  the 
charge  cloud  of  the  molecule  does  retain  to  a  great  extent  the  same  features  as  in 
benzene  so  that  the  excitation  of  a  single  quantum  of  this  vibration  could  be  expected  to 
give  rise  to  a  false  origin  and  band  structure  loaded  on  to  it.  This  is  actually  observed. 

The  weak  band  at  a  separation  of  538  cm  ~ l  (on  the  shorter  wavelength  side)  from  the 
(O-O)  band  is  ascribed  to  the  excitation  6a^.  This  mode  is  the  Raman  active 
component  of  the  above  mentioned  e2g  vibration  of  benzene.  The  other  component  of 
this  doubly  degenerate  vibration  (in  benzene)  is  seen  as  another  weak  band  at 
a  separation  of  624  cm"  *  from  the  (O-O)  band.  This  is  ascribed  to  the  17a  mode  and  is 
infrared  active. 

A  relatively  intense  band  is  seen  at  0  +  736  cm" l.  This  frequency  is  ascribed  to  the  7a 
mode  involving  the  C-C1  stretching.  The  corresponding  frequency  (748  cm"  ^  in  the 
ground  state  of  the  molecule  is  seen  prominently  in  Raman  spectrum. 

A  group  of  five  bands  is  seen  at  36761,  36785,  36790,  36803  and  36814cm"1  with 
separation  from  the  O-O  band  of  1019, 1043, 1048, 1061  and  1072  cm~  \  respectively. 
The  bands  are  assigned  as  0  +  12J,  0  +  15J,  0  +  I1,  0  +  9a£  and  0  -I-  18aJ.  Of  these  the 
bands  at  a  separation  of  1048 cm"1  is  the  most  prominent  and  is  ascribed  to  the 
excitation  of  the  totally  symmetric  ring  breathing  frequency  (some  of  these  bands  may 
also  involve  sequence  bands  buried  under  the  succeeding  bands). 

The  band  observed  at  37325  cm~ 1  has  the  largest  intensity  in  the  whole  spectrum.  Its 
separation  from  the  O-O  band  is  1 583  cm  ~ : .  This  frequency  is  one  of  the  characteristic 
features  in  the  two  photon  spectrum  of  benzene  derivatives  and  is  assigned  as  14^.  It  is 
due  to  C-C  stretching  vibration.  This  frequency  being  the  most  active  is  involved  in 
a  large  number  of  bands,  in  combination  with  several  other  frequencies,  which 
dominate  the  whole  structure.  Some  of  the  strong  combination  bands  where  this 
v14  fundamental  is  involved  are  141 Ij6aj  at  38898  cm'1,  14j9aJ 18b£  at  39477 cm -1 
14J18J3J  at  396 14 cm'1  etc.  Some  other  fundamental  vibrations  which  appear  with 
slightly  lower  intensity  have  the  magnitude  1228 cm'1,  1340 cm'1,  1365cm'1  and 
1427  cm~  \  These  bands  are  assigned  to  3  J  (of  b3g  species),  19a£  (of  b2u  species),  19bJ  (of 


Table  2.    Intensity  of  different  vibrations  with  respect  to 
v14  in  o-  and  p-dichlorobenzenes. 

Vibration  ODCB  Molecule   PDGB  Molecule 


7a 

1/6 

1/2-2 

18a 

1/2-5 

1/2-2 

1 

1/2-5 

1/2-2 

12 

1/6 

1/4 
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Figure  3.    A  part  of  the  one  photon  absorption  spectrum  of  p-dichlorobenzene 


blu  species)  and  Sa^  (of  ag  species)  modes  respectively.  Sequence  bands  are  seen  to 
accompany  several  of  the  main  bands  with  separation  —  26,  —31  and  —86cm"1, 
Table  1  gives  the  complete  assignment  of  the  observed  bands.  Some  of  the  bands 
appear  prominently  in  the  two  photon  spectra  of  o-  as  well  as  in  p-dichlorobenzene.  The 
intensities  of  these  bands  with  respect  to  v14  in  the  two  molecules  are  compared  in 
table  2. 

We  have  also  compared  our  two  photon  spectrum  with  one  photon  spectrum 
reported  by  Rai  [7].  A  part  of  the  one  photon  spectrum  is  also  given  in  figure  3.  Some  of 
the  prominent  bands  observed  in  our  two  photon  spectrum  (viz.  OQ  +  1061cm"1; 
O£  +  1048 cm~  \  O°Q  +  736 cm~  \  Og  +  538 cm~ 1  etc)  are  clearly  seen  in  one  photon 
spectrum  also. 
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Abstract.  The  time-resolved  laser  emission  of  coumarin  102  was  investigated  in  various  aprotic 
and  protic  solvents  at  picosecond  resolution  by  frequency  upconversion  technique.  The  spectral 
shift  of  the  transient  emission  spectrum  is  attributed  to  solvent  reorientation  and  the  results  are 
discussed. 
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1.  Introduction 

Following  the  original  experiment  of  Halliday  and  Topp  [1],  the  frequency  conversion 
technique  has  been  applied  by  a  number  of  workers  to  the  investigation  of  charge 
transfer  processes,  energy  transfer  processes,  isomerization  reactions,  vibrational 
relaxations  and  solvent  relaxations  [2].  As  demonstrated  by  Moore  et  al  [3]  the  use  of 
multichannel  detection  with  upconversion  technique  can  provide  more  spectral  infor- 
mation which  cannot  be  obtained  by  other  standard  techniques  such  as  streak  camera 
and  time  correlated  single  photon-counting.  This  technique  is  attractive  because  the 
time-resolution  achievable  by  this  method  is  determined  solely  by  the  duration  of  the 
excitation  pulse  and  not  by  the  sample  lifetimes  and  also  the  upconverted  emission  is 
free  from  background  noise.  As  a  part  of  our  programme  on  picosecond  processes  in 
aminocoumarins,  we  present  the  preliminary  investigation  of  solvation  dynamics  in 
protic  and  aprotic  solvents  using  C  102  as  a  probe  molecule  and  the  frequency 
upconversion  technique  developed  in  our  laboratory.  Although  the  experimental  and 
theoretical  studies  in  solvation  dynamics  in  solution  has  undergone  a  dramatic 
revolution  in  the  past  decade,  yet  the  situation  is  not  very  clear  as  witnessed  by 
discrepancies  between  theoretical  and  experimental  results  [4, 5]. 

When  a  molecule  is  excited  to  higher  electronic  state,  several  processes  compete 
to  deactivate  the  excited  state  during  its  lifetime.  These  processes  can  arise  from 
either  Franck-Condon  state  or  relaxed  excited  state.  The  evolution  of  the  excited 
state  is  controlled  by  the  relative  rates  of  different  deactivation  processes.  The  main 
processes  which  usually  compete  with  the  radiative  deactivation  of  Franck-Condon 
excited  state  are  vibrational  relaxation,  solvent  relaxation,  formation  of  charge 
transfer  complexes,  isomerization,  energy  transfer  between  like  molecules  and  inter- 


vibrational  relaxation  in  Sx  state  to  be  a  very  fast  process  which  escapes  our  instru- 
mental resolution.  For  coumarins,  the  ISC  yields  by  triplet  counting  technique 
have  been  reported  to  be  less  than  0-05  for  ethanol  [6]  at  room  temperature.  The 
rate  constant  for  ISC  calculated  using  i(Si  )  =  4-36  ns  is  so  low  that  the  process 
takes  place  on  a  time  scale  much  longer  than  the  time  delay.  Intermolecular  energy 
transfer  processes  between  like  molecules  via  Forster  mechanism  are  relatively 
fast  at  high  concentrations  and  slow  at  normal  concentrations.  In  the  present 
system  this  process  was  minimized  by  keeping  the  concentration  of  the  solute  at 
1  x  10~3moldm~3.  Our  probe  molecule  is  a  rigid  one  and  structural  considerations 
show  that  isomerization  is  quite  unlikely.  Although,  discrete  charge  transfer  complex 
via  hydrogen  transfer  has  been  reported  for  C 102  in  water  [7],  protonation  of  C  102 
is  not  possible  in  protic  and  aprotic  solvents.  Our  previous  studies  [8]  have  shown 
that  dipole  moment  of  C  102  changes  from  5-016  D  to  6-849  D  upon  excitation  from  S0 
to  $1  state.  Also,  our  luminescence  studies  of  C 102  as  function  of  temperature  reveals 
that  the  fluorescence  of  C  102  at  room  temperature  (293  K)  occurs  at  longer  wavelength 
(Amax  =  468  nm)  than  the  one  obtained  at  77  K  (Amax  =  456  nm).  There  is  no  broadening 
of  the  emission  at  intermediate  temperatures  and  no  spectral  shift  is  observed  below 
77  K  (figure  1).  Such  characteristics  of  dye  solution  are  generally  the  reflections  of 
solvent-relaxation  [9]. 

Immediately  after  laser  excitation,  the  probe  molecule  will  be  in  a  'Franck-Condon' 
(FC)  excited  state  (a  transitory  excited  state),  possessing  an  altered  electron  distribution 
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Figure  1.    Normalized  fluorescence  spectra  of  C 102  (1  x  10~3mol  dm~ 3)  at  77  K 
and  293  K. 
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but  the  ground  state  geometry.  The  excited  molecule  is  initially  surrounded  by 
equilibrium  ground  state  solvent  cage.  During  the  lifetime  of  the  excited  state  (Sj ),  the 
solvent  cage  adopts  orientation  consistent  with  the  new  excited  state  charge  distribu- 
tion of  the  probe  molecule.  As  the  solvent  orientation  changes  continuously  with  time, 
the  energy  of  the  solute-solvent  configuration  changes  correspondingly  leading  to  the 
time-dependent  spectral  shift  in  the  emission  peak  of  the  probe  molecule.  The  solvation 
dynamics  of  C  102  in  non-associating  solvents  such  as  acetonitrile  has  been  inves- 
tigated earlier  by  Kahlow  etal  [10]  from  sub-picosecond  fluorescence  spectroscopy 
and  the  measured  solvent-relaxation  time  was  reported  to  be  0-9  ps.  However,  the  time 
scale  of  solvation  process  at  room  temperature  is  expected  to  be  in  the  picosecond 
domain  for  the  associating  solvents  such  as  alcohols.  In  the  present  work,  we  focus  our 
attention  on  the  measurements  of  time-dependent  superradiant  laser  emission  on 
picosecond  time  scale  in  both  polar  aprotic  and  protic  (alcohols)  solvents  to  evaluate 
solvent-relaxation  and  compare  its  time  scale  relative  to  orientational  relaxation  time 
of  solute  molecule  determined  previously  [8]  in  the  ps  time  domain.  The  superradiant 
laser  emission  is  nothing  but  amplified  spontaneous  emission  (ASE)  which  shortens  the 
Si  state  lifetime  from  tf  to  TASE.  According  to  the  relation  derived  by  Penzkofer  and 
Blau  [11]  for  the  ratio  of  these  two  lifetimes,  for  a  given  dye,  the  lifetime  of  St  in  the 
presence  of  ASE  can  be  altered  by  suitably  adjusting  the  solid  angle  of  effective  ASE  and 
the  position  of  maximum  gain.  Therefore,  when  the  conversion  efficiency  of  the  mixing 
crystal  is  very  poor  to  upconvert  the  fluorescence,  one  can  use  the  superradiant 
emission  as  a  probe  by  adjusting  the  excitation  conditions  properly.  This  study  is  also 
important  in  view  of  the  current  interest  [12]  on  the  influence  of  rotational  motions  of 
solvent  and  probe  molecules  on  the  dye  laser  performance. 

2.  Theoretical  description 

When  two  optical  fields  of  frequencies  o^  and  co2  are  mixed  in  a  phase  matched 
non-linear  crystal,  the  output  optical  field  with  frequencies  a)l  +  co2  and  o^  —  co2  is 
generated.  The  time-resolved  spectra  can  be  measured  at  a  given  time  after  excitation 
either  by  tuning  the  laser  frequency  or  by  tuning  the  crystal  at  a  fixed  laser  frequency  to 
select  the  fluorescence  wavelength.  The  phase  matching  condition  for  frequency 
conversion  is  that  k3  =  k±  +  k2,  where  /cl5  k2  and  k3  are  the  fluorescence,  gating  laser 
and  output  signal  wave  vectors,  respectively.  The  conversion  efficiency  of  the  crystal  is 
proportional  to  the  intensity  of  the  gating  pulse: 

The  instantaneous  power  of  the  sum-frequency  in  a  crystal  of  length  L  is  given 
approximately  by  [13] 

PJL)=L2P1P2(U3/co1  (2) 

where  subscripts  on  the  frequency  co  and  the  power/unit  area  P  refer  to  the  fluorescence, 
gating  pulse,  and  the  sum-frequency  respectively.  If  the  power  of  the  laser  remains 
constant  to  within  a  few  per  cent,  the  upconverted  signal  at  a  particular  delay  is 
proportional  to  the  fluorescence  intensity. 
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3.  Experimental 

The  schematic  of  the  upconversion  experiment  is  shown  in  figure  2.  The  laser  system 
was  passively  and  actively  mode-locked  Nd :  YAG  laser  (Continuum,  USA)  which 
provided  1064  nm  pulse  of  35  ps  duration  at  a  repetition  rate  of  10  Hz  with  energy  of 
70  mJ.  It  was  frequency  doubled  to  532  nm  in  a  KDP  crystal.  Subsequently,  mixing  up 
of  1064  nm  and  532  nm  lights  in  another  KDP  crystal  provided  a  sum-frequency  at 
355nm(35ps,  15mJ). 

The  solutions  of  C 102  (Eastman  Kodak  and  Exciton,  USA)  prepared  at 
1  x  10~3moldm"3  in  various  solvents  were  contained  in  a  10mm  pathlength  quartz 
cell  and  excited  by  a  355  nm  pulse.  The  unwanted  feedback  from  the  cell  walls  was 
avoided  by  careful  alignment  of  the  pump  beam  and  the  cell.  A  typical  dye  laser  output 
for  ethanolic  solution  of  C  102  had  nearly  5  mrad  divergence  and  bandwidth  of  13  nm 
(fwhm)  as  compared  with  43  nm  (fwhm)  for  fluorescence  (figure  3).  The  thermal  effects 
in  the  solution  were  minimized  by  circulating  the  dye  solution.  The  superradiant  laser 
emission  collected  by  the  lens  was  focussed  onto  the  KDP  type  II  crystal  (4mm  path 
length)  where  it  was  mixed  collinearly  with  the  1064nm  gating  pulse.  Our  attempts  to 
upconvert  the  fluorescence  were  in  vain  probably  due  to  low  conversion  efficiency  of 
the  crystal  used.  The  arrival  time  of  the  excitation  pulse  on  the  mixing  crystal  was 
controlled  by  adjusting  the  distance  travelled  by  pump  pulse  using  variable  optical 
delay.  By  angle-tuning  the  crystal  and  adjusting  the  relative  convergence  of  the  input 
pulses  in  the  crystal,  we  were  able  to  upconvert  the  laser  emission  from  455-485  nm 
(spectral  coverage:  30  nm;  fwhm:  13  nm)  to  318-333  nm  (spectral  coverage:  15  nm; 
fwhm:  5nm).  In  addition  to  the  upconverted  signal,  the  output  from  the  crystal 
consisted  of  the  fundamental  and  its  second  harmonic  which  were  cut  off  by  using 
appropriate  filters. 

The  upconverted  signal  was  collected  by  the  optical  fibre  and  measured  with  a  0-5  m 
polychromator  (Spectroscopy  Instruments,  Germany,  600g/mm)  and  double  intensifi- 
ed photodiode  array  which  were  controlled  by  a  microcomputer.  The  signal  was 
recorded  as  a  function  of  delay.  The  spectral  data  were  averaged  over  multiple 
measurements.  The  time-resolved  spectra  were  obtained  with  no/little  distortion. 
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Figure  2.    Schematic  of  upconversion  experiment: 
M:  Mirror:  L:  Lens:  BC:  Beam  combiner:  F:  Filter. 
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Figure  3.     Static  fluorescence  (FL)  and  dye  laser  spectra  (LA)  of  C 102  in  ethanol  at 
1  xKT3Moldrrr3 


Absorption  spectra  at  various  concentrations  for  all  the  solvents  were  recorded  using 
Hitachi  (Model  3200)  UV/Vis  spectrophotometer  and  steady-state  fluorescence 
emissions  were  measured  using  a  400  Watt  Hg  lamp,  a  scanning  monochromator 
(Oriel),  a  photomultiplier  (RCA  931  A)  and  an  X-Y  recorder  (Riken  Denshi).  Low 
temperature  luminescence  studies  were  made  using  closed  cycle  refrigerator  (RMC 
Cryosystems,  USA). 


4.  Results  and  discussion 

Steady-state  spectral  characteristics  of  C  102  in  various  solvents  at  optimum  concen- 
trations along  with  the  properties  of  the  solvents  are  shown  in  table  1.  We  find  that 
compared  to  dioxane  solution,  all  other  selected  solutions  exhibit  red-shift  both  in 
absorption  and  fluorescence  maxima  which  correlate  reasonably  with  solvent  polarity 
calculated  according  to  Lippert  [14].  Such  spectral  shifts  are  generally  the  reflections  of 
solute-solvent  interactions  which  may  be  of  general  or  specific  type  such  as  hydrogen 
bonding.  The  C  102  is  a  planar  molecule  in  the  ground  state  and  its  dipole  moment  as 
stated  earlier  changes  after  excitation,  indicating  that  C102  should  be  solvated 
relatively  larger  in  St  compared  to  ground  state.  In  the  general  type  of  solvent  effect,  the 
Stokes  shift  depends  on  dielectric  constant  and  refractive  index  of  solvents,  while 
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specific  effects  are  associated  with  n-donor,  7r-donor  and  hydrogen  bonding  abilities  of 
the  solvents.  For  polar  solutes  like  C  102,  the  interaction  with  non-polar  solvents  like 
dioxane  depends  on  the  dipole-induced-dipole  forces,  while  with  aprotic  (polar  but 
non-hydrogen  bonding)  solvents,  the  solute-solvent  interaction  depends  on  the  stron- 
ger dipole-dipole  forces.  In  protic  solvents  (alcohols),  in  addition  to  dipole-dipole 

Table  1.  Steady-state  absorption  and  fluorescence  characteristics  of  C102  in 
various  solvents  and  calculated  values  of  polarity  functions  of  solvents  using 
Lippert's  expression  (1957). 
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Figure  4.    Absorption  maxima  (•),  Static  emission  maxima  (Q)  and  Stokes  shift 
(A)  as  a  function  of  ET  (30). 
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interaction,  specific  interaction  such  as  H-bonding  may  be  effective  as  the  intramolecu- 
lar charge  transfer  (ICT)  character  of  C  102  is  favourable  for  H-bonding  with  hydroxy 
group  in  alcohols.  The  dependence  of  Stokes  shift,  vfl  —  vf  (cm~ l),  on  solvent  polarity 
accordng  to  Lippert's  expression  seems  to  be  less  regular  within  polar  solvents.  This 
trend  may  perhaps  be  due  to  the  facWhat  the  Lippert's  relation  ignores  the  molecular 
aspect  of  solvation.  It  is  therefore  useful  as  pointed  out  by  others  also  [15, 10]  to  use  ET 
(30)  function,  the  empirical  measure  of  solvent  polarity  [16]  for  understanding  the 
polarization  dependence  of  spectral  characteristics.  Figure  3  shows  the  steady-state 
absorption  and  emission  maxima  and  Stokes  shift  as  a  function  of  ET  (30).  As  seen  from 
figure  4,  the  absorption  and  static  emission  maxima  are  linearly  related  to  ET  (30),  the 
slope  of  the  plot  for  emission  being  greater  than  that  for  absorption,  implying  that  the 
solvation  in  Sj  is  consistent  with  the  finding  that  ne  >  \ng.  The  linear  £T(30)  dependence 
of  Stokes -shift  indicates  the  existence  of  general  type  of  solute-solvent  interaction. 

Transient  emission  spectra  recorded  in  protic  solvents  (ethanol  and  n-butanol)  were 
found  to  be  red-shifted  almost  continuously  on  longer  time  scales  available  with  the 
set  up,  while  in  aprotic  solvents  (AN  and  DMSO),  the  shift  though  being  continuous  is 
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Figure  5.     Time  resolved  spectra  of  C  102  at  20  "C. 
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complete  at  early  times  (nearly  45  ps),  indicating  that  solvent  relaxation  in  aprotic 
solvents  is  very  fast  and  falls  within  the  time-resolution  of  the  instrument.  The  observed 
continuous  red-shift  can  be  ascribed  to  a  continuously  evolving  solvent-relaxation.  The 
FC  (Sj)  state  is  a  quasi-continuum  state  consisting  of  discrete  orientational  levels 
corresponding  to  solute-solvent  configurations.  The  transient  spectra  thus  correspond 
to  an  elementary  emission  (almost  continuous)  originating  from  all  possible  orienta- 
tional levels  located  between  non-relaxed  F'C  state  and  equilibrated  excited  state. 
Typical  time-resolved  spectra  for  ethanolic  solutions  is  given  in  figure  5.  The  temporal 
profiles  obtained  from  plots  of  intensity  of  the  upconverted  signal  versus  time  (figure  6) 
yielded  1/e  times  (TASE)  of  45  ps  and  60  ps  respectively  for  n-butanol  and  ethanol  which 
were  found  to  be  independent  of  wavelength.  Consequently,  the  transient  solvation 
phenomena  is  not  apparent  in  the  intensity  decay  profiles.  The  wavelength  indepen- 
dence of  decay  in  the  present  time-resolved  experiment  may  be  due  to  the  narrow 
spectral  coverage  of  the  upconverted  signal.  Such  a  behaviour  has  been  reported  earlier 
[17].  For  AN  and  DMSO  the  decay  falls  within  the  time  response  of  the  system.  The 
TASE  values  correspond  to  reduced  lifetimes  of  S1  state  by  amplified  spontaneous 
emission  via  stimulated  emission.  Since  lifetimes  of  Sl  are  very  much  reduced,  we 
believe  that  other  non-radiative  process  probably  that  involving  rearrangement  of 
solute  molecules  hindered  by  solvent  may  be  operative  since  these  lifetimes  approxi- 
mately correlate  with  the  viscosity  of  the  solvent.  During  the  course  of  solvation  the 
degree  of  charge  transfer  in  Sl  state  may  change  continuously  and  the  interconversion 
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kinetics  of  charge  transfer  transient  species  is  modulated  by  solvent  motion.  We  also 
notice  that  these  lifetimes  are  in  reasonable  agreement  with  TS  values  derived  from 
Stokes  shift  correlation  functions.  However,  it  is  important  to  note  that  TASE  obtained 
from  I(t)  decay  curve  is  concentration  dependent  while  rs  derived  from  C(t)  decay  curve 
is  concentration  independent. 

Solvation  dynamics  in  polar  liquids  with  a  variety  of  probe/solvents  combinations 
have  been  examined  by  different  workers  to  experimentally  verify  the  prediction,  from 
dielectric  continuum  theory  [18,  19]  that  the  solvent  orientational  relaxation  time,  rs, 
as  measured  by  the  fluorescence  spectral  shift  of  a  probe  solute,  should  proceed 
exponentially  in  time  equal  to  solvent's  longitudinal  relaxation  time,  T,,  defined  as 

Tj  =  (fi«>/fiohd  (3) 

where  ex  and  £G  are  infinite  frequency  and  static  dielectric  constants,  respectively  and  id 
is  the  bulk  dielectric  relaxation  time.  The  time  evolution  of  the  solvent-relaxation  is 
usually  quantified  by  the  correlation  function  C(t)  [20]  defined  as 


in  which  vmax(0),  vmax(oo)  and  vmax(t)  are  the  optical  frequencies  that  correspond 
respectively,  to  the  wavelength  maxima  of  emission  spectrum  of  the  probe  at  times  zero, 
infinity  (when  the  solute  solvent  system  is  at  equilibrium),  and  at  intermediate  time 
t  (during  the  relaxation).  In  the  absence  of  specific  interactions  C(t)  provides  a  solute- 
independent  measure  of  solvation  dynamics  as  described  by  Castener  and  Fleming 
[19].  If  the  solvent  orientation  is  described  by  continuum  model,  C(i)  =  exp(—  t/rs), 
i.e.,  C(t)  decays  exponentially  with  a  time  constant  equal  to  ts. 

For  extracting  accurate  C(t)  functions  one  should  use  the  correct  shapes  of  the 
transient  emission  spectra  and  the  peak  frequencies.  The  description  of  the  method  of 
analysis  has.  appeared  in  many  references  [15,  10,  19].  The  method  of  measurement  in 
the  present  system  was  simplified  by  measuring  the  peak  frequencies  v(t)  directly  from 
the  normalized  transient  spectra  (figure  7)  and  v(oo)  was  taken  to  be  the  limiting 
frequency  beyond  which  spectral  shift  did  not  occur.  Typical  normalized  time  evolving 
spectra  for  ethanolic  solutions  is  displayed  in  figure  6.  In  case  of  alcohols,  C(t)  functions 
determined  from  relation  (4)  were  found  to  decay  non-exponentially.  Nevertheless,  the 
decays  were  fitted  well  by  a  single  exponential  dependency  with  time  constants  of 
TS  =  55  ps  for  ethanol  and  65  ps  for  n-butanol  (figure  8).  In  order  to  test  the  validity  of 
dielectric  continuum  model  (DCM)  the  longitudinal  relaxation  times  from  relation  (3) 
were  calculated  using  the  available  dielectric  parameters  at  293  K  [e0  =  26-00; 
£^  =  1-85;  Td  =  130ps,  n0  =  1-359  for  ethanol  [21],  eQ  =  ll-8;  £^  =  3-5;  Td  =  613ps, 
n0=  1-399  for  n-butanol  [19]  and  the  values  are  collected  in  table  2.  In  case  of 
n-butanol  agreement  between  TS  (65  ps)  and  T,  (llOps)  is  sensible,  while  in  case  of 
ethanol  is  (55  ps)  is  about  six  times  greater  than  T,  (9ps)  are  agrees  with  the  value  of 
Ts  =  40  ps  reported  earlier  by  Struve  et  al  [22]  who  used  DMABN  as  a  probe  molecule. 
As  argued  by  Sumi  and  Marcus  [23]  the  better  relation  for  evaluating  the  longitudinal 
relaxation  time  for  alcohols  should  be  T,  =  («o/£o)Td  where  n0  is  the  refractive  index  of 
the  solvent  and  this  relation  yields  il  =  65  ps  which  is  in  excellent  agreement  with  our 
experimental  value.  Interestingly,  our  experimental  value  of  TS  is  in  close  agreement 
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Figure  8.     Stokes  shift  correlation  function  C(t)  for  ethanol  and  n-butanol. 

Table  2.    Dielectric  parameters  and  solvation  times. 


rd 

r^^AoK, 

T(  =  (n2/£o,>T, 

i        ts(obs) 

Solvent 

(ps) 

(ps) 

(ps) 

(ps) 

n-butanol 

613 

120 

67 

65(66a) 

ethanol 

130 

9 

9 

55(40b) 

"Castener  et  al  1987;  bStruve  et  al  1973 

with  earlier  reported  values  of  rs  =  66  ps  for  n-butanol  with  LDS-750  as  a  probe 
molecule  [19].  We  therefore  feel  confident  that  our  simplified  extraction  of  C(t) 
function  from  transient  laser  emission  spectra  are  good  enough  for  the  description  of 
solvation  dynamics  in  n-butanol  except  the  non-exponential  behaviour  of  C(t).  How- 
ever, with  regard  to  ethanol,  the  agreement  between  experimental  and  theoretical 
results  is  obviously  not  good.  These  findings  show  that  the  continuum  approach  is  not 
sufficient  for  the  description  of  actual  dynamics  in  the  present  systems.  As  pointed  out 
by  previous  workers  [24, 25, 26]  the  overall  solvation  dynamics  according  to  molecular 
based  models  should  show  continuous  range  of  times  between  T,  and  rd,  with  the 
shortest  being  rl  and  the  longest  rd.  It  is,  therefore,  concluded  that  the  observed 
deviation  from  DCM  model  particularly  for  ethanol  may  result  from  the  molecular 
aspect  of  solvents  which  is  not  included  in  the  continuum  description.  In  actual 
situation,  the  solute-solvent  interaction  becomes  complicated  due  to  the  change  in  the 
intramolecular  charge  transfer  and  ne  of  S,  state  during  solvation  process.  The  basic 
difference  in  the  two  solvents  may  be  associated  with  the  importance  of  time-dependent 
changes  in  effective  dipole  moments  and  varying  degree  of  polarization  dependence  of 
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applying  reimea  theoretical  models.  As  it  is  well-known,  each  time  scale  has  a  sigmn- 
cance  in  the  upconversion  technique.  However,  the  present  system  utilizing  the 
superradiant  laser  emission  as  a  probe  can  reveal  the  solvation  dynamics  and  is  better 
suited  for  selecting  dyes  as  luminescence  probes  required  for  many  applications  [27]. 
Orientational  relaxation  time  of  78  ps  for  C 102  in  ethanol  obtained  from  induced 
dichroism  technique  is  greater  than  the  solvent-relaxation  time  of  55  ps.  This  shows 
that  rotational  motion  of  the  solute  does  not  influence  greatly  that  of  the  solvent. 


The  authors  are  grateful  to  the  Defence  Research  and  Development  Organisation,  New 
Delhi  for  funding  the  Picosecond  Laser  Spectroscopy  Programme. 
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ract.  A  dedicated  working  setup  for  studying  the  process  of  atomic-field  bremsstrahlung 
its  dependence  on  various  parameters  for  keV  electrons  incident  on  a  solid  or  a  gaseous 
;t  has  been  indigenously  developed.  The  setup  consists  of  a  high  vacuum  scattering  chamber 
;hed  with  a  rotatable  X-ray  photon  detector,  a  home-built  high  voltage  electron  gun  with 
jlaceable  tungsten-filament  cathode,  an  isolated  floating  high  voltage  system,  high  vacuum 
ping  units,  various  signal  processing  electronic  modules  and  an  IBM  PC/XT  based 
nultichannel  analyzer.  A  brief  description  of  the  facility  is  presented.  The  performance  of  the 
ty  has  been  tested  by  recording  the  bremsstrahlung  spectra  from  7-0  keV  electrons  on  thin  Ag, 
nd  7-5  keV  electrons  on  Hf  targets;  the  corresponding  spectra  are  presented  and  discussed.  The 
can  operate  in  the  range  of  0-8-0  kV  accelerating  voltage  in  the  present  configuration.  Other 
ble  experiments  that  can  be  performed  on  the  setup  are  also  briefly  mentioned. 

ivords.    Atomic-field  bremsstrahlung;  electron  gun;  isolated  floating  high  voltage  system. 
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ntroduction 

;tron  bremsstrahlung  (also  called,  atomic-field  bremsstrahlung  (AFB))  is  common- 
nderstood  to  be  the  radiation  of  a  photon  in  the  scattering  of  an  electron  from  an 
n  or  an  ion  under  its  respective  screened  nuclear  Coulomb  field.  It  is  one  particular 
sequence  of  the  general  coupling  of  the  electromagnetic  field  and  the  matter  fields, 
process  is  closely  related  both  to  elastic  electron  scattering  and  to  direct  radiative 
>mbination  in  which  the  incident  electron  does  not  scatter  but  is  captured  by  the 
n.  Electron  bremsstrahlung  can  teach  us  about  many  fundamental  aspects  of  the 
pling  of  radiation  and  matter  fields  in  addition  to  an  application  as  a  probe  for 
lying  the  atomic  structures. 

lectron  bremsstrahlung  is  of  immense  practical  interest  and  importance  for  a 
.ety  of  applications;  for  example,  application  in  astrophysics  (interpretation  of 
ophysical  spectra)  and  for  modelling  hot-plasmas,  in  plasma  programmes  -  both  in 
jnetic  confinement  and  inertial  confinement  (determination  of  plasma-tempera- 
:s,  prediction  of  plasma  properties,  estimation  of  plasma  heating).  Other  applica- 
is  concern  transport  and  energy-loss  of  charged  particle  beams,  shielding  and 
iation  damage  both  to  materials  and  biological  tissues.  The  current  understanding 
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acquisition  system.  Many  experiments  have  been  performed  over  the  years  to  measure 
the  dependence  of  the  radiation  process  on  a  variety  of  parameters,  such  as,  the  incident 
electron  energy,  the  atomic  number  of  the  target  material,  the  photon  emission  angle 
and  the  photon  energy.  Most  of  the  previous  experiments  were  performed  in  the  high 
energy  range  (above  50  keV)  of  the  incident  electrons  with  solid  targets  [2].  Only 
recently,  experimental  data  have  become  available  on  AFB  process  in  low  impact 
energies  with  gaseous  targets  [3]. 

To  study  the  process  of  AFB  generation,  mostly  thin  solid  foils  are  used  as  targets 
however,  in  some  recent  experiments,  gaseous  targets  were  also  employed.  In  the  ideal 
case,  the' target  should  be  so  thin  that  the  incident  electron  undergoes  only  one  single 
collision  on  the  way  through  it  so  that  the  bremsstrahlung  process  is  not  disturbed  by 
additional  (elastic  or  inelastic)  scattering  processes.  The  advantage  in  the  use  of  thin 
targets  is  that  the  effect  of  one  single  interaction  is  much  easier  to  calculate  theoretically 
than  the  corresponding  multiple  electron  scattering  problem.  Hence,  such  experiments 
can  be  compared  much  more  directly  with  theory.  However,  the  problems  involved  in 
measurements  with  small  counting  rates  from  very  thin  target  foils  without  destroying 
them  are  considerable. 

The  subject  matter  of  the  present  paper  is  concerned  mainly  with  the  developmental 
aspect  of  an  experimental  facility  for  investigating  the  AFB  process  as  a  function  of  its 
relevant  parameters.  Design,  fabrication  and  description  of  various  components  of  the 
setup  are  given.  Few  typical  examples  of  the  recorded  spectra  of  electron  bremsstrah- 
lung process  are  also  presented  from  test  runs  on  the  developed  facility.  The  major 
motivation  to  develop  the  said  facility  for  studying  the  AFB  process  with  solid  and 
gaseous  targets  originates  from  three-fold  reasons:  (i)  no  such  experimental  facility  is 
operative  anywhere  in  the  country,  (ii)  there  is  a  great  need  of  experimental  data  for 
AFB  at  keV  energies  for  testing  the  existing  theories  and  (iii)  it  is  today's  demand  and 
concern  to  develop  the  indigenous  components  of  the  setup  in  the  laboratory  itself  and 
to  create  a  culture  of  designing  and  fabricating  the  equipments  for  use  in  research  and 
training  areas.  As  a  matter  of  fact,  these  objectives  have  invited  a  challenging  task  to 
any  researcher  in  the  country.  Our  effort  has  been  directed  to  achieve  the  same  within 
the  constraints  of  our  infra-structural  facility  and  funds  availability. 

2.  Design  and  fabrication  of  AFB  set  up 

The  AFB  setup  consists  of  the  following  major  parts: 

-  a  high  vacuum  scattering  chamber 

-  a  high  voltage  electron  gun 

-  an  isolated  floating  high  voltage  system 

-  high  vacuum  pumping  units 

-  signal  processing  and  data  acquisition  system 

Description  of  each  part  is  given  in  the  respective  subsections  as  under: 
2.1  High  vacuum  scattering  chamber 

The  high  vacuum  scattering  chamber  comprises  the  following  components: 
(1)  main  scattering  chamber 


subsections: 

2.1.1  Main  scattering  chamber:  The  high  vacuum  scattering  chamber  is  a  cylindrical 
stainless  steel  pipe  of  190  mm  height  and  44  mm  inner  diameter  as  shown  in  figure  la.  It 
has  two  collinear  ports  each  fitted  with  a  KF  10  coupling;  one  adopts  a  high  voltage 
electron  gun  and  the  collimating  slits  St  and  S2  (see,  figure  2b),  the  other  couples  an 
insulated  Faraday  cup  (see,  subsection  2.1.2).  Two  slots  of  height  8-0  mm  are  cut  in  the 
cylinder  on  both  sides  of  the  collinear  ports  in  a  horizontal  plane  containing  the 
electron-beam,  the  target  and  the  axis  of  photon  detector.  Slots  are  covered  with  a  6  fi 
thick  hostaphan  foil  which  is  strong  enough  to  withstand  the  pressure  in  the  chamber 
and  transparent  enough  to  transmit  the  X-ray  photons  of  2-0  keV  energy  up  to  70%. 
The  top  flange  of  the  chamber  adopts  a  movable  target  holder  in  the  centre  which 
permits  the  positioning  of  targets  in  place  without  cycling  the  vacuum  of  the  system. 
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Figure  la.    Cross-section  view  of  the  main  scattering  chamber.  A:  $  10mm 
rotational  vacuum  feedthrough;  B:  KF  10;  C:  SS-collar;  D:  Al-rotatable  platform. 
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2.1.2  Faraday  cup:  Faraday  cup  is  a  device  to  collect  the  charge  of  the  transmitted 
electron  beam  through  the  target.  In  our  setup,  it  consists  of  two  coaxial  hollow-Al- 
cylinders;  the  length/diameter  of  the  inner  and  outer  cylinders  are  25  mm/7  mm  and 
56  mm/16  mm,  respectively.  The  outer  electrode  is  negatively  biased  to  prevent  second- 
ary electrons  produced  in  the  inner  cylinder  from  escaping,  while  the  monitoring  of  the 
collected  charge  is  made  on  the  latter.  The  two  electrodes  are  mounted  on  a  perspex 
flange  which  has  BNC  connectors  to  provide  for  biasing  and  current  monitoring 
outputs.  The  two  electrodes  mounted  on  the  perspex  flange  are  coupled  to  an 
aluminium  enclosure  (not  shown  in  figure  Ib)  through  a  KF  10  coupling  at  its  other 
end.  The  current  is  measured  and  integrated  by  a  current  integrator  (ORTEC  439). 
Figure  Ib  shows  a  cross-section  view  of  the  Faraday  cup  with  dimensions. 

2.1.3  Target  device:  Thin  foils  of  solid  targets  are  mounted  on  the  aluminium  frame 
having  4-consecutive  holes  each  of  12-0  mm  diameter.  The  frame  is  attached  on  an 
insulated  SS-shaft  which  enables  the  positioning  of  foils  at  a  desired  place  and  at 
a  chosen  angle  in  the  centre  of  the  chamber  using  a  vacuum  seal  device  (see,  figure  Ic). 
The  gaseous  targets  are  introduced  in  the  collision  zone  through  a  tube  attached  with 
a  hypodermic  needle  (4>  0-5  mm)  or  with  a  multicapillary  tube  with  pore  size  100  /j.  (not 
shown  in  figure).  The  effusing  gas  is  finally  pumped  down  by  a  diffusion  pump  placed 
underneath.  The  adjustment  of  the  input  gas  pressure  in  the  chamber  is  done  by  a  fine 
needle  valve  (FCV  10K,  L  H  Alliston,  England)  in  such  a  way  that  a  single  collision 
condition  is  maintained.  The  pressure  of  the  gas  in  the  vicinity  of  the  collision  zone  is 
measured  by  a  MKS  capacitance  manometer  (Model  127aa-00001D).  The  single 
collision  condition  is  ensured  by  measuring  a  linear  dependence  of  the  gas  pressure  as 
a  function  of  bremsstrahlung  photon  counts  in  a  chosen  energy  window  (A/c)  per 
incident  electron  for  a  given  impact  energy  of  the  electrons.  The  target  gas  pressure 
measured  by  the  capacitance  manometer  under  this  condition  has  been  found  to  be 
typically  of  the  order  of  3  x  10~3mbar.  The  gas  density  relative  to  this  pressure  is 
estimated  to  be  about  1013  atom/cm3  in  the  interaction  zone.  This  'thin'  target  ensures 
that  multiple  scattering  events  are  negligible. 
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Figure  Ib.    Cross-section  view  of  the  Faraday  cup  assembly. 
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Figure  Ic.     Solid  and  gas  target  devices.  A:  solid  target  holder;  B:  gaseous  target 
device. 

2.1.4  Rotatable  Si(Li)  X-ray  detector:  An  aluminium  platform  (530mm  x  250mm  x 
10  mm)  is  coupled  with  the  main  chamber  at  its  one  end  and  rotates  about  the  collision 
centre  (see,  D  in  figure  la).  A  Canberra  Si(Li)  X-ray  photon  detector  (Model  SL80175, 
Cryostat  Model  7905-7-5)  is  placed  on  this  platform  with  its  Be-window  facing  the 
8-0  mm  slot.  The  detector  can  be  positioned  at  any  angle  between  30°  and  150°  on  one 
side  and  between  210°  and  270°  on  the  other  side  of  the  collinear  ports  with  a  precision 
of  +  1°.  Alignment  of  the  cryostat  with  respect  to  the  centre  of  collision  can  be  made 
manually  on  the  platform.  The  bottom  of  the  platform  is  attached  with  three  metallic 
supports  which  rest  on  the  ground.  These  supports  are  fitted  with  an  arrangement  to 
adjust  their  heights  as  well  (not  shown  in  figure  la). 

2.2  High  voltage  electron  gun 

In  the  present  setup,  we  have  used  an  electron  gun  taken  out  from  one  of  the  discarded 
cathode  ray  oscilloscopes.  Its  indirectly  heated  oxide-coated  cathode  assembly  was 
removed  and  it  was  replaced  by  an  arrangement  containing  a  hair-pin  tungsten 
filament  (00-15  mm)  and  a  pair  of  voltage/current  feedthroughs  as  shown  in  figure  2a. 
This  arrangement  with  a  'replaceable-filament'  is  desirable  because  it  allows  us  to 
operate  the  gun  with  frequent  exposures  to  atmosphere.  The  tungsten  filament  is  more 
rugged  and  resistant  to  oxidation  contrary  to  the  oxide-coated  cathodes.  The  arrangement 
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Figure  2a.  Cross-section  view  of  the  replaceable  filament-assembly  of  the  high 
voltage  electron  gun.  A:  copper  screw;  B:  ceramic  tubing;  C:  Al-cylinder;  D:  Brass 
nut/bolt;  E,  E:  Filament  connecting  leads;  F:  Tungsten  filament;  G:  Grid  cylinder. 


F.A. 


Figure  2b.  Schematic  sketch  of  the  electron  gun  with  different  electrodes.  FF: 
Filament;  G:  Grid:  A1,A2,A3:  Anodes;  FA:  Focussing  anode;  X^Xj,  Y^Y^ 
Deflector  plates;  S1,S2'  Collimating  slits. 


components  of  the  electron  gun,  namely,  control  grid,  anode,  focussing  electrode,  X- 
and  Y-deflector  plates,  etc.  are  shown  in  figure  2b.  The  input  voltages  to  each  electrode 
of  the  gun  are  provided  by  pins  mounted  on  a  perspex  flange  having  a  diameter 
100-0  mm  and  a  thickness  12-0  mm.  The  complete  electron  gun  is  made  to  rest  vertically 
on  the  perspex  flange  with  three  aluminium  pillars  (0  3-0  mm)  attached  to  the  base  of 
the  gun.  An  enclosure  for  the  electron  gun  has  been  fabricated  out  of  aluminium  which 


is  coupled  with  the  main  scattering  chamber  through  a  KF  10  coupling.  The  other  two 
ports  fitted  with  KF  10  and  KF  25  couplings  of  the  enclosure,  respectively,  provide  the 
facility  to  connect  a  Penning  gauge  and  a  differential  pumping  system  (VS-114D). 
A  collimating  slit  assembly  B!  and  S2  having  carbon-apertures  of  3-0  mm  on  a  copper 
pipe  of  length  50-0  mm  is  fitted  between  the  main  scattering  chamber  and  the  electron 
gun  enclosure.  This  arrangement  facilitates  the  alignment  of  the  beam  on  the  target 
more  precisely  on  one  hand  and  cuts  down  the  background  radiation  considerably  on 
the  other.  Various  voltages  applied  to  gun-electrodes  are  discussed  in  §2-3. 

2.3  Isolated  floating  high  voltage  system 

An  isolated  floating  high  voltage  system  for  grid  and  filament  power  supplies  has  been 
developed.  This  was  necessary  in  order  to  keep  the  anode  at  earth  potential  and  the 
cathode/grid  at  a  high  negative  voltage.  A  20  kV  isolation  transformer  (M/s  Sairam, 
Bombay)  was  used  to  fulfil  this  job.  Necessary  grounding  for  high  voltage  and  for  other 
units  is  derived  from  a  common  earthing  terminal.  All  units  floating  on  high  voltages 
are  enclosed  in  a  cage  consisting  of  12mm  thick  perspex  sheets  covered  with 
a  grounded  metallic  net.  The  regulation  of  voltage/current  is  made  from  outside  of  the 
cage  using  the  insulator  shafts  attached  to  the  power  supplies.  A  block  diagram 
showing  the  electrical  connections  for  the  isolated  floating  high  voltage  system  is  given 
in  figure  3. 

Two  regulated  high  voltage  dc  power  supplies  (0-20  kV,  0-1-5  mA)  purchased  from 
M/s  FUG  GmbH,  Germany,  were  used  to  provide  accelerating  and  focussing  voltages 
in  the  gun.  The  voltage  uncertainty  of  these  power  supplies  is  about  1  %.  The  tungsten 
filament  (6  watt)  was  heated  by  a  constant  current  power  supply  (0-15  V,  0-15  A)  and 
the  grid  voltage  was  kept  slightly  negative  to  the  cathode  voltage  by  a  separate  power 
supply.  A  typical  set  of  values  for  voltage/current  on  different  electrodes  for  obtaining 
a  well  collimated  beam  (4>  3-0  mm)  of  6-0  n A  on  a  Ag-target  with  6-0  keV  electrons  is 


F   F 


A.C.MAINS 


ISOLATION  CAGE 

Figure  3.  Electrical  controls  for  an  isolated  floating  high  voltage  system.  FPS: 
Filament  power  supply;  GPS:  Grid  power  supply;  IT:  Isolation  transformer;  FF: 
Leads  to  filament;  G:  Lead  to  grid. 
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Table  1.  Typical  values  of  various  voltages  and  cur- 
rents applied  to  gun-electrodes  for  a  beam  of  6-0  nA, 
diameter  of  3-0  mm  on  the  target  for  6-0  keV  electrons 
incident  on  a  thin  Ag  foil  (157/zgm/cm2). 

Filament  voltage  2-1 V 

Filament  current  3-0  A 

Cathode  voltage  -  6-0  kV 
Emission  current  0-007  mA 

Focussing  voltage  —  5-92  kV 
Emission  current  0-000  mA 

Grid  voltage  -0-5V 
Xrdeflector  plate  0-0  V 

X2-deflector  plate  100-0  V 
Y! -deflector  plate  0-0  V 

Y2-deflector  plate  100-0  V 
Beam  current  6-0  nA 


presented  in  table  1.  A  visual  check  of  the  dimensions  of  beam  spot  was  made  on  a  ZnS 
screen  placed  at  the  target  position. 

2.4  High  vacuum  pumping  system 

The  main  scattering  chamber  is  evacuated  by  a  2-5"  oil-diffusion  pump  (VS-65D, 
100 1/s)  and  the  electron  gun  enclosure  by  a  4-5"  oil-diffusion  pump  (VS-1 14D,  300 1/s); 
both  pumping  units  are  purchased  from  M/s  Hind  High  Vacuum  Co.  Pvt.  Ltd., 
Bangalore.  The  pressures  maintained  by  rotary  and  diffusion  pumps  are  measured  by 
a  Pirani  and  a  Penning  gauge  respectively.  The  base  pressure  attained  in  the  scattering 
chamber  and  that  in  the  electron  gun  enclosure  is  better  than  2  x  10  ~6  mbar  without 
using  a  liquid  nitrogen  trap. 

2.5  Signal  processing  and  data  acquisition  system 

A  block  diagram  of  various  electronic  modules  used  for  processing  the  photon  and 
electron  signals  is  shown  in  figure  4.  Standard  NIM  modules  have  been  used  to  process 
the  photon  signals  generated  by  the  Si(Li)  detector.  A  Canberra  Si(Li)  X-ray  photon 
detector,  preamplifier  (PA,  Model  2008),  H.V.  power  supply  (Canberra,  Model  30020) 
and  spectroscopy  amplifier  (Canberra,  Model  2020)  constitute  an  integral  part  of  the 
X-ray  photon  detection  and  corresponding  signal  processing  system.  An  IBM  PC/XT 
based  4K-Multichannel  Analyzer  (MCA)  (M/s  Nucleonix,  Hyderabad)  is  interfaced 
with  the  photon  signals  through  an  ADC  (see,  figure  4).  Data  acquisition,  stripping  and 
plotting,  etc.  are  carried  out  on  this  unit.  Monitoring  of  the  beam  current  is  accom- 
plished by  a  picoammeter  (UNIBI,  Bielefeld)  and  -integration  of  which  is  done  by 
a  current  integrator  (ORTEC  439).  All  spectra  are  recorded  in  the  pulse-height  analysis 
(PHA)  mode  of  the  MCA  and  stored  in  the  hard  disc/floppy  disc  of  the  computer.  The 
spectroscopy  amplifier  (SA)  has  a  built-in  facility  to  switch  the  pulse-pile-up  rejection 
on  or  off  while  recording  the  spectra.  This  provision  facilitates  the  rejection  of 
pulse-pile-up  events  during  acquisition  of  the  data.  The  shaping  time  of  the  SA  was 
chosen  to  be  12  us  in  order  to  obtain  the  best  resolution  of  the  Si  (Li)  detector 
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Figure  4.  Signal  processing  and  data  acquisition  system.  PA:  Preamplifier;  SA: 
Spectroscopy  amplifier;  ADC:  Analog  to  digital  converter;  HV:  High  voltage  power 
supply;  CRO:  Cathode  ray  oscilloscope;  C/T:  Counter  timer;  VT:  Video  terminal; 
PLT:  Plotter,  CI;  Current  integrator;  PI-A:  Pico  ammeter;  FC:  Faraday  cup;  EG: 
Electron  gun. 


(FWHM  =  250  eV  at  5-9  keV).  Emulation  software  of  4K-MCA  is  used  to  accumulate, 
strip  and  analyze  the  data. 

3,  Operation,  test  runs  and  energy  calibration 

A  successful  operation  of  the  developed  facility  essentially  involves  simultaneous 
smooth  functioning  of  the  vacuum  system  maintaining  the  base  pressure  of  the  system 
better  than  2  x  10  ~6  mbar,  a  stable  output  of  the  collimated  beam  from  the  electron 
gun,  a  photon  detector  optimized  for  a  high  signal  to  noise-ratio  and  properly  shaped 
and  amplified  photon-signals  as  input  to  ADC.  When  all  of  these  are  made  functional 
with  their  optimized  values,  accumulation  of  the  desired  AFB  spectra  for  a  chosen 
target  and  incident  energy  is  carried  out  on  an  IBM  PC/XT  based  MCA  in  PHA  mode 
(see,  figure  4).  As  AFB  intensity  depends  on  Z2  (Z  =  atomic  number  of  the  target), 
accumulation  time  of  the  AFB  spectra  depends  on  the  target  and  beam  current  used  in 
the  experiment.  For  targets  having  Z  between  40  and  80,  data  collection  times  with 
1-3  nA  beam  current  are  found  to  be  20-30  minutes  more  than  sufficient  for  a  2-3% 
counting  statistics.  The  background  radiation  arises  due  to  elastic  and  inelastic 
scattering  of  the  incident  electrons  from  materials  in  the  neighbourhood  of  the  target 
used,  in  particular,  the  radiation  originated  in  the  zone  being  away  from  the  beam- 
target  interaction  region  and  seen  by  the  detector  in  a  given  geometrical  configuration 
contributes  maximum.  This  problem  of  background  radiation  can  be  circumvented  by 
collimating  the  incident  beam  with  narrow  slits  and  by  employing  a  limited  solid  angle 
of  the  detector.  The  latter  part  can  be  readily  achieved  by  using  a  narrow  aperture  on 
the  detector's  window.  We  have  investigated  the  background  radiation  as  a  function  of 
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impact  energy  of  the  electrons.  We  found  that  it  increases  almost  by  a  factor  of  two  by 
changing  the  incident  energy  of  electrons  from  2-5  keV  to  7-0  keV.  However,  having 
considered  the  above  points,  one  must  make  note  that  mere  presence  of  the  foreign 
materials  in  the  vicinity  of  target  can  cause  background  radiation,  for  example,  the 
materials  of  the  frame  on  which  the  targets  are  mounted  are  the  main  source  &f 
background  radiation.  In  order  to  reduce  the  intensity  of  the  background  radiation,  it 
is  always  preferred  to  use  a  low-Z  material  for  constructing  the  target-frame.  Generally, 
the  background  radiation  is  recorded  by  introducing  the  empty  frame  in  front  of  the 
beam.  Nevertheless,  this  method  is  good  enough  as  long  as  there  exists  a  small 
probability  of  the  incident  electrons  that  are  elastically  scattered  from  the  target.  In 
AFB  spectra,  the  background  subtraction  is  a  serious  and  important  task  while 
analyzing  the  data  for  determination  of  the  AFB  cross-sections. 

A  few  test  runs  for  AFB  spectra  produced  from  7-0  and  7-5  keV  electrons  on  thin  Ag 
(157/igm/cm2),  Au  (193^gm/cm2)  and  Hf  (6-6mg/cm2)  targets  have  been  carried  out 
on  the  present  facility  and  displayed  in  figures  5(a-c)  respectively.  The  spectra  have 
been  recorded  by  placing  the  photon  detector  at  an  angle  of  90C  and  the  foil  tilted  at  45° 
with  respect  to  the  incident  beam  direction.  The  channel  numbers  of  the  MCA  are 
calibrated  for  corresponding  photon  energies  by  using  a  standard  55Fe  radioactive 
source.  The  source  is  placed  at  the  target  position  and  the  characteristic  Mn-Ka  and  K^ 
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Figure  5a.     Electron  bremsstrahlung  photon  energy  spectrum  for  7-0  keV  electrons 
on  a  thin  Ag  target. 
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Figure  5b.    Electron  bremsstrahlung  photon  energy  spectrum  for  7-0  keV  electrons 
on  a  thin  Au  target. 
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Figure  5c.    Electron  bremsstrahlung  photon  energy  spectrum  for  7-5  keV  electrons 
on  a  thin  Hf  target. 

lines  appearing  at  5-9  keV  and  6-492  keV  respectively  are  recorded  using  the  identical 
electronic  setup  which  was  employed  for  accumulating  the  AFB  spectra  of  different 
targets.  A  typical  spectrum  of  55Fe  radioactive  source  as  recorded  is  shown  in  figure  6. 
The  relative  efficiency  of  the  Si(Li)  detector  e(fc)  was  determined  by  recording  the 
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Figure  6.    A  typical  photon  energy  spectrum  from  a  55Fe  radioactive  source. 


bremsstrahlung  radiation  emitted  from  7-0  keV  e  -Au  collision  in  the  photon  energy 
range  of  2-0-7-0  keV  following  the  procedure  given  in  ref.  [4].  The  efficiency  was  placed 
on  an  absolute  scale  by  the  measurement  of  Mn  Ka-line  from  a  calibrated  5  Fe 
radioactive  source.  The  corrections  for  electron  back-scattering,  energy  loss  and  X-ray 
absorption  were  taken  into  account.  The  geometrical  efficiency  of  our  detector  was 
experimentally  determined  to  be  1-71  x  10~4  at  k  =  5-9 keV  while  its  value  from  the 
theoretical  AFB  cross-section  was  found  to  be  1-38  x  10~4.  Both  results  are  found  to 
agree  with  each  other  within  the  uncertainty  of  the  target  thickness  of  20%.  The 
electron  bremsstrahlung  spectra  produced  from  gaseous  targets  are  similar  and  the 
work  in  this  direction  is  in  progress. 

4.  Typical  results  and  discussion 

Figures  5a-c  display  the  bremsstrahlung  photon-energy  spectra  produced  from 
7-0  keV  and  7-5  keV  electrons  on  thin  foils  of  Ag,  Au  and  Hf  respectively.  These  spectra 
exhibit  a  continuous  radiation  of  photons  with  a  well-defined  high  energy  'tip' 
appearing  at  the  maximum  energy  of  the  incident  electrons.  In  figure  5a,  the  energy 
spectrum  shows  a  well-pronounced  characteristic  Ag-L  peak  around  3-OkeV  over 
a  continuous  bremsstrahlung  spectrum.  Ag-L  line  could  not  be  resolved  into  its 
constituent  components  due  to  insufficient  resolution  of  the  Si(Li)  detector.  The  cut-off 
of  the  low  energy  photons  in  all  spqctra  below  about  2-0  keV  arises  due  to  partial 
absorption  of  X-rays  in  the  dead  layers  of  silicon  and  gold  of  the  detector  at  Si-K  and 
Au-M  edges  and  partials  in  the  target.  Our  recent  results  for  the  ratio  of  double 


differential  cross-sections  for  the  bremsstrahlung  photon  emission  from  Ag  and  Au  thin 
targets  when  bombarded  with  7-0  ke V  electrons  yield  an  excellent  agreement  with  the 
theoretical  predictions  on  the  relative  shape  of  the  spectrum  [5]. 

5,  Other  feasible  experiments  with  the  facility 

The  present  setup  as  it  is  and  one  with  some  further  modification,  can  be  used  for  other 
feasible  experiments  as  mentioned  below: 

1.  Angular  distribution  measurements  of  the  bremsstrahlung  spectra  as  a  function  of 
the  incident  electron  energy  ( T)  and  the  atomic  number  (Z)  of  the  target. 

2.  Innershell  ionization  cross-section  of  gaseous  and  solid  targets  at  keV  electron 
energies  as  a  function  of  Z  and  T. 

3.  Measurement  of  bremsstrahlung  photons  detected  in  coincidence  with  decelerated 
electrons  scattered  in  a  fixed  direction;  thus  information  on  the  elementary  radiation 
process  can  be  obtained  and  a  more  stringent  test  of  theoretical  work  is  possible. 

4.  Multiple  ionization  of  free  atoms  by  using  a  parallel  plate  electrostatic  analyzer  and 
a  time-of-flight  spectrometer. 

The  investigations  on  problems  (3)  and  (4)  above  are  under  active  plan  which  will  be 
taken  up  in  the  next  phase  of  the  development. 

Conclusions 

In  this  work,  we  have  presented  the  description  of  a  dedicated  experimental  setup 
developed  indigenously  for  investigating  the  atomic-field  bremsstrahlung  process  for 
keV  electrons  on  a  solid  or  a  gaseous  target.  The  design  and  fabrication  of  various 
components  of  the  setup  has  been  given.  The  high  voltage  electron  gun  can  operate  in 
the  range  of  0-8  kV  accelerating  voltage  in  the  present  configuration.  The  performance 
of  the  facility  has  been  tested  by  recording  the  electron  bremsstrahlung  spectra  for 
collisions  of  7*0-7-5  keV  electrons  with  thin  solid  targets  and  shown  that  the  aimed 
objectives  are  fulfilled  with  satisfaction.  It  may  be  mentioned  that  the  estimated  cost  of 
our  indigenously  built  high  voltage  electron  gun  system  is  about  5-6  times  lesser  than 
that  of  the  custom-built  unit.  The  work  with  gaseous  targets  is  in  progress.  Further, 
other  possible  experiments  on  the  facility  in  the  area  of  electron-atom/molecule 
collisions  at  keV  energies  are  also  mentioned. 
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oscillator  equation 
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Abstract.  The  Painleve  analysis  is  applied  to  the  anharmonic  oscillator  equation 
x  +  dx  +  Ax  +  Bx2  +  Cx3  =  0.  The  following  three  integrable  cases  are  identified:  (i)  C  =  0, 
d2  =  25  A/6,  A  >  0,  B  arbitrary,  (ii)  d2  =  9/4/2,  B  =  0,  A  >  0,  C  arbitrary  and  (iii)  d2  =  -  9 A/4, 
C  =  2B2/(9A),  A  <  0,  C  <  0,  B  arbitrary.  The  first  two  integrable  choices  are  already  reported  in 
the  literature.  For  the  third  integrable  case  the  general  solution  is  found  involving  elliptic 
function  with  exponential  amplitude  and  argument. 

Keywords.    Anharmonic  oscillator;  Painleve  analysis;  exact  solution. 
PACSNo.    02-90 

1.  Introduction 

In  this  paper  we  study  the  integrability  of  the  nonlinear  system 

x  +  dx  +  Ax  +  Bx2  +  Cx3  =  0,  (1) 

where  an  overdot  denotes  differentiation  with  respect  to  time  t,d  is  the  damping 
coefficient,  B  and  C  are  quadratic  and  cubic  nonlinearities  coefficients,  and  A  is  the 
square  of  the  natural  frequency  of  the  system.  Very  recently,  chaotic  behaviour  in  (1) 
subjected  to  a  harmonic  parametric  excitation  is  studied  [1].  The  anharmonic  oscil- 
lator equation  (1)  arises  in  many  situations  in  physics  and  engineering  [2]  apart  from 
the  application  to  a  classical  anharmonic  oscillator.  The  addition  of  an  external 
periodic  forcing  term  to  (1)  with  B  =  0  results  in  the  Duffing  oscillator  and  with  C  =  0 
gives  mechanical  oscillator.  System  (1)  with  external  periodic  force  and  parametric 
perturbation  exhibits  complex  dynamics  [1, 3-5].  Some  special  analytical  solutions  of 
(1)  are  reported  in  [2].  The  integrability  of  (1)  with  B  —  0  is  studied  by  Euler  et  al  [6], 
Duarte  et  al  [7],  Parthasarathy  and  Lakshmanan  [8]  and  Estevez  [9]. 

Our  motivation  in  the  present  paper  is  to  investigate  the  integrability  of  (1)  for 
nonzero  values  of  d,  A,  B  and  C.  Particularly  by  applying  the  Painleve  analysis  [10, 1 1] 
we  show  that  the  system  (1)  is  integrable  for  the  specific  choice  of  the  parameter  values 
d2  =  - 9 A/4,  C  =  2B2/(9A),  A<Q  and  B  arbitrary  in  addition  to  the  two  other 
integrable  choices  reported  in  the  literature  for  special  form  of  (1).  For  this  choice  we 
then  give  the  analytical  solution. 

2.  Painleve  analysis  of  the  damped  anharmonic  oscillator 

The  P-property  requires  that  the  solutions  of  (1)  may  be  written  as  Laurent  series 


expansion  in  the  complex  variable  [10, 1 1]  T  =  t  -  t0  with  leading  order 

x  ~  a  TP  (2) 

where  p  is  yet  determined.  Inserting  the  dominant  term  (2)  in  (1)  one  finds  two 
possibilities 

(i)  p  =  -  2,     a0  =  -  6/J5,     C  =  0. 

Now  to  find  higher  order  terms  we  write 

Y  —  n  TP  +  aTp  +  r  (3) 

,V    "Q   <•         ~l*(.  V-V 

and  substitute  in  (1)  to  obtain  resonances,  that  is,  conditions  such  that  arbitrary 
constants  may  enter  in  the  expansion  (3).  For  the  case  p  =  —  2  resonance  condition  is 
(r  +  1)  (r  —  6)  =  0.  In  other  words,  the  resonances  occur  at  r  =  —  1, 6.  For  the  case  (ii) 
(p=  -  1)  the  resonance  values  become  r=  —  1,4.  The  root  -  1  corresponds  to  the 
arbitrariness  of  r0. 

To  verify  the  occurrence  of  sufficient  number  of  arbitrary  constants  we  introduce  the 
series  expansion 

r 

0  i_j       k  ,  \"/ 

k=l 

in  (1)  and  equate  the  coefficients  of  various  powers  of  T  to  zero. 

Case  (i):  Here  the  resonances  are  r  =  -  1, 6.  The  values  of  the  coefficients  «,., 
j  =  1, 2, . . . ,  5  are  given  by 


),    a2  =  (d2-25A)/(50B\    a3= 

a4  =  (Id4  -  125A2)/(5000J3),    a5  =  (19d5  -  1375dA2)/(75000J3). 
For  a6  we  obtain 

a6(0)  =  25A  -  6d2. 
Hence,  case(i)  possesses  P-property  for 

C  =  0,    d2  =  25A/6,    A>Q,    B  arbitrary.  (5) 

Case  (ii):  Here  p  =  -  1,  r  =  -  1,  4.  From  the  coefficients  of  r"3,  r'2,  T~  l,  x°  we  obtain 

aJ=-2/C,    a1=-(2B  +  Crfa0)/6C,    'a2  =  a0(6AC  -  2B2  - 


-  3B2  -  -  (6) 


respectively.  From  the  coefficients  of  r1  in  (1)  we  get 

a4(0)  =  -  2da3  -  2a0a3(B  +  3Ca,  )  -  3Ca0a22 


Simple  manipulation  of  the  above  equation  yields  a3  =  0.  From  (6)  we  note  that  a 
becomes  zero  either  for  da0^/(WSC)  +  a,  =  0  with  at  ^  0,  a2  ^  0  or  for  ax  =  a2  =  0 
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where 

<*l=9AC-3B2-2d2C,    a2  =  B(252-9/lC)/(108C2).  (7) 

In  the  next  section  we  show  that  analytical  solution  exists  for  al  =  a2  =  0,  that  is,  for 
2B2-9AC  =  0,    9AC-3B2-2d2C  =  Q.  (8) 

From  (8)  the  integrable  conditions  are 

d2  =  -  9 A/4,    C  =  2B2/(9A),    A  <  0,     B  arbitrary.  (9) 

Further  from  (7),  the  conditions  <x.1  —  0,  «2  =  0  with  B  =  0  gives 

d2  =  9 A/2,    B  =  0,    A>Q,    C  arbitrary.  (10) 

Thus  we  find  that  (1)  possesses  P-property  for  three  parametric  restrictions  given  by  (5), 
(9)  and  (10).  The  third  integrable  choice  (10)  is  reported  in  [6, 8, 9].  To  the  best  of  our 
knowledge  the  integrable  choice  (9)  has  not  been  listed  in  the  literature. 

3.  Analytical  solution  for  the  integrable  choice  (9) 

Analytical  solution  for  the  integrable  choices  (5)  and  (10)  are  given  in  [12, 2, 8].  In  this 
section  we  obtain  the  exact  solution  of  (1)  for  the  parametric  restrictions  (9).  For 
nonlinear  differential  equations,  at  present,  no  general  methods  are  available  to  find 
analytical  solutions.  However,  exact  analytical  solution  is  reported  in  the  literature  for 
certain  form  of  second-order  ordinary  differential  equation.  For  example  in  ref.  [12] 
analytical  solution  is  given  for  the  equation 

W+AW+BW3=Q,  (11) 

where  A  and  B  are  arbitrary.  Since  the  above  equation  is  a  special  case  of  (1),  we  check 
the  possibility  of  reducing  (1)  into  (11)  by  a  suitable  transformation.  This  may  be 
possible  for  specific  parametric  choices. 
First  we  write 

x  =  uV  (12) 

and  find  V  so  that  no  first  derivative  in  u  appears.  We  obtain  V  =  V0e~d/2t.  For 
convenience  we  choose  V0  =  1.  Equation  (1)  now  becomes 

u  +  (A-  d2/4)u  +  Bedl2tu2  +  Ce~dtu3  =  0.  (13) 
Then  we  use  the  transformation  linear  in  W  of  the  form 

u  =  a.eat  +  $W(Z}eg\  (14a) 

Z  =  beft.  (14b) 

When  (14a)  is  substituted  in  (13)  exponential  time  dependence  in  the  second  term  of 
(14a)  gives  a  first  derivative  in  W.  To  eliminate  the  first  derivative  of  W  the  transfor- 
mation (14b)  is  introduced.  Now  we  get  a  differential  equation  in  W.  To  determine  the 
constants  of  (14),  we  substitute  (14)  in  (13)  and  equate  the  coefficients  of  W°,  W2  and 
dW/dZ  to  zero.  In  the  reduced  differential  equation  all  the  terms  contain  exponential 
time  dependence  except  the  coefficient  of  W.  So  we  equate  the  coefficient  of  W  to  zero. 
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Further  when  f=-2g  the  exponential  time  dependent  terms  disappear  and  we  obt 

d2W/dZ2=W\ 
The  constants  in  (14)  are  found  to  be 

and  further  we  require 

d2  =-9,4/4,     C  =  2B2/9A. 

We  note  that  the  conditions  given  by  (16b)  are  the 
P-analysis  (clarify  eq.  (9)).  More  precisely 


=  (2d2/3B)(l 

(17 

(171 
(16b)  reduces  to  (15)  which  is  (11)  when  A  =  0 


, 

where  c0  is  an  integration  constant.  We  consider  two  cases  in  which  c  -  0  pnH  ^ 
separately.  For  CQ  *  0  with  W(0)  -  M^0  =  (-  a^)1/^  and  d  WM7\  V  f  ,  C°  *  ' 
of  (15)  exist  only  for  c0  <  0  and  are  given  by  7  'Z=°  =  °  ^  S°Iuti°n 

W=W0/cn(W0v;k),     k2  =  l/2,     v  =  Z~Z0,  (19 

where  Z0  is  a  second  integration  constant  so  that  the  solution  of  (1)  becomes 
x(t)  =  (2d2/3B)tl  +  WQ  exp(-dt/3)/cn(WQZ;  k)l 


0. 
The  solution  (20)  for  d  >  0  exponentially  decays  to  the  stable  fixed  nnint  t     *  ,  •     u 

r^r^^^^ 

this  case,  --iJm-ud^te^C^a^^ 

x(t)  =  (2d2/3B)[l  ±  l/(l 
where  cx  is  the  integration  constant. 
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Abstract.  A  ^-deformed,  y-Hermitian  kinetic  energy  operator  is  realised  and  hence  a  q- 
Schrodinger  equation  (<?-SE)  is  obtained.  The  q-SE  for  a  particle  confined  in  an  infinite  potential 
box  is  solved  and  the  energy  spectrum  is  found  to  have  an  upper  bound. 
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PACS  Nos    01-55;  02-90;  03-65 


1.  Introduction 

The  study  of  g-deformations  of  Lie  algebras  and  ordinary  calculus  has  grown  into 
a  major  area  of  research  in  mathematical  physics.  The  g-deformed  calculus,  when 
applied  to  quantum  mechanics  with  its  fundamental  postulates  preserved,  gives  rise  to 
g-quantum  mechanics.  Several  years  ago  Janussis  et  al  [1, 2]  discussed  ^-quantization 
and  the  eigenvalue  problem  of  ^-differential  operators.  Quantum  mechanics  is  usually 
deformed  in  two  different  ways:  one  may  either  replace  the  canonical  commutation 
relation  by  a  ^-commutation  relation  or  the  momentum  operator  in  the  Schrodinger 
equation  (SE)  by  a  g-deformed  one.  Most  of  the  work  in  this  area  is  based  on  the  former 
approach  [3-5].  Minahan  [6]  has  considered  a  g-extension  of  SE  (g-SE)  and  obtained 
the  spectrum  of  ^-harmonic  oscillator.  The  energy  spectrum  of  a  g-analog  of  hydrogen 
atom  has  been  obtained  by  Yang  and  Xu  [7].  In  this  paper,  following  the  g-SE  method, 
we  determine  the  spectrum  of  a  particle  in  a  one  dimensional  infinite  potential  well. 

The  infinite  potential  well  is  characterized  by  a  constant  potential  in  a  finite  region 
outside  which  the  potential  is  infinite.  Particles  subject  to  such  a  potential  are  trapped 
inside  the  constant  potential  region.  This  model  potential  has  been  used  in  the  free 
electron  model  of  metals.  Study  of  any  problem  in  g-quantum  mechanics  is  of  interest 
not  only  for  pedagogic  reasons  but  also  for  comprehending  the  uniqueness  of  standard 
(g=l)  quantum  mechanics.  One  cannot  also  rule  out  the  possible  existence  of 
g-systems  in  nature. 

2.  Elements  of  ^-calculus 

The  mathematical  idea  of  g-deformation  has  deep  roots  running  down  to  the  middle  of 

of 


G  Vinod  et  al 
As  q  -»  1,  [n]  ->«.  The  g-factorial  is 


The  ^-difference  operator  Dx  is  defined  by 
f(qx)-f(x) 


If  q  -»  1,  then  the  ordinary  derivative  d/(x)/dx  is  obtained,  if  it  exists.  The  prodt 
and  quotient  rule  for  ^-difference  operators  are 

Dx{u(x)v(x)}  =  [v(x)Dxu(x)  +  u(qx)Dxv(x)} 

v(x)Dxu(x)-u(x)Dxv(x) 

Dx{u(x)/v(x)}  = — r-r-. • 

xt  ;  v(qx)v(x) 

The  g-analogue  of  integration  in  the  case  of  finite  limits  a,  b  is  defined  as: 


)-a  £  <?W«)    - 

-Q  r  =  0  J 

The  product  rule  for  ^-integration  is 

Sv(x)Dxu(x)d(qx)  =  u(x)v(x)  -  Su(qx)Dxv(x)d(qx). 

The  ^-analogues  of  exponential  function  and  trigonometric  functions  have  als 
constructed 

co        v« 
.    .  ^-—i       -^* 

Q\     '  Zj     r.^H  I 


singx  =  -  (Eq(ix)  -  Eq(-ix)} 


cosqx  =  -{Eq(ix 


3.  <jr-quantum  mechanics 

Even  though  it  is  customary  [8]  to  take  the  ^-commutation  relation 
Dxx-qxDx=l 

as  the  starting  point  of  ^-quantum  mechanics,  the  Dx  operator  is  neither  ^f- 
nor  g-skew  Hermitian  (Appendix  1).  Therefore  it  is  not  possible  to  define  a  simj 
straightforward  realisation  of  the  momentum  operator  which  is  g-Hermitian.  H< 
this  difficulty  can  be  circumvented  in  an  approach  based  on  the  g-Schrodinger  eq 
which  is  written  as 

H,^  =  V, 
where  the  ^-deformed  Hamiltonian  Hq  is  the  sum  of  a  q-deformed  kinetic 
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operator  and  a  potential  energy  operator.  Since  in  one  dimension  the  kinetic  energy 
operator  contains  d2/dx2  in  the  undeformed  (q  =  1)  SE,  we  expect  the  ^-deformed 
kinetic  energy  operator  to  contain  D2.  Besides,  we  demand  that  Hq  is  a  q-Hermitian 
operator  (Appendix  1).  The  g-adjoint  operation  is  defined  as 


(12) 
where  the  g-integration  is  over  the  q-line  segment.  Using  (12)  we  can  prove  that 

where  the  action  of  qxix  is  given  by 

So  D2q~x''*  serves  as  a  q-hermitian  operator  (Appendix  1).  We  rewrite  the  q-SE  as 

^-  Djq-"*  +  V(x)|  <//,(*)  =  Eqil/,(x)  (14) 

where  Eq  is  a  q-deformed  energy  eigenvalue. 

In  q-deformed  space  we  can  take  d(qx)  as  the  elementary  volume,  or  in  the  one 
dimensional  problem  d(qx)  serves  as  the  elementary  length.  So  |  t/^(x)  | 2  is  the  probabil- 
ity that  a  measurement  performed  on  the  system  will  locate  it  in  the  element  d(qx)  of  the 
q-line.  Therefore,  if  we  q-integrate  \^q(x)\2  over  the  q-line  segment  of  finite  length,  we 
will  get  unity 

S\*l/q(x)\2d(qx)=l.  (15) 

This  probability  interpretation  demands  i/^(x)  to  be  single  valued  and  finite  every- 
where, as  in  the  standard  case.  Also  \l/q (x)  and  Dxi(/q(x)  should  vanish  at  endpoints  of  the 
q-line  segment. 

4.  The  particle  in  a  box 

In  this  problem, 

V(x)  =  0  for  0  ^  x  ^  a, 

=  oo  otherwise.  (16) 

We  can  write  the  q-SE  for  this  potential  as 

(D2q~x'"x  +  /c2)i//q(x)  =  0  (17) 

where 

2mE 


Solution  of  the  <j-SE  is 
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N  is  the  normalization  constant.  Since  i//9(0)  =  \l/q(a)  -  0,  the  admissible  solutions 
are  those  which  satisfy 


v  (-i)Y(r+2)(M2r+1    n 


Only  numerical  solutions  of  this  equation  are  possible.  From  (18),  the  energy 
values  are  given  by 


Q       2ma2 
The  ^-normalization  constant  N  is  evaluated  from 


which  follows  from  the  assumption  that  the  particle  is  confined  between  x 
x  =  a.  Let  i/'*)/',  be  expressed  as 


for  even  values  of  r, 

(r-2)/2 


and  for  odd  values  of  r, 

+  2)  -(r  - 


Since  the  series  expansion  of\j/*(kx)i{/q(kx)  is  convergent,  we  may  ^-integrate  eac 
in  the  expansion  (24)  employing  the  identity 


which  gives 


«    ^v^_ 

"^'prrij-1 


or, 

-1/2 


where  Cn  are  the  solutions  of  ka  given  by  (20). 

5.  Analytic  solutions  for  q  «  1 

Analytic  solutions  exist  when  q  is  close  to  unity.  Let  us  take  q  =  1  —  6,  6  being 


[n]  =  n{l-(n-  1)6/2} 

[«]!=«!{l-n(n-l)<5/2}. 
The  wavefunction  is  approximated  by 


r  =  0 


The  coefficients  br  take  the  form 

9y(r-2)/2  A 
•£•£-'«  — r\         ii,  c. 


2-r(2r+l) 


for  even  r 
for  odd  r 


(26) 

(27) 

(28) 


(r-l)(2r 


where  Ar  s  is  given  by 

1 
r's     (2s+l)!(2r-2s-l)! 

6.  Numerical  results  and  discussion 

Here  we  have  deformed  the  SE  by  giving  a  nonstandard  realization  of  the  kinetic  energy 
operator,  which  coincides  with  the  corresponding  operator  in  the  standard  SE  when 


40.00 


30.00  - 
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Figure  1.    Variation  of  | kqa]  (corresponding  to  the  ground  state)  with  q. 
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Table  1.    Allowed  values  of  ±  kaa. 


0-1 

35-11869 

0-2 

13-91709 

69-87712 

0-3 

8-564421 

28-97956 

0-4 

6-335347 

16-46896 

41-17533 

0-5 

5-319289 

10-92827 

22-82981 

45-21165 

0-6 

4-822798 

8-039686 

15-5551 

24-57137 

0-7 

4-044871 

7-795353 

11-70577 

15-03165 

0-8 

3-770396 

7-872589 

13-08165 

14-74317 

0-9 

3-394796 

6-803214 

8-21286 

q-*\.  Numerical  solutions  of  the  q-SE  for  a  particle  in  a  box  are  obtained  for 
ranging  from  0  to  1.  It  is  observed  that  for  lower  values  of  q,  higher  energy  \e\ 
forbidden.  This  is  due  to  the  rapidly  converging  nature  of  the  wavefunction  fo 
values  of  q.  The  numerical  solutions  are  tabulated  and  the  solution  corresponi 
the  ground  state  of  the  system  is  plotted  against  q  which  is  best  fitted  for  a  poly 
in  q.  The  numerical  solutions  show  that  for  q  ^  1,  the  energy  eigenvalues  have  ar 
bound  even  in  systems  which  possess  an  infinite  number  of  energy  eigenvalue 
<7=1. 


Appendix  1 

As  the  problem  discussed  in  the  text  is  a  one  dimensional  one,  all  ^-integrations  a 
a  segment  of  finite  length.  The  g-adjoint  operation  is  defined  as 


where  the  functions  $ 
segment. 
So  a  <j-Hermitian  operator  D  satisfies 


and  \j/q(x)  are  assumed  to  vanish  at  the  end  points 


=  S(W>q(x))*il,q(x)d(qx). 
A  g-Hermitian  operator  has  the  following  properties: 

(i)  The  eigenvalues  of  a  g-Hermitian  operator  are  real. 
Let  Cl\l/q(x)  =  coij/(x)  from  (A2), 


which  implies  co*  =  CD. 

(ii)  The  eigenvectors  of  a  <?-Hermitian  operator  belonging  to  different  eigenvah 

^-orthogonal. 

Suppose  that  \}/ql(x)  and  \jjq2(x)  are  two  eigenvectors  of  Q  with  eigenvalues  o^  a 
respectively. 


or, 

(cOi  —  (D2}S\ljql(x}\ljq2(x]d(qx)  =  0  (g-orthogonality). 

Here  we  wish  to  show  that  D*q~*11*  is  g-Hermitian: 


where  we  have  employed  the  product  rule  for  ^-integration  given  by  (7).  Similarly, 


q~xS*  is  a  g-Hermitian  operator. 
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Abstract.  Following  the  recent  work  of  Chandler  et  al  on  quasi  probability  distributions  for 
spin-1/2  particles,  we  show  that  polarized  light  can  be  interpreted  in  terms  of  trivariate 
probability  distributions  in  two  different  ways  by  choosing  the  variates  to  correspond  to  (i)  the 
co-ordinates  on  the  Poincare  sphere,  (ii)  the  components  of  the  spin  operator  of  the  photon.  In 
either  case,  it  is  shown  that  the  Margenau-Hill  procedure  leads  to  probability  mass  functions 
while  the  Wigner-Weyl  approach  leads  to  probability  density  functions  and  the  well-known 
Stokes  parameters  are  also  realised  as  appropriate  averages  with  respect  to  these  distribution 
functions. 

Keywords.    Density  matrix;  polarization;  Stokes  parameters;  quasi-probability  distributions. 
PACS  Nos    14-80;  03-65;  02-50;  07-60 


1.  Introduction 

The  subject  of  polarization  of  light  [1]  is  an  ancient  one  going  back,  according  to  Born 
and  Wolf  [2],  to  Huygens  [3]  of  the  seventeenth  century.  The  state  of  polarization  was 
characterized  by  Stokes  [4]  in  terms  of  a  set  of  four  parameters  S0,sl5s2,s3,  named 
after  him.  Poincare  [5]  developed  a  geometrical  representation  for  all  states  of 
polarization  in  terms  of  points  on  a  sphere  known  by  his  name.  It  is  well-known  [6]  that 
photons,  the  quanta  associated  with  light,  are  spin-1  particles  which  are  massless  and 
hence  deprived  of  the  |10>  or  longitudinal  state.  The  review  articles  by  Fano  [7]  and 
McMaster  [8]  show  how  a  partially  polarized  state  of  light  can  be  represented  in  terms 
of  a  2  x  2  hermitian  and  positive  semidefinite  matrix  p  employing  the  concept  of  the 
density  matrix  [9].  What  is  the  composition  in  a  partially  polarized  beam  of  light?  If 
p  has  eigen  values  At  and  X2  with  corresponding  eigen  states  \e.l  >  and  |e2  >  respectively, 
one  can  say  that  the  beam  consists  of  N^.l  photons  with  polarization  \el }  and  AT/12 
photons  with  polarization  |£2>,  where  N  denotes  the  total  number  of  photons  which  is 
proportional  to  the  Stokes  parameter  s0^(si  +S2  +  SIY12  representing  the  total 
intensity  /  of  the  beam.  The  equality  corresponds  to  completely  polarized  or  pure  states 
characterized  by  one  of  the  eigenvalues  being  zero.  The  recent  work  of  Chandler  et  al 
[10]  on  spin-1/2  particles  shows  that  one  can  interpret  the  contents  of  a  2  x  2  density 
matrix  for  spin-1/2  particles  in  probabilistic  ways  based  on  the  correspondence  rules  of 
Margenau  and  Hill  [11]  on  one  hand  and  Weyl  [12]  and  Wigner  [13]  on  the  other. 
More  recently  Usha  Devi  et  al  [14]  have  adduced  arguments  following  Margenau  and 
Hill  to  show  that  an  aligned  spin-1  system  can  be  viewed  in  terms  of  bivariate 
probability  distribution  by  identifying  the  associated  characteristic  function.  It  is 
interesting  to  note  that  a  beam  of  photons  is  akin  to  an  aligned  spin-1  system  when  the 


Stokes  parameter  s3  is  zero.  It  is  interesting  to  investigate  whether  a  joint  probability 
distribution  interpretation  can  be  given  to  an  arbitrarily  polarized  beam  oflight.  In  this 
paper,  we  extend  the  Margenau-Hill  procedure  to  provide  a  probabilistic  interpreta- 
tion for  partially  polarized  light,  in  terms  of  a  trivariate  probability  distribution  even  in 
the  general  case  when  s3  ^  0.  We  find,  in  fact,  that  there  are  two  convenient  ways  in 
which  this  extension  can  be  carried  out  viz.,  using  the  operators  representing 

(i)  the  co-ordinates  on  the  Poincare  sphere 
(ii)  the  components  of  the  spin  of  the  photon 

as  the  variates.  This  leads  to  a  probability  mass  function  where  the  variates  assume 
discrete  values.  On  the  other  hand  a  probability  density  function  can  be  obtained  using 
the  Wigner-Weyl  correspondence  rule  where  once  again  the  variates  may  be  chosen  in 
two  alternative  ways  (i)  and  (ii). 

2.  States  of  polarization  and  the  density  matrix  for  photons 

Our  notations  are  as  follows:  Using  a  right  handed  co-ordinate  system  where  the  z-axis 
is  chosen  parallel  to  the  momentum  vector  k  of  the  photon,  all  possible  states  of 
polarization  can  be  represented  by  [15] 

[ea/3  >  =  (cos  jS  cos  a  -  i  sin  j8  sin  a)  \ex  >  +  (cos  ft  sin  a  +  i  sin  /J  cos  a)  |  e^  >      (1) 

where  |ex),  |ey>  denote  linear  states  of  polarization  along  x,  y  axes  respectively.  Here 
0  ^  a  <  n  and  —  rc/4  ^  /?  s$  n/4  are  angles  such  that  0  =  n/2  —  2(1  and  0  =  2a  designate 
points  on  the  Poincare  sphere  where  the  north  pole  corresponds  to  right  circular 
polarization  and  the  south  pole  corresponds  to  left  circular  polarization.  Clearly,  |£a/)> 
and  |fia+]t/2-0>  are  orthogonal  states  in  a  two  dimensional  complex  vector  space  and 
correspond  geometrically  to  diametrically  opposite  points  on  the  Poincare  sphere.  We 
choose  the  circular  polarisation  states 


(2) 

as  our  basis  with  respect  to  which  the  density  matrix  p  assumes  the  form 

P  =  ^[l+<rP]  (3) 

where  c^  ,  <72,  °3  denote  the  standard  Pauli  matrices,  /  =  Tr  p  =  s0  denotes  the  intensity 
of  the  light  beam.  Moreover, 

(4) 


where  x',  y'  denote  linearly  polarized  states  with  jff  =  0  and  a  =  rc/4  and  3rc/4  respectively. 
Note  that  s^s^Sj  are  expectation  values  of  the  operators  <7l5o-2,cr3  respectively  and 
hence  we  could  identify  o-ls  (72,  <r3  as  operators  representing  the  co-ordinates  of  a  point 
on  the  Poincare  sphere.  On  the  other  hand,  considering  photon  as  a  spin-1  particle,  the 
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density  matrix  can  be  expressed  in  terms  of  the  Fano  statistical  tensors  [7]  tk  through 


Trp  f     f    , 

~y~  ~  _ 


(5) 


where  T*(J)  are  spherical  tensor  operators  constructed  out  of  the  spin  operators  J  of  the 
photon  and  are  normalized  such  that 

<lm'|Tj(J)|  lm>  =  c(l/cl;  mqm')^/2k  +  1  (6) 

where  c(l/cl;  mqm')  are  Clebsch-Gordan  coefficients.  An  explicit  3x3  matrix  form  for 
p  is  given  by  (1)  of  [16].  Observing  that  the  basis  states  |lm>  are  such  that 
|11>  =  —  \sRy  and  |1  —  1>  =  |£L>  leads  to  the  identification, 


(7) 


3Y/2 


!/       2  So 
r^-fi^o. 

Clearly  the  system  is  aligned  [17]  if  s3  =  0  and  an  unitary  transformation 


e~i0'2  0  0 
0  1  0 
0  0  e+i6 


\ 


(8) 


with  tan  B  —  s2/sl  takes  the  photon  density  matrix  into  its  principal  axes  of  alignment 
frame  (PAAF)  [17], 

/  c.  O       /c2J-c2W2\ 


- 

PPAAF  ~~ 


0 


\ 


0 

S0 


(9) 


/ 


The  conclusions  drawn  by  Usha  Devi  et  al  [14]  apply  in  toto  here,  together  with  the 
identification, 


s0 


(10) 


y=5o 
n  =o 


!>••«»•«»•«         T 


ioo« 
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and  we  can  explain  the  polarization  in  terms  of  the  bivariate  joint  probabilit: 
with  any  two  of  the  spin  components  of  the  photon  as  variates.  To  describe  tf 
general  situation  i.e.,  when  s3  ^  0  we  need  a  trivariate  probability  distribution  w 
have  to  treat  all  the  three  components  of  photon  spin  as  variates. 

3.  Margenau-Hill  probabilities  for  photons 

The  Margenau-Hill  trivariate  characteristic  function  for  three  non-commutin 
ators  X1,X2,X3  is  defined  by, 


+  exp(z'j£2/2)exp(zT1/1)exp(zT3/3) 
+  exp(zj£3/3)exp(zT1/1)exp(zT,/2) 
+  exp(z'j£3/3)exp(z'j£2/2)exp(L£1/1)}]. 
Choosing  X1  ,X2,X3iobtol,ff2,a3  and  p  as  given  in  (3),  we  can  simplify  (1 1)  to 

$MH  (A  ^2  >^3  )  ~  S0  COS  J  1  COS  ^2  COS  73  +  IS  t  sin  7  !  COS  1 2  COS  /3 

+  is2  cos  / 1  sin  1 2  cos  /3  -f-  zs3  cos  / 1  cos  I2  sin  /3 . 
Simplifying  (12)  further,  ^MH^H^^S)  could  be  cast  into  the  form 

$81(^2  ^3)  =  Z  /MH(mi'W2>W3)eXP('(w1/1+W2/2+l 

Wi,'«2,m3=  —  1,+  1 

where  /^(ml3  m2,w3)  could  be  identified  as  the  Margenau-Hill  trivariate  prot 
mass  function  for  photons  and  is  given  by 

/MHK^2»w3)  =  -[s0  +  m151+m2s2+m3s3]. 
o 

m  j ,  m2 ,  m3  are  the  classical  random  variables  representing  the  operators  o  l ,  o2 , 
the  Stokes  parameters  could  be  identified  as  first  moments  of  the  distri 
/MHK,w2,m3)i.e., 

sk  =  Tr(p<rk)=          £  wfc/^(m1,m2,m3),    k  =  1,2,3. 

/Wi»/tt2tWia=s  ~"l,~l"  1 

On  the  other  hand  Xl,X2tX3  in  (11)  could  be  chosen  as  J15J2,  J3,  the  photc 
components  and  the  density  matrix  p  is  then  given  by  (5).  To  simplify  the  charac 
function  ^i2,(/! ,  /2 ,  /3 )  we  follow  the  approach  outlined  in  [14]  where  we  ident: 
the  unitary  matrices 


1        Jl          1 


/ 


1 


0 
2       -1 


UA.pv1''  i1  ]./  —  '•'I  v-AjJv1''  i  1  i  ;  ^  1  3  ^Ay^u/  2  *  2>  —  ^  2OArH1*'  22/^2  \L  '  I 

in  (1  1)  where  Jj  and  Jd2  are  diagonal  matrices  with  diagonal  elements  1,  0,  —  1.  We  can 
now  simplify  the  characteristic  function  further  to  yield 


where 

/a(m1,m2,m3)  =  i 


mi,»t2,m3  =  —  1,0,4-  1 

x  expO'Or^/j  +m2/2  +m3/3))  (18) 


7t/l)«aMl(^2)m,*3(C7l)mtm,]-(19) 

Explicitly  it  may  be  seen  that  /^(mj ,  m2 ,  m3)  assumes  the  simple  form 

1 

if  m1,w2,w3  =  ±  1; 

/MnK  >m2>ms)  =  —  (m1m252  +  3m^[s0  -(mj  -  m^sj  -  2m3s3)        (21) 

if  one  of  the  variables  mt ,  m2,  m3  assumes  the  value  0  and 

1 

/w)     t  \  _«  /T^\ 

MH^^l  )^2'^3/  ==        T^^3^3  l-^^y 

I/ 

if  more  than  one  of  m1 ,  m2,  m3  take  the  value  0.  Consequently  the  Stokes  parameters 
could  now  be  realised  as  the  classical  averages, 

i 
Z        /MH(w»i,m2,m3)(2m1m2) 

i 
s,=  _Tr[pJ,]=-         Y        /^(m1,m2,m3)m3  (23) 

3  \~r      3-1  f__-»  J  MH*-       l*2"3/o  ^         ' 

4.  Wigner-Weyl  probabilities  for  photons 

The  Wigner-Weyl  characteristic  function  is  defined  by 

Substituting  X^  —  ffi,X2  =  ff2,X3  =  cr3  and  p  given  by  (3)  in  (24),  we  get 

-I^]  (25)' 

where  /  =  (/^  +  72  4-  /3)1/2- 
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Fourier  transform  of  the  characteristic  function  <Ww(/i,/2,  J3)  yields  the  V 
Weyl  trivariate  probability  density  function  /wjw(x1,x2,x3): 


— 

ooo  *     J 

x  expC-itXi/r  +  x^+Xj/a)). 
Carrying  out  the  integration  over  the  solid  angle  dQ,  =  sin  d/dd/d^/,  we  obta 

r  °° 

d/Jcos/sin/x-P-V,        d/sin/i 

o  Jo 

where  V,  =  (JL,J-tJL.\  x  =  (x2,  +  x22  +  x\}11-2.  We  can  simplify  (27)  furthe 
\dxl  dx2  oxzj 


where 


Id 


Now  the  quantum  mechanical  expectation  values  could  be  obtained  in  te 

J  WWv-^l  >  ^-2 '  ^3/  3£, 


•          /"/(Jl        /  \  I        1      1     11 

Alternatively  we  can  replace  X± ,  X2,  X3  in  (20)  by  J1  ,J2,J3  in  which  case 
We  now  express 


where  R(a,  /?,  7)  denote  rotations  through  Euler  angles  a,  /?,  y.  Making  use  of  (5) 
and  the  transformation  properties  of  the  spherical  tensor  operators  r*(J) 
rotations  viz., 


where  D*  denotes  the  standard  (2k  +  1)  dimensional  irreducible  representa 


rotations  [18],  we  obtain 

^(J)    (r      r     r  \_  L  3s3/3        Y 

9'W\V\11->12'13)~   \  S0  i^T^          2-, 

L  z    l    i»=- i.o.i 


mexp(im/) 


x        £       (3m2-2)exp(/m 

m=-l,0,l 


/m/)  . 

J 


(34) 


Fourier  transform  of  ^ww^i^a^s)  could  now  be  carried  out  by  generalising 
the  techniques  used  in  (26)  to  (29)  to  yield  the  Wigner-Weyl  probability  density 
function, 


i>*2>  *s)  =    SO&Q(X)  - 


where 


(35) 


(x)  =--—\\-T- 

ov  ' 


dx 


<5(x  —  m) 


<F  /„>_    !    y/3m2  np  d  f1  d  /e(m-x) 

tx^  2  ^-^J  —  ^T~j        /     \3ffl  -L^  I          ~       ^          "       I 

•         /A.TT  ^^  lYnYiYnVV  Y 

*•"»(•     n,  -*  VJA.     I   ^V  VJ.A.    \  A 


Here  e(m  —  x)  denotes  the  step  function- 

I00   e1'0'  l     if  a>0 


Making  use  of  the  Weyl  correspondence  rule 

1      2      3  "~"^      1123/ 


(36) 


(37) 


(38) 


where  S  stands  for  the  symmetrizer,  the  Stokes  parameters  could  be  realised  as  the 
classical  averages, 


-x?)d3x 


«/-(J)    CY      Y      Y   W   H3Y 
7  WW\A1'  A2'  A3/A3U    A 
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1.  Introduction 

Recently,  in  a  short  paper  by  Salarnin  [1],  an  attempt  was  made  to  solve  the  Dirac 
equation  with  anomalous  magnetic  moment  in  the  presence  of  a  plane  electromagnetic 
wave.  Apart  from  the  fact  that  the  exact  nature  of  the  electromagnetic  field  is  not  stated 
clearly,  the  solution  presented  in  the  paper  does  not  satisfy  the  equation  which  is 
proposed  to  be  solved.  The  object  of  the  present  paper  is  to  investigate  the  same 
problem  for  a  general  class  of  electromagnetic  field  considered  in  [1]. 
The  field  we  consider  is  a  function  of  a  single  variable 

e  =  R»x^  =  Rx,    xfl  =  (x°  =  cr,  r). 

We  use  the  notation  ab  =  cfb^  for  any  two  vectors  a  and  b.  The  Lorentz  potential  for 
the  external  electromagnetic  field  is  A*  =  A**(Q).  For  the  wave  field, 

RR  =  R*Rll  =  Q.  (1) 

So  that,  without  any  loss  of  generality,  with  the  choice  of  the  gauge  and  the  Lorentz 
condition,  , 

&4"         dA 


In  general  the  electric  and  magnetic  fields  in  these  cases  are  always  in  the  plane  normal 
to  the  fixed  direction  R/jR0,  and  the  most  general  form  of  this  class  of  field  may  be 
written  as, 


(2) 

J-oo 

where 

erR  =  0,    ere,.  =  (50.    (ij=l,2).  (2') 

In  the  case  of  radiation,  it  is  not  necessarily  monochromatic,  R0  is  proportional  to  the 
frequency.  The  Dirac  equation  without  the  Pauli-term  for  some  special  class  of  fields 


belonging  to  this  category  was  first  solved  by  Volkov  [2]  and  later  by  others  [3- 
the  investigation  of  the  scattering  of  electromagnetic  radiation  and  other  pi 
problems. 

In  the  following  sections  we  proceed  to  construct  the  solution.  The  prob 
reduced  to  the  solution  of  a  second  order  ordinary  differential  equation.  It  rr 
emphasized  that  the  problem,  in  general,  cannot  be  reduced  to  quadrature  w 
further  specification  of  the  nature  of  the  functions  /t(0)  and  f2(9).  The  last  sec 
devoted  to  discussion  of  nature  of  the  solution  for  various  special  cases. 

2.  The  equation  and  the  solution 

The  Dirac  equation  with  the  Pauli-term  may  be  written  as 


0"v  =  (1,  —  1,  —  1,  —  1)  and  e  =  e/hc,  M  =  mc/h,  e  and  m  are  the  charge  and  mass 
particle  and  X  is  the  parameter  depending  on  the  strength  of  the  anomalous  ma, 
moment  of  the  particle.  Since  d/dv,  d/dvl  and  d/dv2,  where  v  =  R0x0  +  R-r  and  t;,. 
commute  with  the  operator  on  *F  (eq.  (3))  they  are  constants.  Hence,  one  can  wri 
solution  in  the  form, 


where  p*  is  an  arbitrary  vector  and  S(0)  is  a  scalar  function  of  6.  Substituting  this 
one  gets, 


By  virtue  of  eq.  (!') 


Again  fron  eqs  (!')  and  (2) 

(RA)  =  0  and  yRyA  +  yAyR  =  0. 
Hence  multiplying  eq.  (5)  by  yR  from  left, 
=  {y(p  +  sA)  -  M}yR®, 


i.e., 

2(pK){y(p  +  sA)  +  M}$  =  {(p  -f  eA)(p  +  eA)  -  M2}yR®. 
We  choose  p  and  S  such  that 


dS 

~  +  2spA  +e2AA  =  Q 

Qu 
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Finally  from  (5)  and  (8), 


o.  (ID 

In  the  absence  of  the  Pauli-term  i.e.,  for  A  =  0,  <D  is  constant  and  the  problem  reduces  to 
that  of  the  usual  Dirac  equation  which  has  been  already  mentioned.  From  (9)  hp  is  the 
free  particle  momentum.  It  is  important  to  note  that  eq.  (10),  hence  the  expression  for 
S  is  independent  of  L 

3.    Special  cases 

Although  eq.  (11)  is  linear,  the  determination  of  $  cannot  be,  in  general,  reduced  to 
quadrature,  because  it  contains  at  least  two  mutually  anti-commuting  matrices.  We 
consider  special  cases: 

a)  One  of  the  functions  ft  and  /2  in  (2)  is  zero,  or  equivalent^  the  ratio  fl  //2  is 
a  constant,  i.e., 


(12) 
In  this  case  the  electric  and  magnetic  fields  E  and  H  are  given  by, 

and 

(/)  If  /  is  a  linear  function  of  9,  the  field  consists  of  constant  mutually  perpendicular 
electric  and  magnetic  field  of  same  magnitude. 

and 

(iz)  If/  is  a  periodic  function,  e.g.,  /  =  F  cos(6  +  6)  the  field  is  due  to  plane  polarized 
radiation,  not  necessarily  monochromatic,  along  a  fixed  direction  R/R0.  Both 
these  types  of  fields  are  important  for  applications  in  physical  problems.  From 
(11),  one  gets  directly 

(14) 


where  <D0  is  a  constant  spinor.  This  case  has  been  discussed  by  Chakrabarti  [6]. 

b]  In  the  general  case  eq.  (11)  cannot  be  directly  integrated.  But  the  solution  can  be 
expressed  in  terms  of  solutions  of  second  order  linear  differential  equations.  We  may 
write 


A  =  F^COS/J  +  e2sin/i) 

where  (15) 

F2=fl+fl    and 
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Let  u±  be  eigenspinors  of /(e1-y)(e2-y)  with  eigenvalues  +  1,  i.e., 
i(erv)(e2-y)u±=  ±«± 

(e2-y)u±  =  +  z(e! "}')"+, 
and 

i(err)w±=«+, 

«+•«+  =  0,       M+-M+  =  1. 

Let 

rRQ  =  g+(S)u+  +  e-(0)u_. 

Substituting  these  in  eq.  (11),  one  is  lead  to  two  simultaneous  first  order 
equations, 


dp 


dp 

where  dp  —  FdB.  It  is  clear  that  g+  and  g_  are  mutually  complex  conjugate  c 
other.  It  follows  from  above  that 


dp2       dp  dp  + 

dp2        dp  dp 

Thus  the  problem  ultimately  reduces  to  solving  the  second  order  linear  diffe 
equation.  It  must  be  emphasized  that  one  has  to  restrict  to  only  those  solutions  > 
which  satisfies  the  first  order  (18).  One  cannot  proceed  further  without  kn 
explicitly  the  nature  of  functions  f1  and  /2.  We  can  consider  some  of  the  par 
cases  of  physical  importance;  namely  the  case  of  circular  and  elliptic  pol 
radiation  field.  For  monochromatic  radiation 


A  =  Cj  alcos(6  +  <5X)  +  e2 
Introducing  suitable  rotation  in  the  plane  of  et  and  e2  one  can  write 
A  =  e'j  a\  cos  6  +  e'2  a'2  sin  6, 

where  e\  ,  e't  are  the  new  mutually  orthogonal  unit  vectors  in  the  same  plane  and  c 
a'2  depend  on  av  ,  a2,  di  and  <52  .  The  radiation  is,  in  general,  elliptically  polarized, 
particular  for  a\  =  a'2  =  a  (say)  the  polarization  is  circular  and  the  expressions 
and  g_  take  the  simple  form, 


where  B+  and  B_  depend  on  A  and  a.  They  are  to  be  so  choosen  that  the 
simultaneous  equation  for  g+  and  g_  (eqs  (18))  are  satisfied. 
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Dime  equation  with  Pauli-term 

Finally,  once  the  solution  of  eq.  (  1  1  )  is  obtained,  (for  the  special  cases  considered  they 
are  given  by  eqs  (14)  and  (20)),  one  can  finally  obtain  O  from  (8).  Thus 


2(pK) 

where  <J>'0  stands  for  the  solution  eq.  (1  1),  so  that 

,  x)  =  e-''(poc+s<0))a>(A  x).  (22) 


It  should  be  noted  that  each  of  the  spinors  u  ±  introduced  by  (  1  8)  is  two  dimensional  and 
M±  arbitrary  unit  spinor  in  the  respective  subspaces.  This  degeneracy  is  removed  by  the 
operator  (yA)  which  also  changes  u+  to  UT.  Again  the  part  (ypj)  of  (yp)  where  pt  is 
a  vector  in  the  plane  perpendicular  to  R,  changes  u±  to  11+  .  The  subspace  is  invariant 
with  respect  to  the  remaining  part  of  yp,  namely  y°p0  —  yp,,  (p,,  along  R),  but  the 
degeneracy  may  be  removed. 

4.     Remarks 

Since,  eq.  (2)  is  linear  the  most  general  solution  is  obtained  by  suitable  superposition  of 
the  solutions  given  by  (22).  The  most  general  solution  is  obtained  in  the  form 

)dp.  (23) 

The  function  K(p)  is  to  be  obtained  by  the  given  initial  and/or  boundary  condition. 
There  is  no  need  of  appeal  for  the  field  to  vanish  at  infinity.  Again,  we  want  to 
emphasize  that  the  radiation  field  need  not  be  monochromatic  but  it  should  be 
unidirectional.  One  can  also  similarly  investigate  the  class  of  fields  for  which  R  is  not 
a  null  vector,  R,R^O.  This  encompases  a  wider  class  of  fields  which  we  hope  to 
consider  in  future. 

It  can  be  easily  checked  in  the  absence  of  the  anomalous  magnetic  moment,  i.e.,  /I  =  0, 
the  solution  given  by  eq.  (22)  reduced  to  known  solution  [3-5].  On  the  other  hand  for 
particles  with  no  charge  but  anomalous  magnetic  moment,  i.e.,  e  =  0  but  A  ^  0,  which 
may  be  of  importance  in  astrophysical  problem  is  obtained  from  (21)  and  (22)  putting 
e  =  0  and  hence  S  =  0. 
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Abstract.  Fritzsch  like  mass  matrices  with  non-zero  22-elements  both  in  U  sector  and  D  sector 
have  been  investigated  in  the  context  of  latest  data  regarding  mrphys,  j  FjJ,  |  Vcb\,  \  Vtd\  and  |  Vts\. 
Unlike  several  other  phenomenological  models,  the  present  model  not  only  accommodates  the 
value  of  mtphys  in  the  range  150-240  GeV,  encompassing  the  CDF  and  DO  values,  but  is  also  able 
to  reproduce  |  Vcb\  ^  0-040  and  |  Fub/KJ  =  0-08  ±  0-02  and  |  Vtd\  is  predicted  to  lie  in  the  range 
0-005-0-014.  Further,  the  angles  of  the  unitarity  triangle,  related  to  the  CP-violating  asym- 
metries, are  calculated  to  be  in  the  ranges  —  !-0<sin2as$  —  0-1,  0-6<sin2a<  1-0  and 
0-48  <  sin  2/?  ^  0-56,  which  are  in  agreement  with  other  recent  calculations. 

Keywords.  Quark  mass  matrices;  quark  mixing  matrix;  unitarity  triangle;  CP  violation; 
fermion  masses. 
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1.  Introduction 

The  recently  observed  evidence  for  the  top  quark  in  pp  collisions  at  Fermilab  [1,2]  has 
added  another  success  to  the  standard  model's  (SM)  already  long  list  of  achievements. 
This,  however,  is  certainly  going  to  accelerate  the  search  for  discovering  physics  beyond 
the  standard  model.  To  this  end,  a  peep  into  the  physics  beyond  the  SM  is  provided  by 
the  fermion  mass  matrices  and  mixing  angles  which  essentially  enter  as  free  parameters 
in  the  SM.  In  the  absence  of  a  dynamical  theory  for  quark  mass  matrices,  several 
phenomenological  models  [3-14]  have  been  considered  with  fair  degree  of  success.  The 
most  widely  studied  as  well  as  the  most  economical  model  is  that  of  Fritzsch  which, 
unfortunately,  has  been  ruled  out  by  the  data  [1, 2, 15].  The  observation  of  rather  high 
rotphys  =  176  ±  13  GeV  [1]  as  well  as  wphys  =  199  ±  30  GeV  [2]  at  Fermilab  has  further 
complicated  the  situation  regarding  phenomenological  mass  matrices.  In  fact,  rather 
high  value  of  mfhys,  although  in  agreement  with  limits  on  mphys  expected  from  radiative 
corrections  within  the  SM,  has  posed  serious  challenge  to  some  of  the  phenomenologi- 
cal models  for  mass  matrices. 

In  the  absence  of  a  viable  dynamical  theory  for  quark  mass  matrices,  it  is  perhaps 
desirable  to  develop  phenomenological  models  which  are  simple  as  well  as  predictive.  If 
such  models,  while  remaining  in  tune  with  the  data,  are  able  to  generate  simple 
relations  amongst  Cabibbo-Kobayashi-Maskawa  (CKM)  matrix  elements,  then  these 
models  may  provide  vital  clues  for  formulating  a  dynamical  theory.  We  feel  that  in  the 
formulation  of  a  viable  phenomenological  scheme,  ideas  such  as  the  hierarchical 
structure  of  mass  matrices,  relative  smallness  of  CP  violation,  etc.,  may  play  a  vital  role. 

At  present  we  do  not  attempt  to  go  into  the  origin  of  mass  matrices,  rather  try  to  see 
the  implications  of  these  for  the  CKM  matrix  (VCKM)  phenomenology.  In  fact,  we  shall 
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mass  matrices  accommodate  recently  found  value  of  mrphys  along  with  the 
refinements  in  the  measurements  of  FCKM  elements.  Further,  it  would  be  interes 
investigate  the  implications  of  such  mass  matrices  for  hitherto  unknown  CKM 
elements,  |  Vtd\  and  |  Vls\  as  well  as  the  different  angles  of  the  unitarity  triangle  [: 
The  plan  of  the  paper  is  as  follows.  In  §  2  we  present  the  essential  details 
phenomenological  quark  mass  matrices  considered  here.  In  §  3,  we  present  defi 
as  well  as  relations  of  angles  of  the  unitarity  triangle  with  the  asymmetries  in  B  i 
In  §4,  we  present  the  details  of  our  calculations  and  results.  Section  5  summari: 
principal  findings. 

2.  Mass  matrices 

For  the  sake  of  completeness  and  readability  we  reproduce  some  of  the  essential 
of  Gill  and  Gupta  [16].  To  begin  with,  we  consider  hermitian  mass  matrices 
following  form 


M'  = 


0 
A*1 


A1      0 

Df     B1 
B*i    Ci 


[i  =  M,  d], 


where 


and  the  elements  of  M'  are  supposed  to  follow  the  hierarchical  structur 
|^I'|«|B'|«D''<Ci. 
The  above  matrices  M'  can  be  expressed  as 

M'WM'P", 
where  real  matrices  M'  are 


0      IX1' |      0 

\Al\      JO'      |B* 

.  0       \Bl\      C  . 


and 


l  =  diagjl,  exp(  -  ia,-),  exp(-  i(a 


The  matrices  M'  can  be  diagonalized  exactly  by  orthogonal  transformatic 
example, 


where 


j,;    =diag(m1,-m25m3), 


with  subscripts  1,  2,  3  referring  to  u,  c,  t  in  U  sector  and  d,  s,bmD  sector. 
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Using  tr(M'),  tr(M')2,  det(M'),  the  values  of  matrix  elements  A\  B\  C  are  expressed 
in  terms  of  quark  masses  as 


C  =  K  ~m2+m3-  D'),  A1  =  (mr 
Bl  =  (-  (A1)2  +  CD1  +  ml  m2  +  m2m 


23 


m 


The  diagonalising  transformations  0'  can  be  expressed  as 
0'  = 


-Km3/2/A2)1/2     (m.m^/A,)1'2 
(Cm2/2/A2j1/2     (Cm3/3/A3)1/2 


where 


(7) 


(8) 


.-KA/3/A^1/2 

/i  =  m-,  —  m,  —  D1',  A  =  m,  +  m,  —  D1;  /,  =  m-,  —  m, 

•*    *.  J  *•  *  J    £.  j  i  *    4/    j  £,  I 

A,.  =  C(m3  -  ma  )(m2  +  mt );  A2  =  C''(m3  +  m2)(m2 

A,  =  C'(m,  +  m,)(m,-m1).  (9) 

J>vJ^/vjl/  \        / 

To  facilitate  comparison  with  other  similar  approaches  as  well  as  for  better  physical 
understanding  of  the  structure  of  KCKM,  we  present  here  the  approximate  form  of  0". 
For  example,  by  considering  mu«mc<D"<mr  as  well  as  md«ms<Dd<mb,  the 
structure  for  0"  can  be  simplified  and  expressed  as 


-R'112 


(10) 


where 


and 


a2  =  K/mc),  d2  =  K/mt),  Rt  =  D"/mr,  /?|  = 


/v.,  = 


The  matrix  Od  can  be  obtained  simply  by  changing  u-+d,c-+s,t~*b  with  a0  -»•  £>0  and 
d0  ->  c0  where  b2^  =  (md/ms)  and  CQ  =  (mjmb). 
The  mixing  matrix  KCICM  in  terms  of  0"'d  can  be  expressed  as 

FCKM  =  0utPud0d,  (11) 

where 

Using  (10)  and  retaining  terms  of  leading  order,  (1 1)  can  be  simplified  and  written  as 

*ud       'us       *ub 

V          V         V 
v  cd       vcs        vcb 


V          V          V 

vtd        vis        v  tb  J 


g2 


(12) 


Figure  1.    The  CKM  unitarity  triangle  in  the  complex  plane. 
where 


(13) 
(14) 


94=  -02(</>l*>4>2)-  (16> 

The  above  expressions  for  VCKM  are  approximate,  however,  for  the  purpose  of 
calculations,  we  have  employed  exact  expressions. 

3.  Unitarity  triangle 

The  unitarity  of  CKM  matrix  leads  to  six  relations  involving  complex  CKM  matrix 
elements.  These  six  relations  represent  six  triangles  in  the  complex  plane  with  their 
angles  constituting  weak  observable  phases.  Out  of  the  six  triangles,  there  is  one 
triangle  where  all  the  three  angles  are  naturally  large  and  is  expressed  as 


which  is  represented  in  figure  1.  Interestingly,  the  sides  of  this  triangle  correspond  to  the 
decays  Bd-*mz,  Bd->Dii  and  Bd  —  Bd  mixing  represented  by  (Vud  V*b),  (Vtd  Vfb)  and 
(  Vcd  V*b]  respectively.  The  three  angles  of  the  triangle  in  terms  of  CKM  matrix  elements 
are  defined  as  [18] 


*,),  (18) 

arg(-FcdF*/FtdF*J,  (19) 

=  arg(-FudF*/FcdF*).  (20) 


Table  1.  CP  violating  asymmetry  parameter, 
ImA,  for  different  decay  modes  and  its  corres- 
ponding relations  to  angles  of  unitarity  triangle. 


Quark 
subprocess 

Decay  mode           ImA 

b->-iiud 

B°-*n+n~              sin2a 

b-*ccs 

B°-+\ltKs            -sin2/? 

b-+uud 

B°->pKs             —  8^2^ 
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The  CP  violating  asymmetry  parameter  A,  for  the  respective  decays,  is  related  to  the 
angles  of  the  unitarity  triangle  [18],  expressed  in  table  1.  However,  the  relation  of  the 
asymmetry  parameter  for  the  respective  decays  in  terms  of  KCKM  elements  is  given  as 


Vlb  V*  y*,  V          tf  +  (1  -  p-)2]  [fj>  +  p2]  ' 

(21) 


...          -      Y Cd 

and 

T     ,'V*  V..V,,V*.  K.fc  V* 
i  =  Im 


(22) 

(22) 


y*  y*  y    y*  y 

b       (s       cd      cs       ub      ud 

In  (21)-(23)  we  have  also  shown  relations  in  terms  of  modified  Wolfenstein  par- 
ameters (p  =  p(\-  A2/2),  rj~ri(l-  A2/2))  of  CKM  matrix  as  suggested  by  Buras  et  al 
[19]. 

4.  Results  and  discussion 

Before  we  present  our  results,  a  brief  discussion  about  various  inputs  which  have  gone 
into  the  analysis  is  perhaps  in  order.  As  a  first  step,  we  have  considered  quark  masses  at 
IGeV  [20],  for  example,  mu  =  0-0051  ±0-00 15  GeV,  md  =  0-0089  ±  0-0026  GeV, 
ms  =  0-175  ±  0-055  GeV,mc  =  1-35  ±  0-05  GeV,mb  =  5-3  +  0-1  GeV.  Further,  in  order  to 
simplify  the  analysis,  we  have  fixed  the  two  phases,  viz.,  (j)l  =  02  =  90°  in  accordance 
with  the  values  considered  elsewhere  [14, 15].  Noting  the  fact  that  the  CKM  matrix 
elements  |  VjJ  and  |  Vcb\  are  well  known  and  have  weak  dependence  on  m3's  and  D"s,  we 
have  restricted  the  parameter  space  by  first  reproducing  |  Vus\  ^  0-22  and  |  Vcb\  ^  0-040 
[18],  ignoring  the  spread  in  the  values  of  |  VjJ  as  the  calculated  quantities  hardly  show 
any  dependence  on  these.  After  having  fixed  the  values  of  |  VjJ  and  \Veb\,  we  have 
calculated  |  VjJ,  |  VJ,  |  VJ  and  other  phenomenological  quantities  related  to  VCKM,  for 
different  values  of  Rt  and  mt  (1  GeV),  however,  for  the  purpose  of  discussion  we  have 
converted  mr  (1  GeV)  values  into  mfhys  values.  The  values  of  Rb  are  constrained  by  the 
relation, 

I  VJ  =  (R'b(l  -  Rt))il2  -  (R't(l  -  R»))1/2.  (24) 

In  order  to  have  a  better  appreciation  of  the  significance  of  our  results,  we  first 
discuss  our  results  when  (i)  Rt  ^  0,  Rb  =  0  (ii)  Rb  ^  0,  Rt  =  0,  corresponding  to  22- 
element  being  nonzero  in  U  and  D  sectors  respectively.  Keeping  in  mind  that  VCKM  can 
be  characterized  by  any  of  the  four  elements  [17],  in  table  2  we  have  presented  our 
results  pertaining  to  |  Vub\  and  |  VjJ.  This  is  primarily  to  examine  whether  in  the  above 
mentioned  cases  the  present  form  of  mass  matrices  can  accommodate  mfhys  in  the  range 
150-240 GeV  and  the  latest  data  regarding  Rub(  =  \Vub/Vcb\)  which  is  in  the  range 
0-06-0-10  [18].  From  table  2,  it  is  evident  that  0-07  is  the  maximum  value  of  Rub  for 
m,phys  in  the  range  150-240  GeV  in  agreement  with  the  conclusions  of  other  similar 
calculations  [14, 15].  Further,  one  can  easily  check  that  the  above  value  of  Rub  cannot 
be  increased  beyond  0-07  by  any  variation  of  Rt  (or  Rb\  mf hys,  ^l ,  02  and  otner  quark 
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Table  2.  Calculated  values  of  \Veb{,  Rub  and  |  Vtd \  when  (i)  Rb  =  0,  Rt  ^  0  and  (ii)  Rt  =  0  and  Rh  ^  0 
for  certain  representative  values  of  the  sets  (JR,,mrphys)  and  (l?fc,mfhys),  m*hys  in  GeV  units.  In 
ascending  order,  the  corresponding  mt  (1  GeV)  values  in  GeV  units  are  250,  300,  350  and  400. 
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(-0-030,150) 

0-036 

0-07 

0-008 

(0-012,150) 

0-043 

0-06 

0-011 

(-0-020,150) 

0-037 

0-06 

0-009 

(0-014,150) 

0-036 

0-06 

0-009 

(-0-017,150) 

0-045 

0-06 

0-011 

(0-042,150) 

0-041 

0-07 

0-008 

(-0-020,180) 

0-038 

0-06 

0-010 

(0-013,  180) 

0-039 

0-06 

0-010 

(-0-031,210) 

0-042 

0-07 

0-009 

(0-041,  180) 

0-039 

0-07 

0-007 

(-0-021,210) 

0-039 

0-06 

0-009 

(0-013,210) 

0-045 

0-06 

0-011 

(-0-020,210) 

0-043 

0-06 

0-010 

(0-015,210) 

0-038 

0-06 

0-009 

(-0-031,240) 

0-038 

0-06 

0-008 

(0-044,240) 

0-045 

0-07 

0-008 

(-0-023,240) 

0-035 

0-07 

0-008 

(0-016,240) 

0-036 

0-07 

0-009 

(-0-021,240) 

0-045 

0-07 

0-010 

(0-044,240) 

0-043 

0-07 

0-008 

masses.  Therefore,  when  Rb  =  0  (or  Rt  =  0),  it  is  clear  that  it  is  not  possible  to  achieve 
simultaneously  wfhys>  150  GeV  as  well  as  Ruh  in  the  entire  range  0-06-0-10.  From 
a  cursory  look  at  table  2,  it  is  clear  that  the  value  of  |  Vtd\,  for  the  considered  range  of 
parameters,  is  predicted  in  a  narrow  range  of  0-008-0-01 1,  therefore,  a  precise  measure- 
ment of  |  Vtd\  will  have  an  important  consequence  for  the  mass  matrices  wherein  either 
Rb  =  0  or  R,  =  0. 

In  tables  3  (a,  b)  we  have  presented  the  results  of  calculations  where  in  general  Rb  ^  0 
for  different  Rt  values.  For  the  sake  of  uniformity,  we  have  presented  in  the  tables  the 
ratios  of  FCKM  matrix  elements,  e.g.,  Rub,  Rtd  =  \  VJVcb\,  Rts  =  |  VJVcb\.  A  general  survey 
of  the  tables  brings  out  easily  that  we  are  able  to  obtain  Rub  from  0-06-0- 10  by  varying 
Rt  for  various  values  of  mfhys.  This  can  also  be  checked  from  the  expressions  of  |  Vub  \  and 
|  Vcb\.  An  important  prediction  of  the  model  is  that  |  Vts\  <  \  Vcb\  for  the  entire  range  of 
wphys  an(j  j^  -j^-g  jg  jn  accorciance  with  expectations  from  the  unitarity  of  VCKM. 
Similarly  the  results  of  |  Vtd\  can  encompass  the  presently  expected  range  [18].  Coming 
to  the  angles  of  unitarity  triangle,  a  and  /3,  we  find  that  the  present  values  are  in 
accordance  with  similar  calculations  by  other  authors  [19, 21, 22].  In  the  tables  3(a,  b), 
we  have  not  shown  values  of  J-rephasing  invariant  measure  of  CP- violation  [17]. 
However,  one  can  easily  show  that  J,  related  to  the  area  of  the  unitarity  triangle,  lies  in 
the  range  (1-7-2-8)  x  10" 5,  which  is  in  agreement  with  the  data  and  other  similar 
calculations  [14, 15,23]. 

A  closer  scrutiny  of  our  results  reveals  several  interesting  points.  One  finds  that 
tables  3  (a)  correspond  to  positive  values  of  sin2a  whereas  3(b)  correspond  to  the 
negative  values.  This  sign  ambiguity  is  in  fact  reflection  of  the  uncertainty  regarding  the 
quadrant  of  the  phase  of  the  CKM  matrix.  This  is  also  reflected  in  the  fact  that  |  Vtd\  has 
essentially  two  corresponding  ranges,  e.g.,  negative  values  of  sin2a  correspond  to 
1 7, J  < 0-009  whereas  positive  values  correspond  to  \V{d\^  0-009.  This  ambiguity, 
however,  in  the  present  formulation,  arises  from  (24)  wherein  corresponding  to  a  given 
value  of  Rt  there  are  two  values  of  Rb  which  can  reproduce  |  Vcb\.  Therefore,  a  measure- 
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the  present  as  well  as  other  similar  calculations  [19,21, 22],  a  measurement  of  the  decay 
B°  -» il/Ks  would  provide  a  good  test  of  present  form  of  mass  matrices  as  well  as  the 
CKM  mechanism  in  general. 

It  needs  to  be  brought  out  clearly  that  the  present  analysis  is  crucially  dependent  on 
D"  and  Dd.  In  the  KCKM  matrix  elements,  however,  these  manifest  through  Rrand  Rb. 
Such  mass  matrices,  in  the  language  of  Ramond  et  d  [24],  correspond  to  texture 
4  zeros.  Also,  it  seems  that  the  analysis  of  Ramond  et  a/  finds  it  difficult  to  accommo- 
date Vch\  ^  0-04  as  well  as  Ruh  =  0-08  +  0-02  with  texture  5  and  6  zeros  mass  matrices. 
Therefore,  texture  4  zeros  mass  matrices  seem  to  be  essential  for  fitting  the 
FCKM  phenomenology.  It  may  be  of  interest  to  mention  that  such  mass  matrices  have 
been  shown,  by  Joyce  and  Turok  [25],  to  maintain  their  basic  structure  as  they  evolve 
from  GUT  scale  to  the  low  energy  scale. 

5.  Conclusions 

Fritzsch  like  mass  matrices,  with  non-zero  22-elements  in  both  U  and  D  sectors,  have 
been  considered  to  accommodate  current  experimental  constraints  on  w,phys  and  CKM 
matrix  elements.  We  have  shown  that  such  mass  matrices  can  accommodate  CDF  and 
DO  mfphys  values,  e.g.,  176  +  13GeV  and  199  ±  30GeV  respectively,  apart  from  repro- 
ducing \VUS\,  \Vcb\  and  \Vuh/Vcb  =0-08  +  0-02.  Further,  Vtd\  is  predicted  to  He  in  the 
range  0-005-0-014,  in  agreement  with  the  latest  conclusions  of  Buras  [26].  The 
calculated  values  of  the  angles  of  unitarity  triangle,  a  and  /?,  are  also  in  agreement  with 
other  authors  [19,21,22],  however  two  different  ranges  for  these  highlight  the  ambi- 
guity regarding  the  quadrant  of  CP-violating  phase  of  CKM  matrix. 
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Abstract.  Considering  a  CP-violating  QCD  interaction,  the  electric  dipole  moment  of  neutron 
(EDMN)  is  estimated  in  a  quark  model  of  light  mesons  with  a  dynamical  breaking  of  chiral 
symmetry  through  a  non-trivial  vacuum  structure.  Pion  and  kaon,  being  treated  consistently 
within  the  model,  yield  to  the  constituent  quark  wave  functions  as  well  as  the  dynamical  quark 
masses  and  thus  determine  the  constituent  quark  field  operators  with  respect  to  light  quark 
flavors.  Using  the  translationally  invariant  hadronic  states  and  these  constituent  quark  field 
operators,  the  EDMN  estimated  here  remains  well  within  the  recent  experimental  bound  of 
Dn  <  11  x  10~26e-cm  with  the  CP-violation  parameter  \0\  =  10~8,  which  in  fact  accounts  for 
a  strong  CP-violation. 

Keywords.    Strong  CP-violation;  vacuum  structure;  chiral  symmetry  breaking;  quark  model. 
PACS  Nos    14-20;  13-40;  12-38;  11-30 


1.  Introduction 

Though  electric  dipole  moment  of  neutron  (EDMN)  has  been  of  great  interest  to 
physicists  since  its  discovery  [  1  ]  yet  it  has  drawn  much  attention  of  theoreticians  as  well 
as  of  experimentalists  after  the  discovery  of  CP-violation  in  K°  —  K°  system  [2]. 
Amongst  many  theoretical  models  [3]  to  explain  this  CP-violation  when  applied  to 
estimate  EDMN,  the  valency  quark  model  estimates  EDMN  by  adding  the  valency 
quark  contributions.  In  the  standard  model,  though  one  has  a  non-vanishing  contribu- 
tions towards  EDMN  due  to  quarks  still  it  is  very  much  suppressed  [4]  at  the  level  of 
three  generations  [5]  and  with  four  generations  it  does  not  enhance  much.  However, 
when  one  studies  EDMN  in  QCD  it  has  been  observed  that  due  to  instanton  effects  the 
total  divergence  term  has  non-vanishing  physical  effects  [6].  These  in  turn  yield  to 
a  CP-violating  piece  [7]  in  the  QCD  Lagrangian  due  to  an  equivalent  chiral  rotation  in 
the  quark  space  which  we  consider  here  to  describe  the  strong  interaction  between 
quarks  in  the  neutron  and  hence  to  estimate  its  EDMN.  However,  this  being  a  low 
energy  phenomenon,  its  study  requires  a  consideration  of  non-perturbative  regime  of 
QCD  where  one  has  solutions  specifically  in  the  lattice  gauge  theory  [8]  or  in  QCD 
sum  rules  [9].  But  they  have  also  their  own  limitations  such  as  non-availability  of 
better  computer  capabilities  in  case  of  former  and  consideration  of  non-perturbative 
vacuum  structure  through  the  relationship  with  perturbative  QCD  calculations  in  case 
of  latter. 

Recently,  an  alternative  scheme  [10]  for  considering  low  energy  phenomena  has 
been  proposed  where  one  realizes,  using  a  variational  method  [11]  the  existence  of 
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a  nontrivial  vacuum  structure  through  quark  and  gluon  condensates  in  the  non- 
perturbative  regime  [11].  Such  a  non-trivial  vacuum  structure  has  also  yielded  to 
a  dynamical  breaking  of  chiral  symmetry  [12]  through  which  it  has  become  possible  to 
determine  pion  and  kaon  wave  functions  and  subsequently  the  constituent  quark  field 
operators  [13, 14].  The  quark  field  operators  obtained  here  when  applied  to  explain 
electromagnetic  and  weak  properties  of  pion,  kaon  and  nucleon  [13, 14]  and  static 
properties  of  baryons  in  general  [15],  have  shown  a  reasonable  amount  of  success. 
Further,  such  a  formalism  through  the  extremization  of  gap  function  [16]  also 
generated  dynamically  the  quark  masses  [17].  We  may  note  that  the  method  applied 
presently  is  non-perturbative  with  respect  to  the  use  of  the  equal  time  algebra  for  the 
constituent  quark  field  operators  and  was  also  earlier  applied  to  solvable  cases  in  high 
energy  physics  [11, 12, 18, 19]  and  nuclear  physics  [20]  with  a  reasonable  amount  of 
success.  In  view  of  its  success  in  a  wide  ranging  phenomena  in  the  present  investigation 
we  will  be  further  using  these  constituent  quark  field  operators  as  well  as  the  quark 
masses  to  estimate  the  electric  dipole  moment  of  neutron  (EDMN). 

The  present  article  is  organized  as  follows.  In  §  2,  we  briefly  describe  the  vacuum 
structure  and  XSB  used  to  estimate  the  constituent  quark  field  operators  and  fix  up  our 
notations  to  apply  them  in  later  sections.  In  §  3,  we  estimate  EDMN  considering  the 
CP-violating  fundamental  QCD-interaction  Lagrangian  [7,21,22]  whereas  in  §4,  we 
discuss  our  results  and  compare  them  with  the  available  experimental  measurements  as 
well  as  other  model  calculations. 


2.  Chiral  symmetry  breaking  and  vacuum  structure 

Considering  the  chiral  symmetry  breaking  (XSB)  arising  from  gluon  condensates  [12] 
and  including  such  an  effect  one  writes  the  constituent  quark  fields  as  [23] 


(2.1) 


where  the  spinors  are  written  as  [24, 25] 


cos 


and     K(-k)  = 


sin 


cos 


and  the  two  component  quark  fields  q{(k)  and  <j;(k)  are 
kw       and         k  =  ku 


with  ulr  and  vlr  being  explicitly  written  as 
'1\  /(T 


Wr.   = 


Q\        A  (i 

-t     and  "'-*-  o 


(2.2) 


(2.3) 


(2.4) 


The  function  <£(k)  describing  the  spinors  in  (2.2)  is  arbitrary  and  can  be  derived  in 
principle  from  the  extremization  of  the  energy  functional.  The  quark  field  operator  \J/(x) 


in  (2.1),  through  (2.2)-(2.4)  is  found  to  satisfy  the  equal  time  algebra  and  also  becomes 
free  chiral  one  when  </>(k)  =  n/2. 

Further,  associating  the  quark  field  operators  as  in  (2.1)  with  the  presence  of  quark 
condensates  through  the  quark-antiquark  pair  creation  operator  [13] 


(2.5) 

and  a  unitary  transformation 

UQ  =  GBQ-BQ  (2.6) 

one  obtains  with  $(k)  =  UQ\j/0(k)UQ  the  relationship  between  the  arbitrary  function 
(f>(k)  and  the  correlation  function  /z(k)  as  [13,  18] 

(2.7) 

where  $0(k)  is  the  chiral  field  operator.  Thus,  the  form  of  quark  field  operator  in  (2.1) 
with  (2.2)  corresponding  to  XSB  can  have  an  interpretation  of  destabilizing  the  chiral 
vacuum  or  perturbative  vacuum  |vac>  (i.e.  the  vacuum  state  associated  with  the 
expansion  of  free  field  operators)  through  (2.6)  when  00  =  n/2.  Such  a  destabilization 
also  leads  to  the  possibility  of  a  non-perturbative  vacuum  |vac'>  (i.e.  the  vacuum  state 
obtained  via  minimization  of  energy  density)  [9]  obtained  through  the  unitary 
transformation  in  (2.6)  as 

|vac'>  =  [/Q|vac>.  (2.8) 

Such  a  non-perturbative  vacuum  is  also  found  to  be  at  a  lower  energy  than  the 
perturbative  or  chiral  vacuum  [10-12,  13,  18],  Such  a  change  of  basis  from  |vac>  to 
|  vac')  is  also  corresponding  to  the  change  of  parameters  of  the  theory  and  can  be 
treated  as  a  vacuum  realignment  [10]. 

Further,  with  the  quark  field  operators  (2.1)  one  also  obtains  the  expectation  value 
for  flavor  T  as, 

-y)]dk.  (2.9) 


where  the  factor  6  is  due  to  color  and  spin  summation.  One  also  has  here  from  (2.7) 
a  relationship  between  the  function  0,.(k)  and  the  quark-antiquark  correlation  func- 
tion /i;(k)  as  cos  0,-(k)  =  s'm(2hi(k)}  which  in  fact  is  an  explicit  description  of  the  vacuum 
structure  or  vacuum  realignment  [10,  11,  13,  18]. 

As  noted  earlier,  though  in  principle  the  function  </>(k)  can  be  obtained  from  the 
extremization  of  energy  functional  still,  one  does  not  have  a  closed  form  for  it. 
However,  one  uses  here  a  phenomenological  Gaussian  function,  for  its  simplicity  as 

sin(2/iI.(k))  =  cos  </>,.(k)  =  exp(  -  Rfk2/2)  (2.10) 

which  has  been  earlier  useful  to  consider  hadronic  phenomena  associated  with  the 
vacuum  structure  [13,  18]. 
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Next  the  function  <£(-(k)  can  also  be  related  to  the  pion  wave  function  using  Goldstone 
heorem  [13],  where  one  considers  the  chiral  charge  operator  for  pion  (say  n+)  as 

Mv^Wdx  (2.11) 

with  the  fields  (/^'(x)  and  <A2(X)  written  for  u  and  d  quarks  respectively.  With  the  help  of 
such  a  chiral  charge  operator  one  also  has  chiral  symmetry  and  its  breaking  through 
2s+ 1  vac)  =  0  and  Q*5+  \  vac'  >  ^  0  respectively.  One  may  also  note  here  that  such  a  XSB 
corresponds  to  a  pion  state  of  zero  momentum  and  zero  energy  as  a  Goldstone  mode, 
and  is  also  written  explicitly  as 


xu1+(k)d/(-k)|vac'>  (2.12) 

with  Nn  as  the  normalization  constant.  Further,  assuming  the  initial  SU(2)L  x  SU(2)K 
±iral  symmetry  breaking  into  custodial  symmetry  SU(2)K  one  has  here  R±  —  R2  =  R 
and  hence  ^(k)  =  <£2(k)  =  <£(k)  which  when  substituted  in  (2.12)  yields  to  it  as 


1     f 
7r+(0)>  =  JV,-^    dkcos0(k)u}(k)d/(k)|vac/>.  (2.13) 

v/6  J 


Ihe  normalization  constant  Nn  also,  one  obtains  here,  from  the  relation 

cos2<£(k)dk=l  (2.14) 

which  with  the  cos  </>(k)  as  in  (2.10)  becomes 


where  R*  =  R2  for  the  pion  has  been  used.  The  estimation  of  R^  as  23-58  GeV~2  [13] 
through  the  pionic  properties  such  as  pion  decay  constant  has  in  fact  led  to  a  determi- 
nation of  the  vacuum  structure  for  u  or  d  constituent  quark  through  R^  =  R\  =  Rzn 
[13].  Such  a  vacuum  structure  has  also  estimated  pion  charge  radius  in  reasonable 
agreement  with  its  experimental  measurement  [13]. 

Next,  in  a  similar  manner,  defining  the  chiral  charge  generator  <2f +  for  K-meson  as 
[14] 


Qf=  \\l/\(x)y5\l/3(\)d&  (2.16) 

J 

with  1/^3  (x)  being  taken  for  s-quark,  one  evaluates  from  it  using  (2. 1)  the  K  +  state  as  [14] 
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where  JVK  is  the  normalization  constant  for  K-meson  with  the  functions  ^(k)  and 
03  (k)  taken  for  u  and  s  quarks  respectively.  One  may  note  here  that  with  s-»d  (i.e. 
03(k)-*02(k))  the  kaon  state  in  (2.17)  reduces  to  the  pion  state  of  zero  momentum  in 
(2.12)..  However,  (2.17)  has  two  unknown  parameters  /VK  and  /?2  (through  cos03(k)) 
related  to  each  other.  Thus,  the  estimation  of  them  requires  two  relations  which  are 
taken  to  be  the  normalization  condition  for  K-meson  state  as  described  in  (2.  1  7)  and  the 
kaon  decay  constant  through  Van  Royen  and  Weisskopf  relations  [26].  The  normaliz- 
ation condition  for  K-meson  as 

2k=1  (2,8) 


and  the  Van  Royen  and  Weisskopf  relation  for  K-meson  in  momentum  space  as  [14]. 


yields  to 

1        1         vm1/2 


(2.19) 


which  in  fact  with  the  experimentally  measured  value  of  /K  gives  NK  as  [14] 

JVK=  1-9935  GeV3'2.  (2.21) 

Now  a  substitution  of  this  value  of  JVK  in  (2.21)  and  the  earlier  obtained  ^(k)  through 
R*  =  23-58  GeV  ~2  in  (2.18)  yields  to  Rl  as 

R2=  4-084  GeV-2.  -  (2.22) 

Thus  such  an  estimation  of  jR3  or  Rs  determines  <£3(k)  and  hence  the  vacuum  structure 
for  s-quark.  Such  a  vacuum  structure  when  applied  to  estimate  the  kaon  charge  radius 
is  also  found  [14]  to  estimate  in  reasonable  agreement  with  its  experimental  measure- 
ment. 

Further,  through  such  a  formalism,  on  extremization  of  the  gap  function,  one  can 
generate  dynamically  the  quark  masses  as  [16,  17] 

m,=  lim|k|tan(2/it(k))  (2.23) 

k-*0 

which  with  (2.10)  yields  to 

m,-  =  i,  (2.24) 

which,  in  fact,  with  the  quark  radii  as  mentioned  earlier  gives  mu  =  md  =  0-206  GeV  and 
ms  =  0495  GeV. 

We,  in  fact,  will  use  the  field  operators  and  the  masses  of  the  constituent  quarks 
obtained  in  this  section  to  obtain  the  EDMN  in  later  sections. 


T~    A    /•\~4,,u«_  i  one 


In  this  section  we  consider  the  'strong'  CP-violation  present  in  the  fundamental  QCD 
Lagrangian  to  estimate  the  electric  dipole  moment  of  neutron.  In  QCD,  consideration 
of  a  total  divergence  term  constructed  from  the  gluon  field  strength  tensor  G^v  as 
(l/2)e^va0Ga/'G'tv  has  been  found  to  have  some  non-vanishing  physical  effects  due  to 
instanton  effects  [6].  Further,  it  is  parameterized  in  terms  of  a  constant  parameter  as 
—  6>gf2G/JVG/iv/327r2,  with  9  being  zero  or  non-zero  means  no  CP-violation  or  CP- 
violation.  Also,  through  appropriate  global  rotations  of  the  quark  fields  the  effect  of 
GMVGAV  is  transferred  to  the  quark  sector  and  thus  yielding  to  a  CP-violating  Lagran- 
gian upto  first  order  in  6,  written  as  [7,  22] 


(x),  (3.1) 

where 

,  (3.2) 

' 


with  mu,  md  and  ms  being  the  masses  of  u,  d  and  s  quarks  respectively.  Thus  with  the 
CP-violating  interaction  in  (3.1)  and  the  electromagnetic  current  J^x)  one  has  from 
current  algebra  an  expression  involving  EDMN  (Dn)  as  [21] 


T<nl/2(P/)|JM(0)i 

*/ 

=  -Dn(u1/2(P/)V/cvy5Wl/2(P1.)  +  0(k2),  (3.3) 

where  k  =  Pf  —  P,  and  is  the  momentum  carried  by  the  electromagnetic  current  J^. 
Thus  we  estimate  Dn  through  (3.3),  and  to  do  so,  we  consider  the  matrix  element  in 
(3.3)  as 

(x)|n1/2(Pl.)>5  (3.4) 

where  |AT>  indicates  intermediate  states  arising  due  to  <5jS?cp(x)  and  is  an  admixture  of 
opposite  parity  states  consistent  with  the  quantum  numbers  of  neutron  which  may  be 
a  pion-nucleon  system  in  the  ground  state.  Thus  of  the  two  possible  intermediate  states 
in  this  regard  i.e.  \pit~y  and  |  rni°  );  only  \pn~  >  one  contributes  as  it0  at  an  elementary 
level  does  not  couple  to  photon  at  low  momenta.  Further,  |X>  being  an  odd  parity 
state,  the  space  part  of  the  electromagnetic  current  J^(0)  only  contributes  to  the  matrix 
element  which  when  used  in  (3.4)  yields  in  the  Breit  frame  the  expression  as  [21] 


--A-fix/iC-pK^rX/zfr).        (3.5) 

where  in  the  r.h.s.,  terms  of  0(k2)  and  higher  have  been  neglected  due  to  the  small 
momentum  transfer.  Further,  replacing  the  possible  intermediate  states  |AT>  by 
p')^~(p") )  with  ^/2/3  as  the  appropriate  C.G.  factor,  and  simplifying  the 
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r.h.s.,  (3.5)  becomes, 

2  f 
2'  3     Mem(p; p", p')Mcp(p', p", p)dp'dp"  =  Dn(2i/mn)PzPi,  (3.6) 

J 

where  in  (3.6)  the  energy  conservation  k°  =  p°  =  p?  has  been  used.  The  matrix  elements 
due  to  electromagnetic  and  CP- violating  interactions  as  Mem(p;p",  p')  and 
MCP(p',p",p)  respectively  are  explicitly  written  as 


(3.7) 
and 

(3.8) 

The  factor  '2'  in  (3.6),  has  in  fact  arisen  due  to  the  interchange  of  the  two  vertices  i.e. 
electromagnetic  and  CP-violating  ones.  However,  the  estimations  of  the  matrix 
elements  in  (3.7)  and  (3.8)  need  the  translationally  invariant  states  for  proton  and 
neutron  which  we  write  in  consistent  with  their  SU(6)  quantum  numbers  as 

dk1dk2dk3<5(k1  +  k2  +  k3)up(k1,k2,k3) 

uTi^  +  p/SJdf.^  +  p/S) 

p/3)]  |  vac' >  (3.9) 

and 


(k2  +  p/3)df_i(k3  +  p/3) 
-  K?-^!  +  P/3)d};(k2  +  p/3)d!;(k3  +  p/3)]  I  vac'  >,          (3.10) 

where  the  wave  functions  wn>p(k1,k2,k3)  for  neutron  and  proton  are  described  by 
normalized  harmonic  oscillator  wave  functions  as 


(3.11) 

with  Rlp  as  their  harmonic  oscillator  radii.  However,  in  the  earlier  applications  of  the 
present  formalism  to  explain  the  static  properties  of  nucleons  the  two  radii  were  taken 
to  be  the  same  [13],  and  here  also  we  take  them  to  be  the  same  i.e.  R2,  =  Rp  =  Rf^.  Thus, 
with  the  pion  state  as  described  in  (2.3)  and  the  electromagnetic  current  as 
•W)  =  Z  e«f  J(0)y'^!(0)»  we  estimate  the  matrix  element  (3.7)  to  be 

q,a 

Mem(p;p",P')  =  <«1/2(-p)|  X  e? 


dk 

UJ    J     '    3 
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xexp    ---2-p"+ 


r  k  -D'     n'2M 

x  exp    -  2R2J(k22  +  kj  +  k2-k3  +  -^-  +  —  ) 

L  2      6  yj 


where  k  =  -  k2  +  p"  and  k'  =  k2  +  p'  have  been  substituted,  and  with  a  fur 
simplification  of  the  spinor  part  (3.12)  becomes 


J  ~™2"™3>"" 

2                      2 

1 

"       R«        2 

|-k2+p"||k2-f 

n'l  CX^ 

2  (  2     k2 

x  exp 


(3 


The  matrix  element  (3.8)  involving  CP-violating  interaction  S^CP(x)  with  the  us 
the  time  translational  invariance  becomes, 


(3 

with  the  pion  energy  £7r(p")  =  (p"2  +  m2[)1'2.  However,  a  substitution  of  the  ( 
violating  Lagrangian  (3.1)  in  (3.14)  yields  it  as, 


•(3 
which  in  fact  with  the  proton,  neutron  and  pion  states  described  earlier  yields  to 

^V  exp[-^ 

~i  r 
dk; 


xexpl  ~24( 


x  exp 


k' 


12 


350 


x  [4t/l/2(k)y°r5  K_1/2(k)+  L/t.1 
Pramana  -  J.  Phys.,  Vol.  45-  No.  4,  October  1995 


,   (3 


Vacuum  structure,  chiral  symmetry  breaking, . . . 

where,  k  has  been  substituted  for  (p'  +  p")/2.  Thus,  after  some  simplifications  and 
integration  in  (3.16)  we  obtain  it  as, 


MCp(P>P">P)  =  - 


6m 


Lf3*SY'Y*lY'YJL 


x  exp 


24 


•3  p2  r>2 

-,  J^-M       ,,1  K-N       .          „ 

P'2-- gV2  +  f p  P 


COS 


(3.17) 


The  substitution  of  (3.17)  along  with  (3.13)  in  the  l.h.s.  of  (3.6)  when  compared  with 
its  r.h.s.  gives  rise  to  an  expression  for  electric  dipole  moment  of  neutron  after  k3 
integration  as, 


20 


(3.18) 


where  the  integral  7'  is  written  as, 

r 
1=     |k2|2d|k2||p'|2d|p'||p"|2d|p"|d0'd0"dcos0'dcos0" 


l-exp|  _ 


1/2 


n2 

-( 


1/2 


xexp   - 


Ik2-P"l|k2+P'|(p"2 

")] 


n' 
V 


exp 


x  exp 


x  exp    — 


[R2  —  R2  1 
?L— — ^(sin  0'sin  0"cos(</>"  -  0')  +  cos  6'  cos  9")p'p" 

JL !Lp'2  +  _« A[p 

L  24        F  8        P 


(3.19) 


where  k2  =  \k2\,  p'  =  |p'|  and  p"  =  |p"|.  However,  the  integral  7'  in  (3.19)  is  to  be 
evaluated  numerically.  Thus,  in  the  following  section  we  will  be  using  (3.18)  to  estimate 
Dn,  where  also  we  will  discuss  and  compare  it  with  other  theoretical  estimations  as  well 
as  the  experimental  measurements. 


4.  Results  and  discussion 

The  estimation  of  EDMN  from  (3.18)  of  the  earlier  section  primarily  requires  the  radius 
parameters  R2,  R2p  and  R%  for  the  neutron,  proton  and  pion  wave  functions  in  addition 
to  the  quark  masses  and  neutron  mass.  However,  they  were  obtained  from  the  earlier 


/=  1-300547  x  10   JUeVu, 

which  when  substituted  in  (3. 1 8)  along  with  the  dynamically  generated  quark  ma 
described  in  (2.24)  and  the  neutron  mass  from  its  experimental  measuremen 
determines  the  EDMN  as 

Dn=  -746xKT8|0|e-cm. 

Thus,  with  a  value  of  the  parameter  |0|  =  10 ~8  [7,21,22]  the  EDMN  becomes 
Dn=  -7-46x  10  ~26  e-cm, 

which  lies  within  the  uncertainty  of  its  recently  measured  value  of  (3  ±  5)  x  10" ' 
[28]  and  well  within  the  most  recent  experimental  upper  bound  of  |  DJ  <  1 1  x  10 ~ : 
[27]. 

We  now  compare  our  result  with  the  other  model  estimations.  In  standard 
EDMN  comes  out  to  be  within  the  range  of  10 ~30  to  10~32e~cm  [4]  which  is  s 
compared  to  experiments.  Even  with  four  generations  in  such  a  model,  the  esth 
does  not  go  beyond  10  ~29  e-cm  [29].  On  the  other  hand,  the  result  of  Weinberg- 
model  of  spontaneous  CP-violation  [30]  with  dominant  hadronic  loops  is  10  ~2 
which  is  larger  than  experimental  upper  bound  [27].  But,  in  the  Left-Right  Syn 
model  EDMN  is  found  to  be  in  the  range  of  10~  27  to  10~25  e-cm  [3 1]  in  which  a 
present  estimation  lies.  However,  in  bag  model,  with  the  QCD  interaction 
valency  quark  core  level  with  the  add  parity  N*~  states  EDMN  was  obtainec 
2-7  x  10~16|0|  e-cm  [7]  whereas  the  same  in  a  current  algebra  analysis  with 
violating  TiJV-interaction  was  estimated  to  be  —3-8  x  10~16|0|e-cm  [21].  Bu 
these  estimations  when  compared  with  the  experimental  measurement  [2 
pointing  to  a  rather  weaker  CP-violation.  In  the  framework  of  CBM,  Morg; 
Miller  [22],  taking  both  valency  quark  core  and  pionic  contributions,  have  est 
EDMN  as  —4  x  10~170  e-cm,  which  also  with  the  experimental  measureme) 
yields  to  a  weaker  CP-violation,  though  it  is  relatively  stronger  than  the  earli 
estimations  [7,21].  Further,  in  some  recent  attempts  [32-34]  in  the  context  of  n 
of  17(1)  anomaly  problem;  under  certain  approximations  9  has  also  been  predi 
QCD  to  be  vanishing.  In  view  of  these  diverse  model  predictions  we  believe  an  a< 
measurement  in  this  regard  can  provide  the  actual  dynamics  concerned  with  the  I 
and  thus  be  in  a  position  to  distinguish  different  theoretical  models  as  well 
approximations  involved  in  them.  Nevertheless,  we  observe  that  the  present  in\ 
tion  in  its  attempt  to  explain  EDMN  has  an  agreement  with  the  Left-Right  Syr 
model  and  has  a  more  reasonable  explanation  for  the  same  compared  to  the  st 
model.  However,  the  present  nonperturbative  mechanism  has  yielded  a  limited  a 
bility  with  respect  to  light  flavors  only  and  we  believe  when  the  same  is  exter 
heavier  flavors  can  be  able  to  describe  hadron  dynamics  in  a  unified  manner. 
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Abstract.  We  demonstrate  that  chaos  can  be  controlled  using  multiplicative  exponential 
feedback  control.  Unstable  fixed  points,  unstable  limit  cycles  and  unstable  chaotic  trajectories 
can  all  be  stabilized  using  such  control  which  is  effective  both  for  maps  and  flows.  The  control  is 
of  particular  significance  for  systems  with  several  degrees  of  freedom,  as  knowledge  of  only  one 
variable  on  the  desired  unstable  orbit  is  sufficient  to  settle  the  system  onto  that  orbit.  We  find  in 
all  cases  that  the  transient  time  is  a  decreasing  function  of  the  stiffness  of  control.  But  increasing 
the  stiffness  beyond  an  optimum  value  can  increase  the  transient  time.  We  have  also  used  such 
a  mechanism  to  control  spatiotemporal  chaos  is  a  well-known  coupled  map  lattice  model. 

Keywords.  Controlling  chaos;  unstable  orbits;  Lyapunov  exponents;  transient  time;  spatiotem- 
poral chaos. 
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1.  Introduction 

The  problem  of  controlling  chaos  has  recently  received  much  attention  [1-13]. 
A  chaotic  system  in  general  cannot  be  made  to  converge  to  a  freely  evolving  desired 
trajectory,  whether  periodic  or  chaotic,  because  of  the  inherent  unpredictibih'ty  of  the 
system.  The  control  of  chaos  in  this  context  consists  of  forcing  the  system  to  evolve 
along  the  desired  trajectory.  This  can  be  achieved  by  different  methods  [1,4,7,10-13]. 
Though  the  problem  of  controlling  chaos  in  low  dimensional  systems  has  been  the 
main  focus  of  interest  till  now,  recently  controlling  chaos  in  spatiotemporal  system, 
which  is  of  greater  practical  importance,  has  begun  to  receive  attention  [14].  For 
instance,  plasma  devices,  laser  systems,  chemical  reactions,  etc  show  both  spatial  and 
temporal  chaos. 

We  have  shown  recently  [15],  that  use  of  multiplicative  exponential  feedback  control 
on  a  parameter  of  the  system,  with  the  argument  of  the  exponential  being  proportional 
to  the  feedback  response  of  the  system,  i.e.  the  difference  between  the  desired  value  and 
the  actual  value  of  a  suitably  chosen  variable  of  the  system  can  stabilize  unstable  fixed 
points  of  1-D  maps.  Besides,  we  also  created  new  stable  attractors  which  are  not  the 
natural  attractors  of  the  unmodulated  system,  depending  on  our  requirement  and  use, 
using  the  discrete  map.  However,  exponential  control  cannot  stabilize  any  arbitrary 
orbit  of  the  system.  The  functional  form  of  the  map  and  the  control  criterion  decide 
which  orbit  can  be  forced  onto  the  system.  Our  method  is  a  combination  of  variable 
and  parametric  control  as  it  achieves  its  goal  by  controlling  the  parameter  through  the 
variable.  The  motivation  of  using  an  exponential  form  of  control  is  to  have  faster 
convergence  to  the  desired  orbit. 
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unstable  orbits — namely  unstable  fixed  points,  limit  cycles  and  chaotic  trajectories.  We 
demonstrate  that  it  works  effectively  both  for  maps  and  flows.  Besides  controlling 
chaos  in  low-dimensional  systems,  we  have  demonstrated  control  of  spatiotemporal 
chaos  in  a  well-known  coupled  map  lattice  (CML)  model  [16-20]  using  exponential 
control.  Exponential  feedback  pinnings  are  used,  to  control  chaos  in  the  system  by 
stabilizing  a  chosen  unstable  reference  state. 
Consider  a  general  N -dimensional  dynamical  system 

X  =  F(X;#r)  (1) 

where  X  =  (X L ,  X2 , . . . ,  XN)  are  varia bles  and  n  =  (fj,1 ,  /*, , . . . ,  IJ.K )  are  parameters  whose 
values  determine  the  nature  of  the  dynamics.  We  demonstrate  that  the  stabilization  of 
a  desired  unstable  orbit  or  a  chaotic  trajectory  of  such  a  system  is  possible  by 
multiplying  a.  suitably  chosen  parameter  say  nr  in  (1)  by  an  exponential  feedback 
control  involving  only  one  suitably  chosen  variable  say  Xt,  with  the  form  of  control 
being  given  by 

explXX, -*?)]•  (2) 

Here  Xl  is  the  actual  value  of  the  chosen  variable  of  the  system  after  applying  control, 
X\  is  the  desired  value  of  that  variable  and  e  is  the  '"stiffness"  of  the  control  which  can 
take  both  positive  and  negative  values. 
The  dynamics  of  the  modulated  system  in  the  presence  of  control  is  given  by 

X  =  F(X;/i1,/i2,...s/ipexp[8(X1-XJ)],...,AiJC;t).  (3) 

One  of  the  important  features  of  the  control  is  that  it  does  not  change  the  solution  of 
(1)  corresponding  to  the  orbit  X,  =  X* .  However,  it  changes  the  Lyapunov  exponents  of 
the  controlled  orbit,  i.e.  X* .  This  can  be  demonstrated  by  considering  the  logistic  map  as 
it  is  simplest  to  analyse.  The  expression  for  the  Lyapunov  exponent,  for  stabilizing  the 
fixed  point  X*  =  1  -  1/(4A*)  in  the  presence  of  control  is  ln(4ju(l  —2X*)  +  zX*\ 
whereas  the  expression  in  the  absence  of  control  is  In  |4^(1  —  2X*)\.  However,  in  both 
cases  X*  is  a  fixed  point  of  the  map.  This  argument  can  easily  be  extended  for  higher 
periods  of  maps  as  well  as  for  flows. 

The  control  works  for  those  combinations  of  controlling  parameters  and  variables  of 
the  system  for  which  the  largest  real  part  of  the  Lyapunov  exponent  of  the  modulated 
system,  represented  by  (3),  is  negative.  The  feedback  function  in  the  control  involves 
only  one  suitably  chosen  variable  Xl  to  convert  the  desired  repellor — whether  a  fixed 
point,  a  limit  cycle  or  a  chaotic  orbit,  into  an  attractor.  Thus,  knowledge  of  only  one 
suitably  chosen  variable  on  the  desired  unstable  orbit  is  sufficient  to  settle  the  system 
onto  that  orbit.  This  makes  the  control  particularly  useful  for  systems  with  several 
degrees  of  freedom.  For  example,  a  desired  unstable  fixed  point  of  the  Lorenz  system 
[21]  can  be  stabilized  using  control  with  feedback  depending  on  any  one  of  the  X,  Y  or 
Z  variables  whereas  an  unstable  limit  cycle  or  a  chaotic  trajectory  of  the  Lorenz  system 
can  be  converted  into  an  attractor  by  using  feedback  depending  on  the  Z  variable  only. 
For  stabilizing  most  repellors,  it  is  sufficient  to  multiply  one  chosen  parameter  by  the 
exponential  control;  but  sometimes,  as  in  the  case  of  one  of  the  two  fixed  points  of  the 
Henon  map  [22],  both  the  parameters  a  and  b  of  the  system  have  to  be  multiplied  by  the 

356  Pramana  -  J.  Phys.,  Vol.  45,  No.  4,  October  1995 


Controlling  chaos 

>onential  control.  Although  in  all  the  cases  we  have  studied  only  one  variable  on  the 
ired  orbit  was  sufficient  to  make  all  the  Lyapunov  exponents  of  the  system  negative, 
3  may  not  always  be  possible  for  arbitrary  systems  and  it  may  turn  out  that  for  some 
terns  control  using  more  than  one  variable  is  required  for  the  stabilization  of  an 
stable  orbit.  However,  the  fact  remains  that  the  control  uses  only  a  subset  of  the 
iables  and  the  parameters  for  controlling  chaos. 

V  quantity  of  obvious  interest  in  the  context  of  controlling  chaos  is  the  time  required 
the  system  to  settle  onto  the  desired  orbit.  This  of  course  depends  upon  the  stiffness 
:ontrol  E.  For  a  given  i:  we  study  the  time  T  required  for  the  system  to  approach  within 
istance  w  of  the  desired  orbit  starting  from  some  initial  point.  If  co0  is  the  initial 
tance  from  the  desired  orbit,  then  it  is  clear  that  the  length  of  the  transient  T  and  a>  are 
ited  by  to  =  co0exp(//r),  where  A.  is  the  largest  Lyapunov  exponent.  The  slope  of  the 
t  of  T  against  ln(aj/o>0)  is  nothing  but  I/A.  This  of  course  has  to  be  negative  for 
ivergence.  These  values  of  the  Lyapunov  exponents  are  found  to  be  in  good 
cement  with  those  obtained  either  numerically  (using  the  method  given  in  [23]  )  or, 
erever  possible,  analytically.  We  have  studied  the  transient  time  i  required  for 
:ling  onto  a  desired  orbit,  within  a  given  accuracy  co,  as  a  function  off:  and  found  that 
i  general  is  a  decreasing  function  of  e.  But  there  exists  an  optimum  stiffness  of  control 
'ond  which  increase  in  c  increases  r.  This  behaviour  of  T  is  found  to  be  the  same  as 
t  of  A.  with  g  for  a  given  orbit. 

fhere  is  one  drawback.  The  form  of  the  control  is  such  that  for  a  given  system  an 
stable  fixed  point  which  is  represented  by  a  null  vector  cannot  be  stabilized.  The 
son  is  obvious.  The  Jacobian  of  the  modulated  system  given  by  (3)  evaluated  at  such 
oint  in  the  presence  of  the  control  is  the  same  as  that  of  the  unmodulated  system  (1). 
the  eigenvalues  of  the  system  remain  unchanged  in  the  presence  of  control  and 
xed  point  represented  by  a  null  vector  remains  unstable  even  under  control.  Such 
nts  may,  however,  get  stabilized  in  the  process  of  stabilizing  other  fixed  points, 
tilting  in  the  coexistence  of  more  than  one  attractor.  The  fact  that  null  vectors  can  be 
bilized  in  the  process  of  stabilizing  desired  orbit  can  be  illustrated  by  taking  an 
.mple  of  unstable  fixed  points  of  the  logistic  map.  The  map  has  two  fixed  points 
=  0  and  X*  =  1  —  l/(4/j).  X*  is  stable  for  0  <  n  <  1/4  and  unstable  otherwise.  In  the 
icess  of  controlling  AT*,  the  equation  representing  the  modulated  logistic  map  is 


th  the  fixed  points  i.e.,  X*  and  X*,  of  the  unmodulated  map  are  also  the  fixed  points 
:he  modulated  map.  In  order  to  test  the  stability  of  a  fixed  point  of  a  modulated 
tem  (i.e.,  logistic  map  in  the  presence  of  control),  we  have  to  find  derivative  of/  w.r.t 
it  that  point.  The  general  expression  for  df/dX  in  the  case  of  the  modulated  logistic 
pis 


X-X*))(sX(\-X)+\-2X). 

QA 

irefore 

df 

=  4/iexp(  — . 


df 


above  mentioned  range  for  p.  becomes  0  <  fi  <  ex.p(£X*)/4.  This  shows  that  X*  c 
stabilized  for  values  of//  not  lying  in  the  range  (0,  1/4),  by  choosing  suitable  val 
g  for  a  given  X*. 

The  organization  of  this  paper  is  as  follows.  In  §  2  we  study  exponential  conti 
stabilizing  an  unstable  fixed  point.  In  §3.1  we  extend  the  control  algorithm  to  st£ 
unstable  closed  orbits  for  flows.  We  suggest  a  more  effective  control  for  the  stabiliv 
of  higher  orbits  of  a  discrete  dynamical  system  in  §  3.2.  In  §  4  we  discuss  the  stabilis 
of  chaotic  orbits.  We  use  our  control  for  controlling  spatiotemporal  chaos  in  a 
model  in  §  5.  We  conclude  with  a  summary  of  our  results  in  §  6. 

2.  Exponential  control  for  stabilizing  an  unstable  fixed  point 

We  have  studied  Lorenz  system  [21]  as  an  example  of  a  continuous  dynamical  s 
which  is  governed  by  the  following  equations 


rX  -  Y 
Z  =  XY-bZ 

where  X,  Y,  Z  are  the  variables  and  cr,  r,  b  the  controlling  parameters  of  the  systen 
system  has  three  critical  points  viz  X'  =  0,  Y'  =  0,  Z'  =  0;  X"  =  ^/b(i 
Y"  =  v^(r-l),  Z"  =  r  -  1  and  X'"  =  -  ^(r-1),  Y"'  =  -  Jb(r-l),  Z"'  = 
We  choose  the  parameters  a,  r  and  b  so  that  the  system  is  in  a  chaotic  regime;  i.e.  w 
a  =  10,  b  =  8/3  and  r  =  60  in  which  case  all  the  three  critical  points  are  unstablf 
first  critical  point  (Xf,  Y',Z')  which  is  represented  by  a  null  vector  cannot  be 
a  stable  fixed  point  using  this  form  of  control.  This  has  been  discussed  i 
introduction.  The  second  and  third  critical  points  can  be  converted  into  attract< 
multiplying  the  parameter  b  by  exp  [e(Z  —  Z"}~]  and  exp  [fi(Z  —  Z"')]  respective!; 
also  possible  to  stabilize  these  points  using  other  combinations  of  parameter 
variables,  provided  the  control  criterion  i.e.,  the  largest  real  part  of  the  Lyaj 
exponent,  1,  is  negative,  is  satisfied.  Such  combinations,  with  the  corresponding  i 
of  A  are  listed  in  table  1  for  s  —  —  0-001,  —  0-01  and  —  0-1.  Similarly,  A  can  be  w 
out  for  positive  values  of  s  also.  Table  1  shows  that  the  r-X,  r-  Y  and  b-  Y  for 
control  work  for  suitably  chosen  negative  values  of  s.  On  the  other  hand,  it  is  th 
b-X  and  b-Z  forms  of  control  which  are  effective  for  suitably  chosen  positive  valuf 
For  the  rest  of  the  forms  viz  a-X,  a-  Yand  cr-Z,  the  values  of  A  remain  positive  1 
e  (the  Lyapunov  exponents  with  these  forms  of  control  are  independent  of  e  and  a 
same  as  that  of  the  unmodulated  Lorenz  system)  thereby  implying  that  these  foi 
exponential  control  cannot  stabilize  the  unstable  fixed  points  of  the  Lorenz  sysi 
For  a  fixed  CD  the  transient  T,  using  different  forms  of  exponential  control,  is  foi 
be  a  decreasing  function  of  e.  However  there  exists  an  optimum  value  of  e  beyond  ' 
increasing  e  increases  T.  This  behaviour  of  t  is  similar  to  that  observed  in  the  c 
discrete  1-D  maps  [15]. 

358  Pramana  -  J.  Phys.,  Vol.  45,  No.  4,  October  1995 


Controlling  chaos 

Table  1.  The  largest  Lyapunov  exponent  /I  for  the 
Lorenz  attractor  while  stabilizing  (X",  Y ",  Z")  with 
a—  10,  h  =  8/3,  r=60  using  exponential  control 
with  different  combinations  of  parameters  and  vari- 
ables for  different  values  of  the  stiffness  of  control  e. 

A  for  E  = 


Control 
form 

-0-001 

-0-01 

-0-1 

r-X 

0-616 

-0-267 

-3-950 

r-Y 

0-394 

-2-381 

-7-180 

r-Z 

0-767 

1-117 

2-250 

h-X 

0-739 

0-927 

2-425 

b-Y 

0-707 

0-611 

-0-113 

b-Z 

0-786 

1-403 

7-884 

ff-X 

0-717 

0-717 

0-717 

ff-Y 

0-717 

0-717 

0-717 

ff-Z 

0-717 

0-717 

0-717 

The  stable  control  region  i.e.  the  range  of  values  of  e  for  which  an  unstable  orbit 
becomes  stable  is  the  range  of  ?.  for  which  the  largest  Lyapunov  exponent  of  a  given 
system  for  a  given  orbit  in  the  presence  of  control  is  negative  (i.e.  the  control  criterion  is 
satisfied).  This  range  of  E  can  be  obtained  by  using  the  condition  that  the  real  parts  of 
the  eigenvalues  of  the  Jacobian  matrix  about  a  given  orbit  be  less  than  zero  for  flows  or 
lie  in  the  range  (—  1, 1)  for  maps.  When  a  non-zero  fixed  point  of  the  Lorenz  system 
becomes  unstable  then  the  J  matrix  has  one  real  (a)  and  two  complex  conjugate 
eigenvalues  (/?  +  iy).  The  real  eigenvalue  is  always  negative  but  the  real  parts  of  the 
complex  eigenvalues  are  positive.  In  the  presence  of  b-Z  control,  the  lower  limit  of  the 
stable  control  region  of  E  can  be  obtained  by  using  the  condition  that  Re(A)  <  0.  Using 
this  constraint  we  obtain  the  following  expression  for  e: 


where 

R  = 


2(2  +  a) 

The  upper  limit  is  oo  in  this  case.  For  the  given  values  of  the  controlling  parameters  the 
range  of  values  of  £  for  which  the  non-zero  unstable  fixed  points  become  stable  is 
0-009568 ---^e^  oo. 

The  Lyapunov  exponents  obtained  analytically  for  the  second  and  third  critical 
points  in  the  presence  of  the  control  have  the  same  value.  The  variation  of  A  with 
e  shows  the  same  behaviour  as  that  of  i  with  £.  The  optimum  value  of  e  for  which  r  is 
minimum  also  corresponds  to  the  most  negative  value  of  A  (minimum  value  of  A)  and  is 
0-2  for  the  given  values  of  the  parameters  with  b-Z  control.  The  value  of  A  obtained 
analvticallv  is  —  6-8993..  for  E  =  0-2.  We  olot  the  transient  T.  as  a  function  of  ln(co/co,0 


tor  this  value  or  i:  and  nna  mat  me  reciprocal  01  me  slopes  tor  tne  second  and  mira 
critical  points  are  «  —6-916,  which  is  in  good  agreement  with  the  value  obtained 
analytically. 

We  intend  to  compare  the  transient  times  and  the  basin  of  attractions,  which  is  of 
great  practical  importance,  for  exponential  control  with  those  controls  which  show  the 
following  features: 

;i)  The  control  is  continuous, 

;2)  it  depends  only  on  the  variable(s)  of  the  desired  orbit, 

3)  is  independent  of  the  previous  value  of  the  controlling  parameters  \L,  if  the  values  of 
the  parameters  are  changing  in  the  presence  of  control, 

4)  and  does  not  increase  the  dimensionality  of  the  original  system. 

Fhe  reason  for  restricting  the  choice  of  controls  for  comparison  to  those  possessing 
;hese  properties  is  because  the  exponential  control  has  all  the  above  features. 

To  compare  our  control  with  that  of  Pecora  and  Carroll,  both  for  X-drive  and 
K-drive,  we  consider  the  control  where  the  parameter  r  is  multiplied  by  the  exponential 
eedback  function  involving  the  X  variable  for  the  same  set  of  parameter  values  (such 
i  choice  of  the  control  is  made  because  Pecora  and  Carroll's  method  uses  variables 
K  and  Y  as  drives).  We  find  that  the  value  of  A  with  this  form  of  the  control  is  —  4-536  for 
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Figure  1.  Plot  of  the  transient  time  T  vs  ln(to/o>0)  for  the  Lorenz  attractor  with 
a  —  10,  b  =  8/3  and  r  —  60  while  stabilizing  the  unstable  fixed  point  (v/6(r  —  1), 
,Jb(r—\\  r—l).  Values  of  r  for  Pecora  and  Carroll's  method  with  AT-drive 
(triangles)  and  V-drive  (squares)  are  compared  with  those  obtained  by  using 
exponential  control  involving  the  parameter  r  and  the  variable  X  (circles)  and  also 
r  and  7(stars). 
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1-7.  We  also  find  that  /.  for  the  control  involving  the  parameter  /•  and  the  variable 
5  -  7-593  for  e  =  —  0-05  for  the  same  set  of  parameter  values.  The  corresponding 
lies  of  /.  for  Pecora  and  Carroll's  method  are  —  1-83  for  the  X-drive  and  —  2-85  for 
Y-drive  [24].  Our  values  of  A  are  less  than  both  these  values  and  hence  exponential 
itrol  can  be  considered  to  be  more  effective. 

x>  verify  this  we  find  the  transient  time  T,  required  to  stabilize  the  second  unstable 
d  point  of  the  Lorenz  system  for  the  same  set  of  parameter  values,  for  different 
aes  of  co  using  Pecora  and  Carroll's  method  (both  for  the  Jf-drive  and  the  Y-drive) 
I  also  for  exponential  control  involving  the  parameter  r  and  the  variables  X  and 
Dre  =  —  I  -7  and  e==  —0-05  respectively.  We  plot  the  variation  oft  with  In  (co/co0)  in 
ire  1,  which  shows  that  our  control  stabilizes  the  unstable  fixed  point  faster  than 
;ora  and  Carroll's  control.  This  is  not  surprising  in  view  of  the  fact  that  our  control 
an  exponential  form. 

Ve  have  also  compared  our  control  with  the  methods  suggested  by  Singer  et  al  [10], 
I  Chen  and  Chou  [13].  For  the  first  method,  we  find  that  the  basin  of  attraction, 
ile  stabilizing  non-zero  fixed  points  of  the  Lorenz  system,  is  smaller  than  the  one 
ained  using  exponential  control.  Also,  the  transient  time  is  of  the  order  —  105  which 
igher  as  compared  to  that  with  exponential  control  (  ~  103).  The  method  of  Chen 
7  [13]  shows  the  same  transient  time  for  stabilizing  the  unstable  fixed  points  of  the 
•enz  map  but  the  basin  of  attraction  is  smaller  than  that  obtained  with  exponential 
itrol. 

"hese  comparisons  should  not  be  taken  to  imply  that  the  exponential  control  will 
ays  be  the  better  choice.  They  only  show  that  the  performance  of  exponential 
itrol  is  different  from  other  controls  and  may  be  more  efficient  for  some  systems. 

Exponential  control  for  stabiiizing  unstable  higher  period  orbits 

Stabilization  of  unstable  closed  orbits  for  flows 

unstable  limit  cycle  of  a  continuous  system  can  be  converted  into  a  stable  limit  cycle 
nultiplying  one  of  the  suitably  chosen  parameters,  say  /j,,  (or  more  depending  on  the 
:em)  by  the  exponential  feedback  function  which  depends  only  on  one  of  the  variable 
Xr,  i.e. 


jre  Xr  is  the  actual  value  of  the  variable  of  the  given  system  with  the  feedback  control 
ler  consideration  and  X"  is  the  value  of  the  Xr  coordinate  on  the  desired  unstable 
it  cycle  which  is  required  to  be  stabilized.  The  X"  is  obtained  after  allowing  the 
:em  to  evolve  freely  for  the  same  parameters  without  imposing  the  control. 
Ve  have  implemented  this  idea  for  controlling  unstable  limit  cycles  of  the  Lorenz 
:em.  We  choose  parameter  values  o  —  10,  b  =  8/3  and  r  =  28.  For  these  values  the 
irdinates  of  one  point  on  one  of  the  unstable  limit  cycles  are  (—12-786189, 
9-364189,  24-00)  [25].  The  A  of  the  system  is  found  to  be  positive.  Since  the  limit 
le  is  unstable,  even  on  starting  from  an  initial  state  very  close  to  the  limit  cycle,  the 
:em  trajectory  will  diverge  away  from  the  limit  cycle.  On  implementation  of  the 
itrol,  in  which  the  parameter  b  is  multiplied  by  exp[e(Z  —  Z")],  the  Lyapunov 
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Figure  2.  The  dotted  trajectory  shows  the  convergence  under  exponential  control 
of  a  nearby  trajectory  to  the  unstable  closed  orbit,  represented  by  the  bold  curve, 
projected  onto  the  (X,Z)  plane  of  the  Lorenz  system  with  <?=  10,  b  =  8/3,  and 
r  =  28  starting  from  a  point  with  coordinates  ( -  12-786189,  -  19-364189, 24-00)  for 
s  =  0-032. 


exponents  of  the  system  are  found  to  become  dependent  on  s  and  for  a  certain  range  of 
values  of  £,  the  real  parts  of  all  the  Lyapunov  exponents  become  negative  thereby 
implying  the  stabilization  of  the  limit  cycle.  Now,  nearby  trajectories  are  found  to 
converge  to  the  desired  limit  cycle  as  shown  in  figure  2.  The  transient  i  shows  a  similar 
dependence  on  s  as  in  the  previous  sections.  But  for  a  given  accuracy  co,  the  minimum 
value  oft  corresponding  to  the  optimal  E  is  much  higher  as  compared  to  the  case  of  the 
unstable  fixed  point  of  the  Lorenz  system.  Moreover,  the  variation  oft  with  £  for  a  given 
D  is  not  as  rapid  as  in  the  case  of  the  unstable  fixed  point.  With  the  result  i  vs  e  plot  is 
quite  flat  displaying  a  shallow  minimum. 

We  have  also  tested  the  control  for  another  limit  cycle  of  the  Lorenz  attractor.  For 
the  same  parameter  values,  the  coordinates  of  a  point  on  a  different  limit  cycle  are 
[-  13-917865,  -  21-919412,  24-00)  [25].  The  control  is  found  to  work  effectively,  and 
with  similar  features,  in  this  case  as  well. 

3.2  Stabilization  of  higher  period  orbits  of  a  discrete  dynamical  system 

We  can  extend  the  above  control  to  convert  unstable  fixed  points  of  higher  period  say 


to  the  stable  fixed  points  for  given  values  of  the  controlling 
parameters.  What  is  required  is  a  feedback  that  encodes  as  much  information  about 
a  periodic  orbit  as  is  necessary  for  its  unique  characterization.  But  there  is  a  problem  of 
practicality.  Since  in  this  case  the  form  of  the  control  given  by  eq.  (2)  requires  the 
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vergence  of  the  chosen  variable  to  one  of  the  k  values  of  that  variable,  the 
trolling  technique  diminishes  in  utility  with  increase  in  period.  For  higher  period 
its  of  a  discrete  dynamical  system  a  more  effective  control  is  one  which  employs 
gical  OR  structure  in  the  feedback  function.  So  in  order  to  stabilize  unstable  fixed 
nts  of  period  k  i.e.  {X*1}  one  of  the  parameters  jur  in  (1)  is  multiplied  by 


exp 


(5) 


;re  X,(n)  is  the  value  of  the  chosen  variables  Xl  at  time  n  of  the  modulated  map  after 
ilying  control  and  (Xf  l,Xf2,..., Xfk)  is  the  set  of  k  values  of  X{  on  the  desired 
table  period  k  orbit. 

Ve  had  implemented  this  in  the  case  of  the  logistic  map  for  stabilizing  a  period 
rbit  and  found  it  to  work  effectively  [15]. 

stabilization  of  chaotic  orbit 

have  also  tried  to  make  different  trajectories  converge  to  a  particular  desired 
otic  trajectory  using  our  control.  Again  taking  the  Lorenz  system  as  an  example  we 
ose  the  parameter  values  as  a  =  1 6,  b  =  4  and  /•  =  40.  The  real  parts  of  the  Lyapunov 
onents  are  (1-37,  0-0,  —22-37)  [26]  indicating  that  this  is  a  chaotic  regime.  We 
>ose  a  chaotic  trajectory  starting  with  an  initial  point  (10-0, 0-0,  30-0).  We  apply  our 
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Figure  3a.  The  distance  between  the  desired  trajectory,  starting  with  an  initial 
point  (10-0,  0-0,  30-0)  and  a  trajectory  started  from  a  point  away  from  the  desired 
trajectory,  for  the  Lorenz  system  with  the  parameter  values  as  a  =  16,  b  =  4  and 
r  =  40  for  e  =  0-028  is  plotted  against  time. 
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Figure  3b.     Plot  of  Arc  vs  A"  for  the  Lorenz  systems  of  figure  3a. 


control  by  multiplying  b  by  exp[e(Z  —  Zc)],  where  Zc  is  the  Z-coordinate  of  a  point  on 
the  freely  evolving  chaotic  trajectory.  We  find  that  the  real  parts  of  all  the  Lyapunov 
exponents  of  the  modified  Lorenz  system  become  negative  for  a  certain  range  of  values 
of  s,  showing  that  the  desired  chaotic  trajectory  has  become  a  stable  trajectory  and 
different  close  by  trajectories  starting  from  different  initial  states  converge  to  the 
desired  trajectory.  For  a  given  £  in  this  range,  the  reciprocal  of  the  slope  oft  vs  In(co/co0) 
is  in  good  agreement  with  the  numerically  obtained  minimum  value  of  L  It  has  been 
verified  numerically  that  the  system  settles  down  onto  the  desired  trajectory  in  the 
presence  of  the  control.  In  figure  3  (a)  we  plot  the  distance  between  the  desired  trajectory 
and  a  trajectory  started  from  a  point  away  from  the  desired  trajectory,  for  e  =  0-028.  We 
also  plot  Xc  vs  X  in  figure  3(b)  for  the  case  in  figure  3  (a).  The  other  features  of  the 
control  remain  unchanged.  We  have  also  tested  our  control  for  other  chaotic  trajecto- 
ries and  found  it  to  work  effectively. 

5.  Controlling  spariotemporal  chaos 

With  exponential  control  it  is  also  possible  to  control  spatiotemporal  chaos.  A  spa- 
tiotemporal  system  can  be  described  by  a  coupled  map  lattice  (CML)  model  [16-20]. 
Feedback  pinnings  [14]  are  used  to  control  chaos  of  the  system  by  stabilizing  a  certain 
unstable  reference  state.  We  use  exponential  feedback  pinnings  to  control  such 
a  system.  Here  we  consider  the  one-dimensional  CML  model  [17-20],  given  by 


=  F 


(6) 
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where  /  =  1,  2,  .  .  .  ,  L  label  the  lattice  sites  and  L  is  the  system  size.  We  assume  periodic 
boundary  conditions,  Xn(i+  L)-Xn(i).  Moreover,  we  take  f(X)  =  iuX(\  —  X).  For 
a  single  site  (L  =  1),  model  (6)  reduces  to  the  well-known  logistic  map.  The  dynamical 
behaviour  of  model  (6)  with  L>  1  is  quite  well  understood  [16-20].  The  coupling  term 
in  (6)  introduces  very  rich  spatiotemporal  patterns  and  the  system  displays  homogene- 
ous stationary  solutions,  inhomogeneous  stationary  solutions  (spatial  patterns),  homo- 
geneous periodic  solutions  and  inhomogeneous  periodic  solutions  (running  or 
standing  waves).  In  a  chaotic  state,  all  these  patterns  are  unstable.  We  take  L  =  60, 
a  =  0-8  and  \L  —  4.  The  initial  condition  is  prepared  by  choosing  random  numbers 
uniformly  distributed  in  the  interval  [0,  1].  The  motion  of  the  system  is  fully  turbulent 
at  the  chosen  parameter  values  [17]. 

To  control  this  system,  we  use  pinnings  defined  by 

L/I 

*,+  1  (')  =  *"  I  <5(i-/*-l)exp[e(*.(0  -*.(/)]  (7) 

fc=  i 

where  /  is  the  distance  between  two  neighbouring  pinnings,  Xn(i)  is  the  reference  state 
to  be  stabilized  and  o(j)  =  1  for  j  =  1  and  S(j)  =  0  otherwise. 
The  homogeneous  stationary  state  of  the  system  (6)  is 

*„(/)  =  **  =  !-!//£  (8) 

and  an  inhomogeneous  stationary  state  is  given  by 


where  7  =  1,2,...,  L/2,  B=\  —  (J.+  2/^a  and  C  =  a  —  /^a  +  2^a2.  Both  states  are  un- 
stable for  a  =  0-8  and  fj.  =  4. 

We  control  the  unstable  homogeneous  stationary  state  (8)  using  (7)  with  1  —  2  and 
3-36  <  e  <  3-78  and  the  unstable  inhomogeneous  stationary  state  (9)  with  1  —  1  and 
0-69  <e<  0-8.  These  ranges  for  e  are  obtained  numerically.  Although  the  initial  values 
at  the  lattice  sites  are  randomly  distributed  between  0  and  1  ,  the  system  finds  the  desired 
patterns  i.e.  eqs  (8)  and  (9)  in  less  than  104  iterations.  We  show  the  effects  of  our  control 
to  state  (8)  for  /  =  2,  e  =  3-5  in  figure  4(a),  and  to  state  (9)  for  I  =  2,  e  =  0-75  in  figure  4(b). 

At  the  parameter  values  a  =  0-3  and  ju  =  4,  system  (6)  has  a  running  wave  solution  of 
time-period-two  and  space-period-four.  Starting  from  initial  conditions  very  near  the 
running  wave  state,  the  system  approaches  the  goal  state  in  a  small  interval  of  time.  If 
the  initial  values  on  the  lattice  sites  are  randomly  distributed  over  the  interval  [0,  1],  the 
transient  time  to  the  stable  running  wave  state  increases  with  increase  in  L.  For  L  =  60 
the  average  transient  time  is  of  order  101  3  iterations  or  more  [20].  Using  (7)  with  /  =  4 
and  e  =  l-0  for  the  sites  with  Xn(i)  =  0-45841379...  and  e  =  2-0  for  those  with 
XM(i)  =  0-89872907...,  the  average  transient  time  is  of  order  103,  (shown  in  figure  5), 
which  is  much  shorter  than  that  without  control.  We  have  considered  different  random 
initial  configurations  of  the  kind  described  above  and  found  that  the  system  always 
finds  the  same  attractor  oossiblv  with  a  few  defects.  However  no  defects  exist  in  figure  5. 
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Figure  4a.  The  homogeneous  unstable  stationary  state  of  eq.  (6)  with  a  =  0-8,  /i  =  4 
and  L  =  60  is  controlled  using  exponential  control  at  sites  f  =  1, 3, 5, . . . ,  59  with  e  =  3-5. 
Circles  represent  the  state  Xn(i)  =  X*  =  0-75  while  stars  represent  other  states. 
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Figure  4b.  The  inhomogeneous  unstable  stationary  state  of  (6)  with  a  =  0-8,  ju  =  4 
and  L=60,  which  is  XH(2j-  1)  =  0-536537...,  Xn(2j)  =  0-880129...,  is  controlled 
using  exponential  control  at  sites  j  =  1, 3, 5, . . . ,  59  with  t-  =  0-75.  Circles  represent 
that  Xn(i)  has  achieved  the  desired  value  while  other  values  are  represented  by  stars. 
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Figure  5.  The  transient  process  of  the  system  evolution  under  controlling  time- 
period-two  and  space-period-four  running  wave  for  a  =  0-3  and  n  =  4.  The  initial 
state  is  prepared  using  random  numbers  uniformly  distributed  in  [0, 1].  Circles 
represent  the  state  \Xn(i)-Q-89\  =$0-15,  stars  represent  | J^/i)  -  0'45|  <$  0- 1 5  while 
blank  spaces  represent  other  states.  Exponential  control  is  applied  at  sites 
/  =  1, 5, 9, . . . ,  57;  £  =  1-0  for  the  sites  with  Xn(i)  =  045841379...  and  r.  =  2-0  for  those 
with  Xn(i)  =  0-89872907....  We  plot  data  once  every  19  iterations. 

:  average  relaxation  time  increases  with  increase  in  /.  It  is  found  to  be  of  order  105  for 
12  with  more  defects  as  compared  to  /  =  4. 


Conclusion 

r  exponential  control  given  by  (2)  is  found  to  work  effectively  both  for  maps  as  well 
'or  flows.  Since  the  knowledge  of  only  a  subset  of  variables  on  the  desired  unstable 
•it  is  sufficient  to  settle  the  system  onto  that  orbit,  this  makes  the  control  particularly 
ful  for  systems  with  several  degrees  of  freedom.  In  the  presence  of  the  control,  there 
sts  a  range  of  values  of  the  stiffness  constant  e  for  which  a  repellor  —  whether  it  be 
iced  point,  limit  cycle  or  a  chaotic  trajectory,  can  be  converted  into  an  attractor.  Both 
nd  T  are  found  to  be  decreasing  functions  of  e.  However  there  exists  an  optimum 
fness  control  for  which  both  A  and  t  go  through  a  minimum.  Such  behaviour  is 
iracteristic  of  different  types  of  orbits  and  for  a  variety  of  systems. 
n  the  case  of  a  CML  model  we  find  that  control  can  be  effected  by  pinning  alternate 
:ice  sites  using  exponential  feedback.  We  have  succeeded  in  stabilizing  various 


otatir\narv 


Acknowledgements 

We  would  like  to  thank  Dr  Neelima  M  Gupte  for  her  invaluable  comments  and 
suggestions.  We  also  thank  the  Inter  University  Center  for  Astronomy  and  Astrophys- 
ics (IUCAA)  for  allowing  the  use  of  their  computing  facility.  One  of  the  authors  (SDG) 
also  thanks  Kirori  Mai  College,  University  of  Delhi,  Delhi,  for  grant  of  study  leave. 

References 

[1]  L  M  Pecora  and  T  L  Carroll,  Phys.  Rev.  Lett.  64,  821  (1990) 

[2]  L  M  Pecora  and  T  L  Carroll,  Phys.  Rev.  Lett.  67,  645  (1991) 

[3]  L  M  Pecora  and  T  L  Carroll,  Phys.  Rev.  A44,  2374  (1991) 

[4]  E  Ott.  C  Grebogi  and  J  A  Yorke,  Phys.  Rev.  Lett.  64,  1 196  (1990) 

[5]  T  Shinbrot,  E  Ott,  C  Grebogi  and  J  A  Yorke,  Phys.  Rev.  Lett.  65,  3215  (1990) 

[6]  W  L  Ditto,  S  N  Rauseo  and  M  L  Spano,  Phys.  Rev.  Lett.  65,  321 1  (1990) 

[7]  S  Sinha,  R  Ramaswamy  and  J  S  Rao,  Physica  D43, 118  (1990) 

[8]  E  R  Hunt,  Phys.  Rev.  Lett.  67,  1953  (1991) 

[9]  C  Reyl,  L  Flepp,  R  Badii  and  E  Brun,  Phys.  Rev.  E47,  267  (1993) 
[10]  J  Singer,  Y  Z  Wang  and  H  H  Bau,  Phys.  Rev.  Lett.  66,  1 123  (1991) 
[1 1]  S  Rajasekar  and  M  Lakshmanan,  Physica  D67,  282  (1993) 
[12]  K  Murali  and  M  Lakshmanan,  Phys.  Rev.  E48,  R1624  (1993) 
[13]  Y  H  Chen  and  M  Y  Chou,  Phys.  Rev.  E50,  2331  (1994) 
[14]  G  Hu  and  Z  Qu,  Phys.  Rev.  Lett.  72,  68  (1994) 
[15]  S  D  Gadre  and  V  S  Varma,  Pramana  -  J.  Phys.  43, 431  (1994) 
[16]  T  Bohr  and  O  B  Christensen,  Phys.  Rev.  Lett.  63,  2161  (1989) 
[17]  K  Kaneko,  Prog.  Theor.  Phys.  74, 1033  (1985);  Physica  (Amsterdam)  D23, 437  (1986);  D34, 

1  ( 1989);  D37,  60  (1989) 
[18]  J  P  Crutchfield  and  K  Kaneko,  in  Directions  in  chaos,  edited  by  Hao  Bai-Lin  (World 

Scientific,  Singapore,  1987);  Phys.  Rev.  Lett.  60,  2715  (1988) 
[19]  J  D  Keeler  and  J  D  Farmer,  Phvsica  (Amsterdam)  D23, 413  (1986) 
[20]  E  J  Ding  and  Y  N  Lu,  J.  Phys.  A25,  2897  (1992) 
[21]  E  N  Lorenz,  J.  Atoms.  Sci.  20, 130  (1976) 
[22]  M  Henon,  Commun.  Math.  Phys.  50,  69  (1976) 

[23]  C  Sparrow,  The  Lorenz  equations,  bifurcations  and  chaos  (Springer  Verlag,  New  York,  1982) 
[24]  N  Gupte  and  R  E  Amritkar,  Phys.  Rev.  E48,  R1620  (1993) 
[25]  J  H  Curry,  in  Global  theory  of  dynamical  systems,  Lecture  Notes  in  Mathematics,  Edited  by 

Z  Nitecki  and  C  Robinson  (Springer  Verlag,  Berlin,  1980)  819, 1 1 1-120 
[26]  I  Shimada  and  T  Nagashima,  Prog.  Theor.  Phys.  69,  1605  (1979) 


368  Pramana  -  J.  Phys.,  Vol.  45,  No.  4,  October  1995 


Physics  pp.  369-376 


icroscopic  equation  of  motion  in  inhomogeneous  media: 
nicroscopic  treatment 

/I  JAYANNAVAR  and  MANGAL  C  MAHATO 

itute  of  Physics,  Sachivalaya  Marg,  Bhubaneswar  751  005,  India 

received  10  May  1995;  revised  16  August  1995 

tract.  The  dynamical  evolution  of  a  Brownian  particle  in  an  inhomogeneous  medium  with 
;ially  varying  friction  and  temperature  field  is  important  to  understand  conceptually.  It 
lires  to  address  the  basic  problem  of  relative  stability  of  states  in  nonequilibrium  systems 
ch  has  been  a  subject  of  debate  for  over  several  decades.  The  theoretical  treatments  adopted 
ir  are  mostly  phenomenological  in  nature.  In  this  work  we  give  a  microscopic  treatment  of 
problem.  We  derive  the  Langevin  equation  of  motion  and  the  associated  Fokker-Planck 
ation.  The  correct  reduced  description  of  the  Kramers  equation  in  the  overdamped  limit 
oluchowski  equation)  is  obtained.  Our  microscopic  treatment  may  be  helpful  in  under- 
iding  the  working  of  thermal  ratchets,  a  problem  of  much  current  interest. 
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ntroduction 

;rmodynamic  equilibrium  states  are  ideal  limiting  cases  and  are  convenient  and 
:n  theoretically  amenable  to  study  with  relative  ease.  However,  one  commonly 
ounters  systems  that  are  away  from  equilibrium.  All  nonequilibrium  systems  relax 
urally  toward  their  respective  equilibrium  or  stationary  states.  In  nature,  evolution 
in  ongoing  and  dominant  process.  Naturally,  the  process  of  relaxation  of  the 
[equilibrium  systems  is  of  great  interest  in  all  branches  of  natural  science,  be  it 
rsics,  chemistry  or  biology.  Moreover,  one  comes  across  nonuniform  systems  more 
:n  than  uniform  systems.  Uniform  systems  are  characterized  by  a  constant  (space 
spendent)  diffusion  coefficient  throughout  the  system  and  having  the  same  tempera- 
;  in  all  parts  of  the  system.  There  are  well  established  theoretical  formalisms  for 
Form  systems  to  describe  their  evolution  towards  equilibrium  or  steady  states  [1]. 
wever,  the  same  is  not  true  for  nonuniform  systems.  There  exist  phenomenological 
sriptions  but  often  without  microscopic  foundations.  The  ad  hoc  nature  of  these 
criptions  have  led  to  some  controversies,  too,  in  the  past.  For  instance,  should  the 
usion  equation  of  a  Brownian  particle  in  the  absence  of  external  potential  have  the 
n  [2-8] 


nonumiorm  systems  self-consistent  [2,4,  s,  1UJ. 

In  this  work,  we  seek  to  clarify  some  of  the  issues  pertaining  to  this  important  case  of 
nonuniform  systems  in  a  systematic  manner.  We  derive,  from  microscopic  theory,  the 
Kramers  equation  for  the  joint  probability  distribution  of  position  and  velocity  of 
a  Brownian  particle  in  an  inhomogeneous,  nonisothermal  medium.  We  then  proceed  to 
find  the  correct  Smoluchowski  limit  to  the  Kramers  equation.  This,  however,  is  not 
a  mathematical  problem  alone;  the  underlying  conceptual  development  is  quite 
appealing  and,  as  mentioned  earlier,  is  subject  to  ongoing  controversies  for  over  several 
decades  [2-8]. 

The  evolution  of  a  Brownian  particle  in  condensed  media  is  the  most  familiar  example 
of  a  nonequilibrium  process.  The  process  is  accompanied  by  frictional  dissipation  but 
aided  by  associated  fluctuations.  Nonequilibrium  behaviour  of  macroscopic  uniform 
systems  is  described  well  by  linear-response  theory  when  the  initial  state  of  the  system  is 
close  to  equilibrium.  The  fluctuation-dissipation  theorem  relating  the  power  absorbed  by 
the  system  to  the  intrinsic  fluctuations  in  the  system  in  equilibrium  has  foundations  in 
the  linear  response  theory.  But,  when  the  system  is  far  from  equilibrium  the  linear 
response  theory  cannot  be  relied  upon.  In  most  of  the  physical  systems,  whether  close  to 
or  far  from  equilibrium,  the  approach  to  equilibrium  can,  however,  be  likened  to  one 
kind  or  other  of  a  diffusion  process;  it  may  be  translational  diffusion  of  particles, 
rotational  diffusion  of  macromolecules,  spin  diffusion  of  spin  systems,  heat  or  thermal 
diffusion  in  solids,  energy  diffusion  in  excitonic  motion  in  semiconductors,  and  so  on. 
The  diffusion  process  in  inhomogeneous  systems,  therefore,  calls  for  added  attention. 

In  the  case  of  uniform  systems  the  diffusion  constant  D  =  rj~lkBT,  where  rj  is  the 
friction  coefficient  and  T  the  temperature.  However,  for  nonuniform  systems  the  space, 
q,  dependence  of  the  diffusion  comes  separately  through  r\(q]  and  T(q).  The  origin  of 
r\(q]  and  T(q)  and  the  manner  in  which  they  influence  the  relaxation  of  the  nonequilib- 
rium system  are  entirely  different.  The  variation  of  r\(q]  (in  the  absence  of  spatial 
variation  of  temperature)  influences  the  dynamics  of  particle  in  a  potential  field  and 
helps  the  system  to  approach  towards  its  equilibrium  or  steady  states.  The  relative 
stability  of  the  competing  states  is  generally  governed  by  the  usual  Boltzmann  factor  in 
the  local  neighbourhood  of  the  corresponding  (representative)  potential  wells. 
A  change  in  the  potential  barrier  between  two  potential  well  minima  changes  the 
relaxation  rate  but  leaves  the  relative  stability  of  the  two  well  states  unchanged.  This 
simple  fact,  however,  may  not  apply  for  more  general  systems  when  the  temperature  is 
nonuniform  along  the  potential  surface  (or  spatial  coordinate). 

Landauer,  in  a  series  of  papers  [2-5],  argues  that  for  systems  with  nonuniform 
temperature  the  relative  stability  of  two  states  will  be  affected  by  the  detailed  kinetics  all 
along  the  pathways  (on  the  potential  surface)  between  the  two  states  under  comparison. 
It  is  the  effect  of  thermal  fluctuations  that  plays  a  crucial  role  and  the  resulting  effective 
potential  surface  may  have  completely  different  nature  from  that  of  uniform  temperature. 
With  the  help  of  his  blowtorch  theorem  Landauer  [3]  shows  that  a  change  of  temperature 
away  from  uniformity  even  at  very  unlikely  positions  of  the  system  on  the  potential  surface 
may  cause  probability  currents  to  set  in  moving  the  system  towards  a  new  steady  state 
situation  changing  thereby  the  relative  stability  of  the  otherwise  locally  stable  states. 
This  known  important  fact,  however,  has  received  much  less  attention  in  the  literature 
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than  it  deserves.  This  effect  can  have  important  consequences  on  the  particle  motion  in 
nonuniform  systems,  for  instance,  the  kinetics  of  growth  of  crystalline  nuclei  in  the  melt 
around  its  critical  size.  The  latent  heat  generation  being,  in  this  example,  responsible  for 
the  creation  of  nonuniform  temperature  field  across  the  surface  of  the  nucleus.  Nonuniform 
temperature  field  can  also  be  generated  by  shining  light  on  semiconductors.  One  can 
have  nonuniform  temperature  field  also  because  of  nonuniform  distribution  of  electrons 
and  of  phonons  (or  of  quasiparticles  in  general)  with  different  characteristic  temperatures 
in  a  solid.  It  has  been  suggested  that  the  nonuniform  temperature  field  can  produce 
current  in  a  closed  ring  [3, 6, 9].  There  has  been  a  lot  of  theoretical  work  reported  in 
recent  times  on  thermal  ratchets  [10].  These  works  are  inspired  by  the  observed 
predominantly  unidirectional  protein  (macromolecule)  motion  in  biological  systems 
even  in  the  absence  of  obvious  external  forces  and  thermal  gradients.  The  idea  of 
relative  stability  of  states  in  nonuniform  temperature  systems  can  help  to  understand 
the  working  of  the  thermal  ratchets  better  [11, 12].  These  are  but  few  examples  where 
nonuniform  temperature  field  can  have  important  bearing  on  the  dynamical  evolution. 
A  systematic  formalism  to  deal  with  such  a  situation  is,  therefore,  essential. 

In  the  following  sections  we  proceed  systematically  to  set  up  a  formalism  from 
microscopic  theory.  We  derive  the  Kramers  equation  for  space  dependent  friction 
coefficient  and  nonuniform  temperature  field.  We  then  go  over  to  obtain  the  correct 
Smoluchowski  limit  of  the  Fokker- Planck  equation.  Before  concluding  we  also  give 
the  correct  Langevin  equation  in  the  overdamped  limit  that  is  approximated  properly 
to  order  r\(q)~l. 

2.  Microscopic  derivation  of  Langevin  equation  in  a  space  dependent  friction  field 

To  obtain  Langevin  equation  in  a  space  dependent  friction  field  we  consider  the  motion 
of  a  subsystem  (Brownian  particle)  described  by  its  coordinate  Q  and  momentum  P  and 
subjected  to  an  external  potential  field  V(Q)  of  the  system.  We  assume  the  subsystem  to 
be  in  contact  with  a  thermal  (phonon)  bath.  The  bath  oscillators  are  described  by 
coordinates  qa  and  momentum  pa  with  characteristic  frequencies  a)x.  For  our  calcula- 
tion we  consider  the  total  Hamiltonian 

•m    fit^    /  A  (D\  \  2  ~ 

(1) 


where  M  is  the  Brownian  particle  mass  and  mx  are  the  masses  of  the  bath  oscillators. 
The  interaction  of  the  subsystem  with  the  thermal  bath  [13]  is  through  the  linear 
coordinate-coordinate  coupling  term  kaqaA(Q).  From  (1)  one  obtains  the  following 
equations  of  motion 


<2c> 

Pramana  -  J.  Phys.,  Vol.  45,  No.  4,  October  1995  37  1 


A  M  j  ayannavar  ana  mangai  c  ivianaw 

and 

P,=  -m,<o?q,  +  A,A(Q)t  (2d) 

where  A'(Q)  is  the  derivative  of  A(Q)  with  respect  to  Q.  After  solving  (2c)  and  (2d)  for  qa 
using  the  method  of  Laplace  transform  and  substituting  its  value  in  (2b),  we  obtain  the 
Langevin  equation  of  motion  for  Q  and  P. 

Q  =  —  ,  (3a) 

*     M' 

P  =  ~  V'(fi)  -  *  [-4'(0]2      4-  XWW-  (3b) 


Thus,  the  effect  of  interaction  of  the  Brownian  particle  with  the  thermal  bath  is  to 
introduce  a  friction  term  and  a  fluctuating  term  f(t)  in  the  equation  of  its  motion.  The 
fluctuating  term  is  given  by 

(4a) 


2 

where  ^(0)  and  ^(0)  are  the  initial  positions  and  velocities  of  the  bath  variables.  The 
force  f(t)  is  fluctuating  in  character  because  of  the  associated  uncertainties  in  these 
initial  conditions  of  the  bath  variables.  However,  as  the  thermal  bath  is  characterized 
by  its  temperature  Tt  the  equilibrium  distribution  of  bath  variables  is  given  by  the 
Boltzmannian  form,  so  that  /(r)  follows  the  following  statistics 

</(;)>  =0  (4b) 

and 


*      a    *  (4c) 

=  2/cB7>5(r-O- 

To  arrive  at  equations  (3b)  and  the  last  term  of  (4c)  we  have  assumed  [13, 14, 15]  ohmic 
spectral  density  for  the  bath  oscillators,  i.e., 

PM  =  ~  £  — — <5(co  -  coc)  =  ^coexp( -  co/coc). 

*-         *,         If i*  OJ 


z^-'x 


The  upper  cut-off  frequency  <yc  is  assumed  to  be  much  larger  than  the  characteristic 
frequencies  of  the  system.  Equations  (3b)  and  (4c)  correspond  to  the  well  known 
Markovian  limit  and  are  valid  for  time  scales  t  >  l/cnc,  which  can  be  made  arbitrarily 
small  by  appropriately  choosing  <DC.  For  details  we  refer  to  [14, 15].  It  should  be  noted 
that  the  transient  terms  have  been  neglected  at  time  scales  t  >  ojc~ L  to  arrive  at  (3b),  and 
is  perfectly  valid  under  Markovian  approximation  [15].  It  is  noted  that  A'(Q)  =  con- 
stant corresponds  to  a  uniform  friction  coefficient.  Redefining,  r\\_A' (<2)]2  =  rj(Q)  and 
)>  and  putting  M  =  1,  we  get, 


Q  =  P,  (5a) 

P  =  -  V'(Q)  -  n(Q)P  +  >/»/(fi)T/(r),  (5b) 
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\J  WJ  (I  )S  =  ^K-B0(l  -  I  ).  (DC) 

5  instructive  to  note  that  one  could  take  rj(Q)  to  be  constant  piecewise  along  Q;  in 
h  piece  of  these  Q  segments  (5b)  would  correspond  to  a  constant  friction  coefficient 
with  the  same  statistical  character  of  f(t)  as  in  any  other  Q  intervals. 

Vlicroscopic  Markovian  Langevin  equation  with  space  dependent  friction  and  temperature 

have  so  far  derived  the  Langevin  equation  of  motion  (from  a  microscopic 
miltonian)  of  a  Brownian  particle  with  space  dependent  friction  keeping  the 
iperature  constant.  We,  now,  consider  a  system  for  which  the  temperature  too  is 
ce  dependent  T(Q).  At  this  point  it  is  pertinent  to  note  the  following  important  fact, 
vever.  It  is  quite  well  known  that  when  a  charged  Brownian  particle  is  subjected  to 
electric  field  charge  current  results.  Similarly,  when  it  is  subjected  to  a  thermal 
dient  thermal  current  flows  in  the  system.  However,  in  the  former  case  the  effect  of 
electric  field  can  be  incorporated  in  the  particle  Hamiltonian  as  a  potential  term 
;reas  temperature  gradient  cannot  be  incorporated  as  the  potential  term  in  the 
niltonian  formalism.  Therefore,  in  order-to  incorporate  the  effect  of  the  temperature 
omogeneity  we  reason  as  follows.  The  Brownian  particle  comes  in  contact  with 
mtinuous  sequence  of  independent  temperature  baths  as  its  coordinate  Q  changes  in 
e.  Equivalently,  each  space  point  of  the  system  is  in  equilibrium  with  a  thermal  bath 
haracteristic  temperature  T(Q).  In  what  follows  we  accept  this  idea  and  incorporate 
perature  inhomogeneity  into  the  equations  of  motion  (5).  Henceforth,  for  nota- 
lal  simplicity,  the  coordinate  Q  and  momentum  P  are  replaced  by  the  corresponding 
er  case  letters  q  and  p,  respectively,  reserving  P  for  probability  distribution, 
'or  the  sake  of  argument  we  consider,  for  the  time  being,  the  system  to  be  subdivided 
pace  q  into  several  small  segments  and  represent  the  segments  Ag  around  q  by 
sees  i.  Each  segment  is  connected  to  an  independent  thermal  bath  at  temperature  7] 
i  corresponding  random  forces  f.t(t).  The  last  term  on  the  right  hand  side  of  (5b),  is 
•efore  replaced  by  yjrj(q)  TJ-^t)  for  the  segment  i.  As  the  two  different  segments  are 
i  coupled  to  an  independent  temperature  bath  we  have  <C/,-(0//(t'))  =  2fcB£y<5(f  —  t'). 
ause  f(t)  is  ^-correlated  in  time,  as  the  particle  evolves  dynamically  the  fluctuation 
;e  fi(t)  experienced  by  the  Brownian  particle  while  in  the  space  segment  i  at  time 
11  have  no  memory  about  the  fluctuating  force  experienced  by  it  at  some  previous 
2 1'  while  in  the  space  segment)  ^  i.  Hence  the  space-dependent  index  z  in  ft(t)  can  be 
)red.  Now,  taking  a  continuum  limit,  the  stochastic  equations  of  motion  of  the 
wnian  particle,  in  an  inhomogeneous  medium  with  space  dependent  friction  and 
uniform  temperature,  are  given  by, 

q  =  p,  (6a) 


p  =  -  V'(q)  -  n(q}p  +  Jti(q)  T(q)f(t),  (6b) 

i 

=  2kBS(t-t'}.  (6c) 


also  important  to  note  and  repeat  that  as  long  as  the  random  force  is  delta  correlated 
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in  j  (i]  -*.  j  (q,  r  j  sucn  mat  <^j(qj)j(q  ,i)>  =  ^y(q-q  )0(i-i  j  wim 

4.  Derivation  of  Kramers  and  Smoluchowski  equations 

It  is,  now,  a  straightforward  exercise  to  derive  the  corresponding  Fokker-Planck 
equation.  We  put  M  =  1  so  that  p  —  v,  the  velocity  of  the  Brownian  particle.  The 
stochastic  differential  equations  (6a)  and  (6b)  can  be  converted  into  an  equation  for 
probability  density  P(q,v,t)  using  the  well-known  van  Kampen  lemma  [16].  To  this 
end,  we  consider  a  cloud  of  initial  phase  points  of  density  p(q,  v,  t]  in  (q,  v)  phase  space 
each  point  (q,  v)  of  which  is  evolving  in  time  according  to  (6a)  and  (6b).  The  phase  fluid 
evolves  according  to  the  stochastic  Liouville  equation  (continuity  equation) 

~  =-Vg(4p)-V[,(i'p).  (7) 

In  order  to  obtain  the  equation  for  the  evolution  of  P(q,i\t)  we  ensemble  average 
«•••»  eq.  (7)  over  all  realizations  of  the  random  force  of  given  statistics  and  use  the 
well-known  result  (van  Kampen  lemma)  [16] 

<p>  =  Pte,y,r).  (8) 

The  averaging  procedure  is  carried  out  after  substituting  for  q  and  v  in  (7)  from  (6). 
A  term  like  <p/(r)>  appears  which  is  evaluated  using  the  Novikov  theorem  [17].  For 
details  see  refs  [18,  19].  From  this  we  obtain  the  desired  Fokker-Planck  equation 

dP(q,i\t)  cP(q,vJ)  .dP(q,v,t) 

-   -  =  _  v  -  --  v  (q)  -  -  - 

ot  oq  cv 

7\  ) 

(9) 


,, 

This  is  the  Kramers  equation  for  space  dependent  friction  coefficient  r\(q)  and  nonuni- 
form  temperature  T(q),  derived  from  microscopic  theory.  It  should  be  noted  that  van 
Kampen  had  assumed  (9)  as  the  model  Kramers  equation  to  start  with  to  study  the 
diffusion  of  a  Brownian  particle  in  a  ring  due  to  the  combined  effect  of  space  dependent 
friction  coefficient  and  the  temperature  inhomogeneity  [6].  Equations  (6)  and  (9)  are 
valid  for  all  friction  coefficients,  low  as  well  as  high.  It  is,  however,  hard  to  slove  (9)  in 
general  cases.  Moreover,  in  many  of  the  practical  situations  one  does  not  need  the 
detailed  motion  of  the  Brownian  particle  at  time  scales  much  smaller  than  the 
characteristic  time  scales  of  order  r\~  \  Therefore,  sometimes  it  is  unnecessary  to  retain 
the  fast  variables  v. 

In  most  of  the  problems  of  physical  interest  (overdamped  case)  the  marginal 
distribution  P(q,  r)  suffices  to  describe  the  motion  of  a  Brownian  particle.  In  the  case  of 
uniform  systems,  that  is,  when  >?  =  constant  and  T  —  constant,  the  reduction  ofP(q,  v,  t) 
to  P(q,  t)  is  well  known  and  goes  by  the  name  of  adiabatic  elimination.  One  simply  sets 
p  =  0  in  (6b)  to  obtain  the  overdamped  Langevin  equation.  From  these  one  obtains  the 
Fokker-Planck  equation  for  P(q,  t).  The  overdamped  Langevin  equation  so  obtained 
is  correct  to  order  r\~  :.  For  inhomogeneous  systems,  however,  the  integration  of  (7)  to 
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btain  the  equation  for  P<«,  r)  is  not  easy.  Moreover,  sirnph  ignoring  ihc  n  term  m  «Mv 
.not  correct.  For  instance,  the  resulting  marginal  distribution  func^ I 1 !^ 
oes  not  conform  to  the  correct  equilibrium  distribution.  Howler.  Sancho  ,.  M t ,  i 

Zt'eTw^^^^ 
system  with  space  dependent  friction  coefficient  but  at  uniform  temper  imre  The 

verdamped  Langevm  equation  obtained  by  Sancho  et  at  is  correct  to  order  Vm«,H  ? 
nd  leads  to  physically  valid  equilibrium  distribution  function 

Following  the  prescription  of  Sancho  et  al  [20],  we  obtain  the  o,  enlamred  I  anpet  ,n 
quation  for  an  mhomogeneous  system  with  nonuniform  temperature  field  Ti<it  and  is 
iven  as  ~ 


ith 


he  corresponding  Fokker-  Planck  equation  for  the  overdamped  case  (the  Smolu- 
lowski  equation),  with  kB  set  equal  to  1,  is 


As  pointed  out  earlier  by  van  Kampen,  the  diffusion  equation  s  1  1  1  in  the  absence  of 
Lternal  potential  has  neither  the  form  cP!'ct  =  (c2:cq-)D{q)Piq\.  nor  fP  t't  =  i,"  ,'qi 
(q)(d/dq}P(q}.  It  is  clear  that  T(q)  and  r\(q\  influence  the  motion  of  the  Brownian 
irticle  in  different  ways  and  their  combined  effect  cannot  be  plugged  together  as  the 
feet  of  an  effective  diffusion  coefficient  D(q).  We  note  that  our  equation  fill  agrees 
ith  one  of  the  forms  obtained  by  van  Kampen  [6-8]. 

As  already  mentioned  earlier  the  friction  coefficient  i\(q}  only  affects  the  relaxation 
•ocess  and  not  the  equilibrium  distribution  function  in  a  constant  temperature  field. 
iven  enough  time  the  system  finds  its  equilibrium  state.  In  contrast,  the  case  of 
muniform  temperature  field  changes  the  concept  of  steady  states.  It  can  be  readilv 
:rified  that  if  the  external  potential  is  unbounded  at  infinity,  i.e.,  V(q\-*  x  as^-+  ±  x, 
en  the  system  evolves  to  a  steady  state  Ps(q)  obtained  by  setting  the  probability 
irrent  equal  to  zero 


tiere  C  is  a  normalization  constant.  The  solution  in  no  way  resembles  the  distribution 
:cided  by  the  usual  Boltzmann  factor  alone.  Ps  is  not  a  local  function  of  Viql  The 
in-local  dependence  of  Ps(q)  on  T(q)  and  V(q)  forces  the  relative  stability  of  the  system 
two  different  local  minima  to  depend  sensitively  on  the  temperature  profile  along  the 
tire  pathway  connecting  the  two  minima  [2-5].  Moreover  for  particular  choices  of 
q)  it  may  so  happen  that  Ps(q]  may  show  extrema  at  positions  completely  unrelated 
the  minima  of  the  external  potential  V(q\  Such  a  system  with  nonuniform  tempera- 
re  field  is  inherently  nonequilibrium  and  Ps(q)  describes  distribution  of  nonequilib- 


5.  Conclusion 

We  have  given  a  systematic  microscopic  derivation  of  Kramers  equation  of  motion  of 
a  Brownian  particle  in  a  medium  where  friction  coefficient  is  space  dependent  and 
having  nonuniform  temperature.  We  further  obtain  the  Smoluchowski  limit  of  the 
Kramers  equation  following  the  procedure  given  by  Sancho  et  al  [20].  We  thus  arrive 
at  the  correct  overdamped  Langevin  equation  for  such  a  system.  The  microscopic 
treatment  followed  in  this  work  helps  resolve  the  controversy  regarding  the  correct 
form  of  the  diffusion  equation  followed  by  a  Brownian  particle  in  an  inhomogeneous 
medium.  We  argue  that  the  microscopic  derivation  of  the  equations  makes  their 
application  to  systems  such  as  the  thermal  ratchets  self-consistent  [10-12].  Moreover, 
in  many  cases  the  numerical  solution  of  the  Langevin  equation  is  much  more 
transparent  to  appreciate  physically,  and  the  derived  overdamped  Langevin  equation 
could  thus  be  of  some  practical  use. 
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Nonlinear  dynamics  of  a  two-dimensional  lattice 
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Abstract.  The  dynamics  of  the  nonlinear  excitations  in  a  two-dimensional  (2D)  04-diatomic 
lattice,  with  nonlinear  on-site  electron- phonon  coupling  at  the  polarizable  ion  site  has  been 
presented,  without  considering  the  self  consistent  phonon  approximation.  One  of  the  major 
results  obtained  from  our  calculations  is  in  the  understanding  of  continuous  structural  phase 
transition,  where  we  have  obtained  the  minimum  in  soft  mode  frequency  at  a  soft  mode 
temperature  T(>  TJ,  not  at  critical  temperature  T..  This  occurs  due  to  the  anisotropy  of  such 
2D  systems. 

Keywords.     Lattice  dynamics;  phase  transition;  soft  mode;  nonlinear  dynamics. 
PACS  No.     63-20 


1.  Introduction 

During  the  last  decade  a  great  deal  of  interest  has  emerged  to  understand  the  dynamics 
of  one-dimensional  diatomic  lattices  with  nonlinear  <£4-polarization  potential  at  the 
anion  site  [1,2].  The  04-polarizability  model  successfully  describes  the  displacive  type 
of  phase  transition  in  solids  and  its  dynamics  strongly  resembles  the  structural  phase 
transition  in  ferroelectrics  and  ferromagnetic  materials.  The  phenomenology  behind 
the  structural  phase  transition  (SPT)  is  that,  the  system  goes  from  a  stable  high 
temperature  phase  to  low  temperature  phase,  at  some  critical  temperature  Tc  at  which, 
the  frequency  of  a  vibrational  mode  reduces  to  zero,  a  phenomenon  well  known  as 
"mode  softening"  [3].  The  motivation  for  studying  the  weakly  coupled  two-dimen- 
sional (2D)  chains  in  the  present  paper  is  stronger  than  of  academic  interest. 

Also  the  quasi  one-dimensional  materials  e.g.,  polyacetylene  toluene  sulfonate  [4] 
and  low  dimensional  magnetic  materials  [5],  which  undergo  mode  softening  and 
structural  inhomogeneities  near  the  transition  have  drawn  considerable  interest. 
Looking  at  the  experimental  successes  in  the  identification  and  isolation  of  strongly 
nonlinear  effects  [4, 5],  it  is  time  to  develop  more  theoretical  understanding  of  the 
two-dimensional  materials.  Horovitz  et  al  [6]  for  the  first  time  investigated  the  solitary 
solutions  in  a  two-dimensional  lattice  with  inter-chain  coupling.  Static  solitary  excita- 
tions for  the  2D  lattice  having  strong  anharmonic  polarizability  at  the  anion  have  been 
obtained  by  Behnke  and  Buttner  [7].  This  type  of  lattice  serves  as  a  model  for  certain 
ferroelectric  materials.  They  found  that  the  solitary  finite  energy  excitations  are 
stable  against  small  perturbations.  Apart  from  static  solitary  excitations,  nonlinear 
periodic  solutions  are  also  obtained  with  harmonic  first  and  second  nearest  neighbour 
interactions  [7].  Kerr  and  Bishop  [8]  have  investigated  the  central  peak  phenomena  at 
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Figure  1.    A  two-dimensional  lattice  with  on-site  nonlinear  electron-phonon  coi 
ling  at  the  polarizable  ion. 


SPT  using  computer  simulation  technique  for  a  two-dimensional  anharmonic  lattii 
They  have  used  a  model  of  displacive  04-monoatomic  chain  which  is  coupled  weakly 
a  2D  array  with  nonlinear  interparticle  coupling  and  predicted  the  incomplete  mo 
softening  at  the  transition  temperature. 

In  the  present  paper  we  aim  to  investigate  the  dynamics  of  the  nonlinear  excitatic 
in  two-dimensional  displacive  diatomic  lattice  with  an  on-site  fourth  order  anharmoi 
electron-phonon  coupling  at  the  polarizable  ion  site.  Our  analytical  treatmc 
differs  from  the  previous  workers  [6-8],  in  the  sense  that  we  consider  the  nonlinear 
in  the  core-shell  coordinate  of  the  polarizable  ion  (i.e.  on-site).  We  have  also  r 
considered  the  mean  field  approximation  for  obtaining  the  results  as  reported  earl 
[6, 7].  We  have  calculated  the  frequency  and  energy  of  the  2D  anharmonic  oscillators  a 
discussed  the  results  in  the  light  of  the  motion  of  domain  walls  in  ferroelectric  materi* 


2.    Theory 

The  main  features  of  the  present  2D  lattice  with  on-site  nonlinear  electron-phon 
interactions  of  fourth  order  at  the  polarizable  ion  site  can  be  seen  from  figure  1, 
consists  of  m  linear  diatomic  chains  along  x-axis  of  the  lattice  with  a  highly  localh 
fourth  order  core-shell  coupling  at  the  anion  site  and  a  rigid  ion  at  the  cation  site.  Th 
diatomic  chains  are  coupled  by  a  linear  nearest  neighbour  shell-core  force  constant 


weak  anisotropy. 

The  displacement  of  the  core  of  the  anion  at  the  nih  position  along  the  mth  chain 
from  its  lattice  site  is  denoted  by  u'"~".  Other  displacements  of  the  cation  and  shell  of  the 
anion  are  represented  as  u'™  and  vm~"  respectively  (see  figure  1).  The  polarization 
potential  of  the  anions  in  2D  displacive  array  can  be  written  as 


1  .m,n\ 

\    > 


M.»)2  +  .1  „  _  f  f,m,n  _  tjm.,,)4    \  ^ 


where  cj2  and  y4  are  the  on-site  shell-core  quartic  (harmonic)  and  biquadratic  (anhar- 
monic)  force  constants.  Here  g2  (  <  0)  and  04,  /  (both  >  0)  are  the  parameters  describing 
the  double-  well  potential  at  (m,  n)th  site.  This  04-potential  has  a  minima  at  the 
displacements  vv"'-"  =  +  (  —  g2/f]4.)112  [1  ,  2].  The  polarization  potential  for  this  displace- 
ment is  obtained  as  (  —  <72/4#4). 
The  Hamiltonian  for  the  two-dimensional  diatomic  lattice  can  be  written  as 


m.n 


+  5  Z  U(V"1'"  ~  "21")2  +/(»"'•"  ~  U™-"-1)2 
*•  m.n 

+f'(v'"+  1-n  -  vm-")2  +f'(vm'  l'"  -  t,'"1'")2]  +  <#>p 

i=l,2    ;=1.  (2) 

In  order  to  obtain  physical  solutions  for  the  2D  anisotropic  lattice,  the  excitations 
are  assumed  to  be  propagating  slowly  in  the  x  and  y  directions.  Hence,  it  is  appropriate 
to  use  the  continuum  limit,  in  which  the  relative  shell-core  displacement  wm'" 
(  =  v'"-"  -  1/;"-")  can  be  written  as  [7] 

wm-"  =  wm(x)  =  w(y,x) 

wm±l.n  =  w«±  1(X)  =  W(y±a^X) 
wm.r,±  1  =  wm±  1  ^  ±  ^  j  =  w^  X±QI)  (3) 

Similar  solutions  are  also  assumed  for  the  displacements  u™~"  and  u™'n.  Here  a^  and 
a2  are  the  lattice  spacings  in  x  and  j;  directions  respectively.  The  relative  displacements 
of  the  neighbouring  two  polarizable  ions  can  be  expanded  for  small  lattice  spacing 
«1  and  a2  in  Taylor's  series  as  [7] 

w(y,  x±al)  =  w(y,  x)  ±  a^x(y,  x)  +  ~-wxx(y,  x)  (4) 

a2 
w(y  ±  a2,x)  =  w(y,x)  ±  azwy(y,x)  +  ~wyy(y,x]  (5) 

Pramana  -  J.  Phys.,  Vol.  45,  No.  5,  November  1995  379 


where  wv(>',x)  =  (dw(y,x))/dx  and  vv^J^x)  =  (d2  w(y,x))/dx2  are  the  first  and  second 
derivatives  with  respect  to  x.  Similar  expressions  can  also  be  obtained  for  y  direction. 
Using  the  adiabatic  approximation  (mej  ~  0)  and  the  method  described  in  [2],  we 
have  obtained  a  coupled  nonlinear  equation  of  motion  in  terms  of  relative  core-shell 
coordinate  of  the  anion  as 

[2/  +  02  +  3  tf4(M'm-")2]  vv'"-"  +  [02  +  3  (i  4  (  W"  +  '  -w)2]  wm  +  l  '" 
+  [02  +  304(vvM-  l-")2]w'"-  '•"  +  6#4[(vv'"  +  '-")2wm+  1-" 

+  (W'"-1-")2W"'-  '•"  +  (vi'm-")2H/"-"]  +  -[02  IV"1-"  +  t/4(w'"-")3] 

+  —  [02wm+  '•"  +  04(wIB+  1-11)3]  +  —  [02  vv"'~  '•"  +  04(wM~  1-")3]  =  o    (6) 

with 

i__L    JL 

/i      m  ,      m2 
and 


+  04[(wm+1-B)3  +  (ww-1-11)3].  (7) 

In  this  paper  we  have  considered  the  solutions  for  those  nonlinear  excitations  which 
are  uniformly  varying  along  y  direction  i.e.,  vvy(  =  dvv/dv)  is  constant,  and  hence  the 
second  derivative  vvvv  (  =  d2vv/d>'2)  reduces  to  zero.  In  view  of  this  fact,  the  relative 
core-shell  displacement  depends  linearly  along  y  direction  (i.e.,  w  =  cy).  By  using  (3),  (4), 
(5)  and  the  above  assumptions,  the  second  order  differential  equation  of  motion  (eq  (6)) 
reduces  to 


2  f 

-/?'a2vv2)vi'  +  2/?'w2vv  +  - 


(a  +  j5'w2  +-/?'a2vv2)vi'  +  2/?'w2vv  +  -  (g2\v 


fl5wjw  =  0  (8) 

-**  .3        //J  -j 

where 

a  =  2f  +  g2     and    /?'  =  904. 

The  method  of  solving  such  types  of  differential  equation  has  been  given  in  detail  in 
[2].  The  first  integration  of  (8)  using  boundary  conditions  results  into 


)      -4—  — |—   —  /7     \\) 

P      3   2    -v 
where 

~3/a 
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C0  and  A  are  the  constants  of  integration  and  are  defined  in  terms  of  initial  root  of  the 
displacement  vv0.  The  frequency  is  obtained  from  (9)  by  integrating  it  numerically  and 
made  temperature  dependent  by  using  the  formula 


v0. 
made  temperature  dependent  by  using  the  formula 


The  energy  of  the  system  at  the  turning  point  (w'"'"  =  ±  w0)  can  be  obtained  as 
£  =  —  l(uf"  -  vm'")2  +  (ufn~  '  -  uw-")2] 

-I  --  -  [(„«  +  1  •«  _  y"'-»)2  _|_  (j/»  -  1  -«  _  vm*)2  -j 

+  ^2(wm'")2  +  \^(wm'")4.  (12) 

The  energy  expression  is  simplified  by  using  (3),  (4),  (5)  and  (7).  The  energy  is  obtained 
as  a  sixth  order  polynomial  in  w0,  and  gives  an  asymmetric  double-  well  with  respect  to 
displacement  with  a  central  hump  shifted  towards  the  less  equilibrium  state,  as 
discussed  in  the  next  section. 

3.    Results  and  discussion 

The  values  of  the  parameters  are  chosen  as  £/2A/4.  =  ~  0'2>  <*//?'  =  0'2,  H/m2  =  O'l  anc^ 
a2wy  =  002  in  order  to  get  a  double-well  structure  of  the  potential,  so  that  the  scope  of 
the  present  analysis  includes  simple  ferroelectric  systems.  The  force  constants  /, 
02  (  <  0)  and/'  are  in  the  unit  of  104  g/s2  and  g4(  >  0)  is  in  the  unit  of  1  O22  g/(cm2  s2).  The 
above  values  of  the  parameters  are  chosen,  keeping  in  mind  the  conditions  \g2\  «  <?4 
and/  <  |02  1  to  be  obeyed  for  a  double-well  potential.  The  parameter  ^/m2  is  a  dimen- 
sionless  quantity.  The  reduced  mass  ^  is  taken  to  be  small  as  compared  to  the 
unpolarizable  ion  at  (w,  n)th  site  since  the  unpolarizable  ion  has  been  assumed  to  be 
heavier  and  has  less  displacement  as  compared  to  the  polarizable  ion.  The  parameter 
a2wy  is  in  unit  of  lattice  constant.  The  small  value  of  the  parameter  a2wy  is  sufficient 
enough  to  visualize  the  dynamics  of  the  anisotropic  two-dimensional  array. 

The  energy  of  the  two-dimensional  (2D)  lattice  as  a  function  of  initial  displacement 
w0  is  shown  in  figure  2.  An  asymmetric  double-  well  structure  of  the  energy  is  obtained 
with  a  central  hump  shifted  to  one  side  and  crosses  the  displacement  axis  at  two 
different  points.  This  fact  decides  the  appearance  of  two  characteristic  temperatures, 
the  critical  temperature  Tc  and  soft  mode  temperature  Ts  in  an  anisotropic  two- 
dimensional  lattice,  similar  to  the  one  already  shown  by  Bishop  [8]  who  has  carried  out 
molecular  dynamics  of  weakly  coupled  2D  lattice. 

In  figure  3  we  have  plotted  the  square  of  the  soft  mode  frequency  as  a  function  of 
dimensionless  temperature  (NKBT/(gl/g4)).  From  this  figure,  it  is  evident  that  the 
minimum  in  the  soft  mode  frequency  does  not  occur  at  the  transition  temperature  Tc, 
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Figure  2.    Energy  of  the  two-dimensional  lattice  is  plotted  as  a  function  of  initial 
displacement  VVQ. 


but  at  a  temperature  slightly  greater  than  Tc,  called  soft  mode  temperature  Ts.  For 
temperatures  much  above  Ts,  the  oscillations  are  completely  across  the  asymmetric 
double-well  resulting  into  an  increase  in  frequency.  In  the  temperature  regime  T  <  T^ 
the  particle  can  oscillate  in  either  of  the  wells  depending  upon  the  equilibrium  energy 
state  (see  figure  2).  At  very  low  temperature,  the  particle  oscillates  in  either  of  the  wells 
and  remain  localized  leading  to  harmonic  oscillation  in  this  temperature  regime 
(0  <  T<  Tc).  Instability  in  the  frequency  at  Tc  is  obtained  because  of  the  reduction  of 
the  domain  wall  separation  between  the  para-  and  ferro-electric  phases,  which  increase 
the  electron-phonon  interactions  in  the  2D  system.  Complete  minimum  does  not  occur 
at  Tc  as  there  remains  some  localized  region  in  one  of  the  equilibrium  states.  As  a  result, 
the  soft  mode  frequency  increases  for  a  while  with  displacement  and  then  reduces  to  its 
minimum  at  Ts.  The  reduction  of  frequency  at  Ts  is  very  sharp  which  indicates  that  one 
will  obtain  a  singularity  in  the  physical  properties  at  Ts  (and  not  at  Tc)  for  such  a  weakly 
coupled  two-dimensional  systems. 
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Figure  3.    Square  of  the  frequency  of  the  2D  lattice  versus  dimensionless  tempera- 
ture (NKB 


Similar  modelling  of  two-dimensional  lattice  with  different  types  of  ions  at  the 
alternate  positions  in  the  same  column  can  be  made,  which  leads  to  a  somewhat 
simplified  picture  as  the  force  constants  between  the  shell  of  polarizable  ion  and  core  of 
cation  along  the  chain  and  perpendicular  to  the  chain  will  become  nearly  the  same.  This 
type  of  modelling  of  2D  lattice  leads  to  only  one  transition  temperature  Tc,  as  the 
equation  of  motion  is  isotropic  with  respect  to  core-shell  displacement  of  polarizable 
ion  and  hence  no  mode  softening  temperature  Ts. 

Our  results  have  indicated  that  anisotropic  2D  lattice  (figure  1)  shows  interesting 
behaviour  at  the  structural  phase  transition.  These  types  of  studies  will  help  in  the 
understanding  of  many  recent  experiments  on  the  structural  phase  transition  of 
hydrogen  bonded  macro-molecules  [9]  and  quasi  one-dimensional  materials  [10]. 
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Abstract.  A  consistent  and  unified  microscopic  theory  of  superfluidity  and  superconductivity  is 
developed  on  the  basis  of  two-stage  Fermi-Bose-liquid  (FBL)  (in  particular  case,  one-stage 
Bose-liquid)  scenarios.  It  is  shown  that  these  phase  transition  scenarios  is  accompanied,  as  a  rule, 
by  the  formation  of  composite  bosons  (Cooper  pair  and  bipolarons)  with  their  subsequent  single 
particle  (SPC)  and  pair  condensation  (PC).  A  brief  outline  of  the  modified  and  generalized 
BCS-like  pairing  theory  of  fermions  is  presented.  In  an  analogy  to  that,  a  detailed  boson  pairing 
theory  is  developed.  The  SPC  and  PC  features  of  an  attracting  3d-  and  2d-BG  as  a  function  of 
the  interboson  coupling  constant  in  the  complete  range  0^  T^  TB  is  studied  in  detail.  It  is 
argued  that  the  coexistence  of  the  order  parameters  of  attracting  fermions  AF  and  bosons  AB 
leads  to  the  superfluidity  (in  3He)  and  superconductivity  (in  superconductors)  by  two  FBL 
scenarios.  One  of  these  scenarios  is  realized  in  the  so-called  fermion  superconductors  (FSC)  and 
the  other  in  the  boson  superconductors  (BSC)  in  which  the  gapless  superconductivity  is  caused 
by  the  absence  of  the  gap  ASF  in  the  excitation  spectrum  of  bosons  and  not  by  the  presence  of 
point  or  line  nodes  of  the  BCS-like  gap  AF .  The  new  adequate  definitions  for  basic  superconduct- 
ing parameters  of  FSC  and  BSC  are  given.  The  theory  proposed  is  consistent  with  the 
experimental  data  available. 

Keywords.  Microscopic  theory  of  superfluidity  and  superconductivity;  Fermi-Bose-liquid; 
composite  bosons;  single  particle;  pair  condensation;  fermion  superconductors. 

PACSNo.    74-0 

1.  Introduction 

At  present,  investigation  of  superfluidity  and  superconductivity  takes  much  attention 
as  this  problem  joins  the  broad  class  of  phenomena  observed  in  different  condensed 
matters,  such  as,  for  example,  4He,  3He  liquids,  nuclears,  low  temperature  (LTSC), 
heavy-ferrnion  (HFSC),  organic  (OSC)  and  high-temperature  (HTSC)  superconductors' 
and  other  systems  [1-10].  For  understanding  these  phenomena,  numerous  theoretical 
models  [2-22]  have  been  proposed.  Nevertheless,  the  adequate  theoretical  scheme 
explaining  the  observations  both  in  superfluid  (SF)  4He,  3He  and  in  superconducting 
(SC)  materials  has  not  been  done  as  yet.  For  example,  none  of  the  existing  theoretical 
models  of  superconductivity  does  not  consider,  complete  scenarios  of  SC  transition, 
starting  from  the  Fermi-gas  state,  passing  through  the  BCS-like  ground  state  of  Fermi- 
liquid  and  leading  to  the  formation  of  unstable  or  stable  presuperconducting  (or 
precursor)  state  of  Bose-gas  (BG)  and  then  to  the  true  SC  state  of  Bose-liquid.  Usually, 
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main  attention  is  given  to  one  of  the  intermediate  (but  not  SC)  states,  namely,  to  exf 
the  electron  pairing  effect  by  means  of  phonon,  exciton,  plasmon,  magnon,  polaron 
other  excitations  exchange,  as  well  as  to  use  a  conception  of  the  Bose-Ein: 
condensation  (EEC)  of  ideal  cooperons  or  bipolarons  [6-9].  However,  the  pairir 
electrons  is  only  necessary  but  not  a  sufficient  condition  for  appearance  of  su 
conductivity  (see  also  [8]).  Therefore,  many  observed  unusual  properties  of  L/i 
HFSC,  OSC,  HTSC  and  3He  [3,23-42]  are  difficult  to  be  explained  on  the  bas 
BCS-like  pairing  theory.  Such  inexplicable  facts  by  BCS-like  theory  are:  (i)  non- 
density  of  states  (DOS)  is  the  inside  of  BCS-like  gap  and  a  peak  of  DOS  is  its  oui 
[3,23-28];  (ii)  anomalous  upward  increasing  of  the  lower  critical  magnetic  field 
[29,30],  the  critical  current  7C  [31],  sound  velocity  [32]  and  the  SC  order  paran 
[33,34]  in  HTSC  and  HFSC  at  T~  (0,6  -0,7)  Tc;  (iii)  the  existence  of  the  1 
pseudo-SC  gap  in  HTSC  not  only  below  Tc,  but  also  above  Tc  [24, 35-37];  (iv 
existence  of  the  specific  heat  jumps,  both  below  Tc  and  above  Tc  in  HTSC  and  H 
[29,38-40];  (v)  gapless  superconductivity  [3-9];  (vi)  sharp  increasing  of  the  cri 
current  velocity  and  superfiuid  density  and  jump-like  changes  of  other  phy 
parameters  at  T  ~  (0, 6  -f-  0, 7)  Tc  as  well  as  the  origin  A  ->•  B  transition  and  half-intc 
quantization  of  vorticity  in  3He  [2,41-44],  etc.  These  experimental  facts  obse 
in  HTSC  cannot  be  explained  on  the  basis  of  another  highly  attractive  resora 
valence  bond  (RVB)  models  [6-9,29,45]  and  bipolaronic  models  [6-9,32,46]. 
RVB  model  based  on  the  hypothetical  spinon-holon  phenomenology  claims 
the  normal  state  is  not  a  Fermi-liquid  and  that  the  BCS-like  approaches  are 
applicable.  These  non-Fermi-liquid  models  are  not  as  yet  formulated  in  enough  d 
to  predict  photoemission  curve  [6,9,26]  as  well  as  some  above  unusual  properti 
HTSC.  Indeed,  recent  experimental  data  [6, 9, 47]  has  confirmed  the  definite  exist 
of  a  Fermi  surface  in  HTSC.  However,  the  RVB  model  represents  indepen 
theoretical  interest,  from  the  standpoint  gauge  field  theory  and  elaboration  of  1 
theory  in  low-dimensional  systems  is  given  much  attention  [6,45,48].  In  the  exii 
bipolaronic  and  bisolitonic  models  [6, 8, 32, 46]  as  well  as  in  some  variants  of  RVB  rm 
(see  [6,8]),  the  SF  state  of  bipolarons,  bisolitons  and  holons  is  idenl 
with  the  BEC  state  of  an  ideal  BG  which  according  to  the  Landau  criterion  is 
SF  (see  also  [12]).  So,  the  nature  of  the  SF  state  of  the  electron  liquid  in  all  supercondui 
still  remains  unknown.  While  the  existing  theoretical  models  of  a  Bose-liquid  superflu 
[12-19,22],  firstly,  cannot  explain  a  number  of  SF  properties  of  4He  and  seco: 
contradicts  each  other  (see  below),  i.e.  the  theory  of  superfluidity  is  not  suffici* 
elaborated  as  yet.  Further,  separate  consideration  of  related  phenomena  oi 
superfluidity  and  superconductivity  within  the  different  approaches  is  quite  unfoui 
and  does  not  allow  to  describe  these  phenomena  from  a  unified  position. 

So,  there  are  many  reasons  not  to  be  locked  on  the  existing  BCS-like,  bipolarc 
RVB-schemes  and  to  consider  the  general  case.  In  this  respect,  a  novel  two-s 
microscopic  model  of  superfluidity  and  superconductivity  proposed  in  [49-53]  n 
sents  one  of  the  possible  alternative.  According  to  this  model,  the  phase  transition  t 
state  is  realized  by  two  possible  Fermi-Bose-liquid  (FBL)  scenarios  in  contrast  t< 
above  discussed  one-stage  Fermi-liquid  and  Bose-liquid  or  BG  ones.  In  this,  a  i 
realistic  model  of  superconductors,  the  composite  bosons  (e.g.  Cooper  pairs 
bipolarons)  like  non-composite  ones  (e.g.  4He  atoms  and  spinless  holons  [12-21]) 
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have  any  values  centre  of  mass  impulse  and  undergo  BEC  of  an  ideal  BG,  single  particle 
condensation  (SPC)  of  a  repulsive  BG  and  SPC  and  pair  condensation  (PC)  of  an 
attracting  BG.  Existence  of  the  SF  or  SC  state  of  composite  bosons,  then,  depends  on 
two  types  of  excitations,  one  of  them  characterizes  the  excitation  of  Fermi  components 
of  such  bosons  and  another  as  a  single  whole.  The  first  of  these  excitations  may  be 
considered  within  the  BCS-like  model  of  a  Fermi-liquid  and  the  second  one  on  the  basis 
of  SPC  and  PC  model  of  an  attracting  BG  (or  Bose-liquid)  [52,53].  Unlike  the 
excitation  spectrum  of  a  Fermi-liquid  with  single  BCS-like  energy  gap  AF  in  that  of 
a  Bose-liquid,  there  are  two  energy  gaps  AB  (order  parameter  bosons)  and  ASF  (energy 
gap  in  the  excitation  spectrum  of  an  attractive  BG).  In  the  present  paper,  an  attempt  has 
been  made  to  construct  a  consistent  microscopic  theory  of  a  novel  two-stage  FBL 
scenarios  of  superfluidity  and  superconductivity  as  a  combined  theory  of  Fermi-  and 
Bose-liquid.  It  will  be  shown  that,  the  SPC  and  PC  of  an  attracting  composite  bosons 
and  not  their  formation  by  BCS-like  pairing  of  fermions,  plays  a  crucial  role  in  the 
superfluidity  and  superconductivity  in  Fermi  systems. 

This  paper  is  organized  in  the  following  manner.  In  §2,  we  begin  by  short  reviewing 
only  the  outline  of  the  real-space  pairing  mechanisms  of  charge  carriers  leading  to  the 
formation  of  different  bipolaronic  states  in  various  compounds  (more  detailed  con- 
sideration will  be  reported  elsewhere).  In  §  3,  we  consider  the  BCS-like  pairing  scenario 
of  different  interacting  polarons  in  /c-space  leading  to  the  formation  coherent  or  ground 
state  of  Fermi-liquid.  In  §  4,  at  the  beginning,  we  review  the  existing  BG  and  Bose-liquid 
SF  theories  and  then,  we  investigate  in  detail  the  ground  state  of  a  Bose-liquid  on  the 
basis  of  the  concepts  SPC  and  PC  of  an  attracting  two  (2d)-  and  three  (3d)-dimensional 
BG  as  the  basis  of  their  superfluidity.  In  particular,  we  determine  the  basic  parameters 
characterizing  these  Bose-liquid.  The  calculation  of  the  specific  heat  of  2d-  and 
3d-Bose-liquid  on  the  basis  of  the  proposed  theory  SPC  and  PC  of  an  attracting  BG  is 
given  in  §  5.  The  discussion  of  the  results  and  their  validity  and  applicability  degree  for 
the  adequate  description  of  different  experimental  data  in  SF  4He,  3He  and  SC 
materials  is  the  topic  of  §6  and  a  summary  is  given  in  §7.  In  the  appendix,  we  give  the 
basic  calculations  of  the  analytical  expressions  used  in  the  text  of  the  paper. 

2.  The  real-space  pairing  mechanism  of  charge  carriers 

As  is  generally  known,  the  ground  state  of  electrons,  holes  and  excitons  in  solids  is  their 
self-trapped  (S)  state  that  are  discovered  in  many  classes  of  substances,  such  as,  alkali 
halides,  rare-gas  solids,  oxides,  organic  molecular  crystals  and  semiconductors 
[54, 55].  In  [56]  we  have  shown  the  existence  (coexistence)  possibility  of  four  type  (i.e. 
delocalized  (D)  and  three  S  (continuum,  atomic  and  molecular))  states  of  these 
quasiparticles  in  solids.  According  to  [55-57]  the  holes  and  electrons  in  ionic  com- 
pounds (to  that  are  related  HTSC  also)  are  polarons  not  only  in  S-  but  also  in  D-states, 
i.e.  they  are  dressed  by  deformation-polarization  cloud  or  polaron-bag  forming  at 
static  lattice  deformations.  In  metals  and  some  nonmetallic  solids  there  is  no  optical 
phonons.  There  holes  and  electrons  are  polarons  only  in  S-states  and  in  D-states  (to  be 
precise  we  call  it  free  (F))  and  they  are  dressed  by  phonon  (or  dynamical  polarization) 
cloud  or  bag.  Further,  the  formation  of  the  D-  and  S-states  of  bipolarons  is  quite 
possible  in  nonsuperconducting  and  superconducting  materials  at  the  interaction  of 
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the  identical  and  different  polarons  by  polaron  and  phonon-bag  exchanges. 
example,  the  existence  of  the  bipolarons  localized  on  the  neighbour  lattice  site; 
observed  in  Ti4O7,  (Ti,  _,VJt)4O7,  NaxV,O5  and  other  materials  (see  [32]). 
anomalous  characteristics  of  a  number  of  compounds,  such  as  Nb3Sn,  V3Ga, 
PbMOftSo  and  BaBiYPb,  _YO,  is  considered  also  as  the  indication  to  the  exister 

O         D  A  i         A  J 

some  above-named  polarons  and  bipolarons  [32].  So,  in  HTSC,  the  charge  carrie 
most  probably  D-  and  different  S-bipolarons  (two  of  them,  atomic  and  mole 
bipolarons,  forming  in  oxygen  sublattice).  Indeed,  the  experimental  data  [5£ 
indicate  that  two  holes  in  HTSC  are  localized  on  ligands  (i.e.  on  oxygen  ions)  and 
for  example,  atomic  bipolarons  of  O(ArN  ~  2)  ~  (N  =  4  or  6)  type  or  molecular  bipol; 
of  charged  dimer  O^""  type  [60].  The  formation  possibility  of  the  D-  (or  large  poli 
(bipolarons)  in  the  context  [46])  and  continuum  S-polarons  (bipolarons)  m< 
considered  in  the  framework  of  the  continuum  model  [56,57]  and  atomic 
molecular  S-polarons  (bipolarons),  in  the  framework  of  a  quasi-molecular  model 
First  of  these  approach  is  suitable  for  the  quantitative  determination  of  the  bii 
energies  of  D-  and  S-bipolarons  by  the  difference 


where  EP  and  EB  is  the  polaron  and  bipolaron  energy,  respectively.  Us 
|  t/c  J~  0,01  -f-0,30  and  ~0,  1  -r-leV  for  D-  and  S-bipolarons,  respectively.  " 
real-space  pairing  scenario  describes  the  attractive  interaction  of  two  sepa 
polarons.  It  corresponds  to  low  polaron  concentration  or  the  narrow  polaronic  i 
In  this  case  'bandlike'  motion  of  polarons  at  the  low  temperature  is  unlikely  anc 
can  form  separated  bipolarons  which  are  immobile  also  (these  real-space  bipoi 
may  be  considered  as  an  ideal  BG).  So  the  real-space  pairing  scenario  is  not  appli 
to  study  the  collective  (or  coherent)  properties  of  the  interaction  polarons 
bipolarons.  However,  with  increasing  of  polarons  concentration  (or  with  decreas 
the  distance  between  polarons)  their  deformation  field  begins  to  strongly  overlap  i 
is  accompanied  by  the  decreasing  of  the  lattice  deformation  degree,  \UCn\  an 
broadening  of  the  polaronic  band.  In  such  situations,  due  to  the  modulation  of  the 
parameters  of  polarons  (caused  by  their  collective  effects)  the  real-space  approxim 
[32,46]  is  not  suitable  for  the  description  of  their  pairing.  Then  the  BCS-like  k- 
attractive  pairing  between  polarons  in  extended  spatial  states  is  more  suitable  f< 
study  of  the  formation  of  both  large  D-  and  small  S-bipolarons  (e.g.  Schafroth 
(see  also  [9,32]).  In  the  present  case,  the  D-  and  different  S-bipolarons  doe 
qualitatively  differ  from  the  Cooper  pairs.  These  bipolarons  and  Cooper  pairs  (t 
dynamic  bipolarons)  can  be  regarded  as  interacting  composite  bosons  (othe 
cussions  are  given  in  §6-4).  So,  in  superconductors  (including  3He)  existence  (co 
ence)  of  the  different  types  of  unpaired  and  paired  Fermi  particle  is  quite  possible 
multicomponent  FBL  [49-53].  For  simplicity,  we  neglect  interband  fermion-1 
interaction  in  such  mixed  FBL  assuming  that  the  important  mechanism  responsit 
superconductivity  is  the  intraband  fermion-fermion  and  boson-boson  attractive 
action.  Indeed,  fermion-composite  boson  interaction  due  to  exclusion  principle 
probably  leads  again  to  the  formation  of  a  Fermi  and  composite  Bose  particle.  Th 
have  dealings  with  two  independent  subsystems,  where  both  fermionic  NF  and  be 
NQ  numbers  are  separately  conserved. 
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The  Hamiltonian  of  the  multi-component  Fermi-gas  system  with  the  pair  interaction 
between  particles  has  the  form  [43, 52] 

H=  £  &(k)-LiFJalkaanka  +  --  X    ^(^XftX-u"/-fc'i%'T      (3-1) 

nka  2-^L  nk,lk' 

where  e(/c)  =  h2k2/2mFn,  mFn  and  ^p,,  is  the  mass  and  chemical  potential  of  the  nth  type 
of  polarons,  respectively,  alka(anka)  is  the  creation  (annihilation)  operators  of  these 
polarons  in  state  |/?/CCT>,  k  and  a  their  wave  vector  and  spin  indices,  respectively, 
Vvnl(k,  k')  is  the  pair  interaction  potential  (which  has  both  an  attractive  and  a  repulsive 
part)  between  nth  and  /th  type  polarons,  Q  is  the  volume  of  the  system,  n, 
1  =  {1,2, ..., v},v<4.  The  Hamiltonian  (3.1)  is  diagonalized  by  the  standard 
Bogolubov  transformation  of  Fermi  operators.  Then  in  the  excitation  spectrum 


there  are  n  energy  gaps  determined  from 


Am-        V  v*(k  /;''thn  nn 

AF,,(K)  -  -  5  L  vai(^  k  )  p    ,,.M  th     9T  (3-2) 

"*•,/  %((M         2/ 

where  £F,,(k)  is  the  energy  of  nth  type  polarons  measured  relative  to  ^Ffl  and  a  repulsive 
Hartree-Fock  potential  (which  can  be  incorporated  and  denoted  as  /tFn)  (see  also  [61]  ). 
For  simplicity,  we  consider  the  case  V*n(k,  k')=£Q  and  K^(/o,  k')  =  0  (n  ^  /).  Further,  we 
use  usual  BCS-like  approximation  for  V^n(k,  k')  [12,  62] 

ln  ~    Vln       if   «(*)>       *(*'  )  <  £A,,  =  I  ^O,  I  +  fo»D 

L  if  £A|I  <  £(k),     fi(fc')  <  £R,,  (3.3) 

0  if  e(/c)     or    e(fc')>£Rfl 

where  EAn  and  £Rn  is  the  cutoff  parameters  for  hybrid  attractive  V^,  (which  has  both  the 
phonon-  and  polaron-bag  attracting  part)  and  a  repulsive  V£n  part  of  the  potential 
V*n(k,k'),  respectively,  in  nth  polaronic  band,  £An«£Rn.  Then  we  determine  the 
disappearance  temperature  of  nth  type  bound  polaron  pairs  TFtt  and  the  ratio 
9Pl,  =  2AF,,/TF,,  as 

TFn  *  1,  14(|  L/CJ  +  fooD)exp(-  1/KF|IDFH)  (3.4) 

and 

^F.  =  3,52  (3.5) 

when  FFn=  KAN-  !/*„,  K*  ,  =  KR,,/[1  +  DF;  VRn  In  (£RlI/£Aa  )]  and  DFn  is  the  effective 
interaction  potential  and  density  of  states  in  the  nth  polaronic  band,  respectively,  ha>D 
is  the  Debye  energy.  Here  the  combined  polaron-  and  phonon-bag  mediated  processes 
are  taken  into  account  like  it  is  done  in  case  of  the  combined  electronic  (lochon)-  and 
phonon-mediated  ones  in  [63,  64].  While  the  expression  of  the  cutoff  energy  obtained 
within  the  spin-  and  correlation-bag  approaches  [65,  66]  are  also  similar  to  the  results 
[63,64].  So  our  and  above-named  approaches  are  modified  variants  of  the  basic  BCS 
theory.  In  the  absence  of  the  static  lattice  deformation  (that  takes  place  in  many  LTSC) 
|  uCn\  =  0  and  we  have  dealings  with  pure  BCS  picture.  In  general,  pure  BCS  pairing  is 
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[67]),  the  cutoff  energy  in  (3.3)  would  be  replaced  by  Efn  (see  [68])  or  width  c 
polaronic  band  [9].  In  case  0  ^  V$H  <  VM  takes  place,  0  ^  VPnDFn  ^  1  and  the  & 
modified  BCS-like  pairing  theory  of  polarons  is  quite  applicable. 

4.  Ground  state  of  Bose-Iiquid 

Now  we  consider  the  SF  state  of  a  Bose-liquid  that  represents  interaction  of 
composite  bosons  (e.g.  Cooper  pairs  and  bipolarons)  and  non-composite  ones  su 
4He  atoms  (or  hypothetical  holons  in  RVB  models).  A  brief  survey  of  the  ex 
theories  of  SF  Bose  systems  is  made.  The  BEC  of  an  ideal  BG  cannot  serve  as  a  n 
for  a  study  of  SF  properties  of  4He  (see  also  [12])  and  SC  properties  of  electronic  li 
in  superconductors.  Another  most  often  discussed  model  of  superfluidity  is  basi 
the  SPC  of  a  repulsive  BG,  theory  of  which  was  developed  by  Bogolubov 
However,  the  validity  of  such  theory  of  superfluidity  is  not  less  controversial  [1^ 
As  the  theory  of  a  repulsive  BG  superfluidity  (or  c-number  of  condensate  theory 
based  on  the  concept  of  SPC  of  bosons  cannot  explain  a  number  of  SF  proper! 
4He,  such  as,  the  observed  half-integral  values  of  circulation  (see  [15]),  the  obs 
4He  specific  heat  departure  from  the  phonon-like  dependence  (see  [16])  an 
depletion  of  the  zero-momentum  state  (some  internal  inconsistencies  of  the  repi 
BG  theory  was  discussed  in  [14, 69]).  In  this  respect,  a  more  consistent  approach 
attractive  BG  model  of  superfluidity  proposed  in  [14].  The  theory  of  superfli 
developed  within  such  model  in  [17,18]  is  based  only  on  the  concept  of  PC 
attracting  BG.  The  SPC  and  PC  possibility  of  a  3d-BG  in  case  of  T  =  0  and 
boson-boson  attraction  was  studied  in  [19].  Recently,  the  attractive  BG  model  ad 
to  2d-boson-like  holon  gas  have  been  discussed  in  [20, 21],  where  however,  only 
assumed  of  such  exotic  bosons  and  their  SPC  possibility  at  T  =  0  remains  unsti 
At  present,  the  situation  is  very  much  controversial  and  confusing  for  attractive 
systems,  which  increase  the  number  of  unsettled  problems.  So  the  basic  proper! 
a  SF  Bose-liquid  is  still  not  established  as  a  function  of  the  interboson  inters 
coupling  constant  in  the  complete  temperature  range  of  0  ^  T  ^  TB  (where  TB 
attracting  BG  condensation  temperature).  For  example,  such  unstated  proper! 
a  SF  Bose-liquid  are:  (i)  specific  features  of  the  SPC  and  PC  of  an  attracting  B 
nature  of  the  order  parameter  AB  and  gap  ASF;  (iii)  depletion  of  the  zero-mome 
state  of  condensed  bosons  (i.e.  Bose-Einstein-like  single  particle  condensate  i 
context  and  not  unphysical  Bose-Einstein-like  pair  condensate  considered  by  1 
etal  [14,17,22]  and  others  [18])  and  complete  depletion  of  this  state  on 
temperature  interval  below  TB(=  T;  for  4He);  (iv)  specific  heat  pecularity  in  tl 
state  (e.g.  at  T*  <  TB(=  TA  in  4He  and  =  Tc  in  3He,  HFSC  and  HTSQ),  etc.  Fi 
there  are  highly  conflicting  opinions  about  existence  (absence)  of  a  gap  in  the  excii 
spectrum  of  a  nonideal  BG.  For  example,  according  to  [14, 17, 1 8]  there  is  no  gap 
excitation  spectrum  of  an  attracting  BG.  On  the  contrary,  some  Bogolubo 
c-number  condensate  theory  [18]  (see  also  [16])  predicts  the  existence  of  a  gap 
excitation  spectrum  of  a  repulsive  BG  in  the  zero-momentum  state,  while  such 
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was  not  evidently  observed.  On  the  other  hand,  in  several  schemes  of  this  theory,  the 
excitation  spectrum  of  a  repulsive  BG  is  gapless.  The  existence  of  a  finite  gap  in  the 
excitations  spectrum  of  an  attracting  BG  in  its  pair  condensate  state  was  predicted  in 
[19]  for  T  =  0.  From  the  above-stated,  it  follows  that  the  nature  and  structure  of  the  SF 
state  of  a  Bose-liquids  is  still  far  from  being  elucidated  and  the  existing  theoretical 
schemes  is  insufficient  for  understanding  all  SF  properties  of  4He  (to  say  nothing  of  3  He 
for  the  description  of  that  in  the  BCS  model  is  not  quite  adequate).  So,  up  to  now,  there 
is  no  consistent  and  adequate  microscopic  theory  of  SF  Bose-liquid  describing  the 
available  experimental  data  in  4He  (including  3He  and  superconductors)  from  a  unified 
position. 

4.1   The  Hamiltonian  model 

The  Hamiltonian  of  the  multi-component  BG  system  with  the  pair  interaction  (which 
has  both  an  attractive  and  repulsive  part)  between  particles  has  the  form  (3.1)  only  by 
the  absence  of  spin  indices.  The  linearized  part  of  this  Hamiltonian  (i.e.  Hamiltonian 
model)  for  one  component  of  BG  may  be  written  in  the  form  (see  also  [13]) 

H  =  I  k(*)iift  +  :UB(fc)(cVl  +  ckc_k)~  (4.1) 


where  e.(k)  =  s(k)  -  /IB  +  VB(0)pB  +  xB(fc),  r.(k)  =  h2k2/2mB  (here  and  further,  for  con- 
venience, we  omit  the  index  in  <>(£)),  AB(/o)  =  (l/Q)Ek,  VK(k  —  k')(c_k.ck.y, 
^B(/c)  =  (l/Q)Efc,KB(/c-/c')nB(/c'),nH(/c)  =  <4afcX  pB  =  (l/n)ZtnB(*),mB-the  mass  of 
Bose  particles,  /(B  -  the  chemical  potential  of  BG,  c'l(ck)  -  the  creation  (annihilation) 
operators  of  bosons  with  the  wave  vector  k,  VK(k  —k'}-  the  interboson  interaction 
potential.  The  Hamiltonian  (4.1)  is  diagonalized  by  the  Bogolubov  transformations  of 
Bose  operators  and  the  excitation  spectrum  of  bosons  has  the  form 


£„(*)  =  >AB(*)-Ai(*),  (4-2) 

where  the  values  of  AB(/c),  ^UB  and  #B(/c)  are  determined  from  simultaneous  equations 


2£0(/c')         2T 


*B(*')   ..,£B(*')      .  ,  (43) 


2EB(/c')          2T 

by  means  of  their  self-consistent  solution. 

At  EB(0)  =  0  or  fj.p+  KB(0)pB  +  Xe(0)  =  I^B|  =  |AB(0)|  t*16  terms  of  the  sum  (4.3) 
with  k,  k'  =  0  should  be  considered  separately  according  to  the  procedure  proposed 
in  [14].  Now,  in  order  to  find  the  solution  of  (4.3)  the  interboson  interaction  potential 
may  be  approximated  in  the  simple  separable  form  as  is  done  in  the  theory  of 
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superconductivity  [12] 


0  if 


<=BR 


where  £BA  and  CBR  are  the  cutoff  energies  for  attractive  and  repulsive  parts  of  ti 
(/c  —  /c'),  respectively.  This  approximation  allows  us  to  carry  out  the  calcul 
thoroughly  and  so  it  gives  us  a  new  insight  to  the  problem  of  condensation  (i.e.  o 
possible  properties  of  a  SF  Bose-liquid). 

Further  we  assume  £BA  »  |/7B|  ~  AB  ~  TB,  where  TB  is  the  temperature  of  the  < 
parameter  AB  disappearance.  Then  the  main  contribution  to  the  AB(fc),  AfB,  %B(/c)  c 
from  small  values  of  k  <  /CA  whereas  the  large  values  of  k>  kA  gives  small  correc 
that  may  be  neglected.  In  the  case  of  only  two  Bose  particles  interaction,  the  bii 
energy  Eb  of  these  particles  is  determined  from  the  equation 


where  PB= 

At    ^BA»£h    from    (4.5)    for    2d-    and    3d-BG,    correspondingly,    it    fo 
/R  -  i£B  ln(cBR/£BA),  DB  =  ms/2nh2  and  7R  =  DB 


4.2  Three-dimensional  Bose-liquid 

At  first  we  consider  the  case  T  =  0.  Going  in  (4.3)  from  summation  to  integratk 
e  and  by  making  elementary  transformation  the  following  equations  may  be  obt 
for  determination  of  critical  values  of  the  pB  and  \p.B\  at  which  SPC  of  bosons  b 
with  the  vanishing  of  the  gap  ASF  =  ^/fll  —  AB  in  EB(k)  (Appendix  1) 


where  7B  =  7BDB  ^/^  is  a  coupling  constant  of  the  interboson  interaction  as  ^BA 
acterizes  the  thickness  of  the  condensation  layer  including  almost  all  Bose  parti 
From  (4.5)  the  following  equation  is  obtained  for  determination  of  Eb 


where  y  =     \Eb\/2^. 

As  it  is  obvious  from  (4.7),  formation  of  the  bound  state  of  two  isolated  boson  pj 
possible  only  at  yB  >  y^  =  1  that  coincides  with  the  value  of  y$  determined  from  (4. 
dilute  BG  (pB  ->  0,  AB  =  0).  So  in  the  case  of  dilute  BG,  Eb  =  2 1  £B  |.  However,  othe 
E,,  =  2ASF<2|/iBj,  i.e.  the  binding  energy  of  the  boson  pairs  in  their  inten 
collective  is  decreased.  Hence  at  yB  =  y$  >  1  the  phase  transition  SPC^^PC  occ 
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a  nonideal  BG.  The  value  of  |  /JB  |  at  yQ  =  7*  is  equal  to  [70]  (Appendix  1). 

|/ZB|  =  IAB|  =  2,88TBEC  (4.9) 

where  TBEC  is  the  BEC  temperature.  Fon;B  <  VB  the  fraction  of  condensed  bosons  in  the 
zero-momentum  state  «BO  =  pBO/pB  is  deteimSned  as  a  function  of  yB  from  the  simulta- 
neous equations  system  [70]  (Appendix  1): 


PBO  = 


where  pBO  =  NKO/Q.,Nm  is  the  number  of  Base  particles  with  k  =  0. 

The  «BO(yB)  dependence  is  shown  in  figure:  1.  Now  we  consider  the  case  T  ^  0.  The 
numerical  solution  of  (4.3)  at  T  <  TB  taking  into  account  (4.4)  shows  that  the  gap  ASF  in 
EB(k)  vanishes  at  T  =  T$  «  TB  [70].  In  linnit  cases  such  solution  may  also  be  obtained 
analytically.  So,  at  yB  <  yg  and  T^  T*  «  TQ  assuming  |/ZB|  =  AB  from  (4.3)  it  follows 
[70,  71]  (Appendix  2): 


(4.12) 


(4.13) 


From  these  equations  the  values  of  T*(}'B)  and  nQO(T)  may  be  found.  At  T  <  T^  «  TB 
one  obtains  [70]  nBO(T)  as 

(4.14) 


where  0  =  mB/12pB0ft3i;c,  vc  =  ^/1/JalM  is  the  sound  velocity.  Analogous  relation  for 
nBO(T)  was  also  obtained  in  the  framevorl  of  the  phenomenological  approach  (see 


0.1      O.'i      0-6      (3.J        1 
Fieure  1.    The  dependence  of  nnn  on  VB  for  3d-BG. 


:>  uznumanov  ana  r  K  Khabibullaev 


[72]),  where,  however,  instead  of  pBO(0)  figures  pB  that  correspond  to  the  BEC  of  an  ideal 
BG.Near  7?  approximately  «BO(T)- (7*  -  T)  take  place.  In  thecase  of  T^TB,AB«TB, 
|£B| «  TR  solution  of  (4.3)  may  be  found  analytically  and  we  have  (Appendix  II): 


(4.16) 


from  which  at  T=  TR  and  AB  =  0  it  follows  that  for  yB  <  1  or  yB  «  y*  (just  in  this  case 
most  probably  the  conditions  |/ZB|  «  TB  is  well  satisfied) 


and  for  a  small  y  «  1 

B7],  (4J8) 


c  =  7i3/2/3,912. 

From  (4.  1  5)  and  (4.  1  6)  it  also  follows  that  the  AB(  T)  and  /ZB(  T)  near  TB  is  determined 
as  [70]  (Appendix  2). 


(419) 

0,25 

(420) 


where  fl  =  2(CyB)-°-5(^BA/rBEC)0-25. 

In  the  case  T->  T*  and  |/ZB(T*| «  T*  ^  TB  a  solution  of  (4.3)  is  determined  analog- 
ous to  the  case  T-*  TB  Then  the  temperature  dependence  of  fiB  (or  AB)  and  nBO  near 
T  =  T*  <  TB  accordingly  has  the  form  [70]  (Appendix  2) 


(4-21) 
and 

(4.22) 


From  (4.16)  (at  AB  =  0)  and  the  l.h.s.  of  (A2.14)  in  Appendix,  it  follows  that  in  case 

/A1 1^B?/I/7BMB)IK2'  ?  mUSt  bC  TJ^  T*^T»>  whereas  from  the  1-h-s.  of 
(A2.13)  (that  roughly  is  equal  to  2,612  Tjg.  at  T  =  T*)  in  Appendix  -  T*  >  T 

In  the  case  T  <  T*  «  TB  that  correspond^  the  cascV.  <  y™h  |  =  AB  -conltta'kes 
place.  The  phase  diagram  of  states  of  the  attracting  BG^is  indicated  in  figur  2  and  th 
temperature  dependence  of  AB,  fc  and  «BO  are  shown  in  figures  3  and  4  respectively 
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1         IV 
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Figure  2.    The  phase  states  of  an  attracting  3d-BG  for  £BA/TBEC  =  10  (in  case  yB  ~  1 
is  assumed  |/ZB(T*)|  ~  |/ZB(TB)|). 


(a) 


Figure  3.    a. 


4.3  Two-dimensional  Base-liquid 

The  case  of  T  =  0.  In  (4.3)  going  from  summation  to  integration  on  e,  it  is  easy  to  find 
the  value  of  yB  =  DB  KB  at  which  the  gap  ASF  is  vanishing  and  this  value  of  yB  is 
determined  from  [20] 

1 


(4.23) 


that  corresponds  to  the  value  of  1/iJ  =  2Tn,  where  Tn  =  2nh2pn/mB. 


T*      T 

'B       'BEC 


(c) 


Figure  3.     Schematic  dependence  of  the  ;1B  and  AB  vs.  temperature 
a)  at  -;B  >  y*;  b)  at  VB  <  yj  and  c)  yB  « y*. 
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IBEC         TjT     »B 

Figure  4.    Schematic  dependence  of  the  nBO  vs.  temperature  T  for 
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Unlike  the  3d-BG,  in  2d-BG  the  binding  energy  of  isolated  boson  pairs  is  determined 
from  (4.5)  analogous  to  the  binding  energy  of  isolated  fermion  pairs  in  the  BCS  theory  and 
bound  state  is  formed  even  under  weak  interaction  with  yB  >  0.  In  the  case  of  dilute  2d-BG 
(AB  =  0)  analogous  as  in  the  case  of  3d-BG,  £B  =  2|£B|  or  £B  =  2ASF  <  2|/ZB|  takes  place. 
The  value  of  pBO  is  determined  from  the  simultaneous  equations  [70]  (Appendix  1) 


-yBln 


2  |/Z 


(4.24) 


(4.25) 


The  dependence  of  pB(yB),  AB  on  |/1B|  (at  different  values  of  yB  and  pQ)  and  nBO(yB)  are 
presented  in  figures  5,  6,  and  7,  respectively. 
The  case  of  T  ^  0.  In  this  case  the  gap 

ASF  =  -  2Tln  [v/4  +  r\2  -  /yj/2]  (4.26) 

always  exists  in  £B(/c)  [20],  where  rj  =  exp[(  |/iB|  -  2T0)/2T].  The  temperature  depend- 
ence of  AB  at  low  temperatures  has  the  form  [49]  (Appendix  2) 


(4.27) 


where  Z  =  exp(4/7B)/[l  +  (exp(2/yB)/2)2],  AB(0)  ^  (f BA  +  |/2B|)/yi  +(exp(2/yB)/2)2. 
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Figure  6.    Schematic  dependence  of  the  AB  vs.  pB  for  2d-BG. 
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Figure  7.    The  dependence  of  nBO  on  yB  for  2d-BG. 
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ir«i,AB  t 


%  T 

Figure  8.    Schematic  dependence  of  the  jlK  and  AB  vs.  temperature  T  for  2d-BG: 
a)  at  7|5  >  7*  and  b)atvB<y*. 


As  it  is  clear  from  (4.27),  AB  decreases  with  increasing  of  temperature  and  at  T 
temperature-dependence  of  AB  and  /JB  have  forms  [49]  (Appendix  2) 


and 


2/T\2<, 

- 
TB/ 


(4.28) 


(4-29) 


by  means  of  the  self-consistent  solution  of  (4.26),  (4.28)  and  (4.29).  The  tempera 
dependence  of  AB  and  |/iB|  are  shown  in  figure  8. 


5.  Specific  heat  of  the  Bose-liquid 

The  specific  heat  of  the  Bose-liquid  is  determined  from  [73] 


, 
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dT       20T 


SF 


At  low  temperature  for  3d-Bose-liquid 


de 


"V    '~2T2J0  ^    sh2[EB(fi)/2rTBl0'- 

In  (5.2)  at  yu  >  y*,  |/ZB|  »  T,  AB  »  T  and  ASF  >  2T  the  sh[EB(e)/2T]  may  be  rep 
by  iexp[EB(e)/271  and  EB(fi)  by  ^2 1 /ZB | e  +  AS2F. 

Then  taking  into  account  that  an  essential  contribution  to  the  integral  comes 
a  small  e  and  using  in  the  exponent  expansion  x/2|/IB|fi  + AgF  «  ASF  +  | /JB  |  e/ASF,  v, 


,4 


exp    - 
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'SF 
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However,  at  yB  <y*,  Tj£T|_and  at  a  small  k  the  excitation  spectrum  of  bosc 
phonon-like  EB(fc)  ~  ^/\flB\/m^hk  and  cy(^)  ~  T3.  In  the  case  of  AB,  \jlB\ «  T* 
T*  according  to  (4.21)  ct.(T)~const/(rg  -  T)0>5.  Whereas  according  to  (4.19; 
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Figure  9.    Schematic  dependence  of  the  specific  heat  of  an  attracting  3d-B 
temperature  at  yB  «  y* . 
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(4.20)  near  TQ  the  law  cr(T)  ~  const/(rB  -  T)°-5  takes  place.  The  cc(T)  dependence  at 
yB«yjj  and  AB,|/jB|«Tg  has  the  form  shown  in  figure  9,  where  for  T>TB, 
c,,(T)  ~  T3/2  [36].  For  2d-Bose-liquid,  the  quantity  of  c,,(T)  at  low  temperature  in  the 
case  of  ASF  >  2T  can  be  estimated  analogous  to  that  evaluated  from  the  integral  (5.2). 
Then 


while  for  ASF  <  2T  one  obtains  (Appendix  II) 

4QD    T2 
cv(T)*         p       [l-y2+19exp(-2)]  (5.5) 

AB 

Thus,  at  small  ASF  the  dependence  cv(T)  is  very  close  to  phonon-like  one. 

» 
6.  Discussions  of  the  results  and  their  experimental  evidences 

In  this  section  we  discuss  the  experimental  data  available  in  4He,  3He  and  existing 
classes  of  superconductors  as  details  possible  in  the  context  of  two-stage  FBL  model  of 
superfluidity  and  superconductivity  which  can  be  realized  by  two  possible  FBL 
scenarios  in  the  so-called  fermion  superconductors  (FSC)  and  boson  superconductors 
(BSC)[49,52]. 

6.1  4  He  superfluidity 

It  follows  from  the  above  relation  (4.6)  that  the  excitation  spectrum  £B(/c)  of  a  3d-Bose- 
liquid  at  yB  <  yB  becomes  gapless  in  the  temperature  interval  Q^T^TQ  and  at  small  /c, 
is  phonon-like.  At  T>T$  in  EB(k)  the  gap  ASF  appears  being  responsible  for 
half-integral  values  of  the  circulation  (see  [15])  and  the  cv(T)  departure  from  the 
phonon-like  T3  dependence  observed  in  4He  (see  [16]).  Depletion  of  the  zero- 
momentum  state  (or  often  named  as  condensate)  of  a  3d-BG  at  T  =  0  and  0  <  T  ^  T$ 
as  a  function  of  }>B  (figure  1  )  and  temperature  (figure  4)  as  well  as  its  complete  absence  in 
the  interval  Tg  <  T  =$  TB  =  T;  (where  TA  is  the  temperature  of  /.-transition  in  4He)  is  in 
good  agreement  also  with  the  experimental  data  available  for  4He  [69,  72]. 

The  experimental  dependence  nQO(T)  has  a  feature  such  as  a  noticeable  increasing 
rcBO  below  ~  1  K  [72].  The  above  features  of  the  velocity  circulation  quantum,  cv(T) 
and  nBO(T)  indicate  on  appearance  of  the  gap  ASF  in  £B(/c)  at  T^  1,  1  K  in  4He  (i.e. 
T*  ~  1,  1  K).  Further  the  experimental  values  of  nBO  =  0,  10  -=-  0,  14  (at  T  =  0)  in  4He 
[72]  correspond  to  the  values  of  >'B  ^  1,  5  -=-  1,  6  for  TBEC/£fBA  =  0,  05,  i.e.  to  the  region  II 
at  figure  2.  It  means  that  the  interaction  is  strong  in  4He.  The  existence  of  the  pair 
condensate  in  the  k  =  0  state  considered  in  [14,  17]  is  unphysical  (that  was  noted  also  in 
[19])  and  complete  absence  of  such  condensate  (that  in  fact  must  be  single  particle 
condensate)  in  the  complete  range  0  ^  T  ^  T;  in  4He  is  not  consistent  with  the 
experimental  data.  At  yB  <  yg  and  yB  <  1  (i.e.  in  the  region  II  and  I  in  figure  2)  we  obtain 
the  relation  nBO(T)  ~  (TB  —  T)(2~^)l3  (that  have  been  derived  earlier  in  the  framework 
of  a  phenomenological  approach  [68,  72]  for  4He  with  /?  ^  0  and  T;  instead  of 
Tg  corresponding  to  the  case  BEC  of  an  ideal  BG)  with  0  ^  -  1  and  jg  ^  0,  5, 


r»  ____  .  __ 


respectively.  Therefore,  near  T%  <  T,  jump  of  c\,(T)  may  be  expected  at  weak  (yn«  1) 
and  probably  intermediate  (yB  ^  I )  coupling.  While  at  strong  coupling  (yn  >  1 )  in  c,,(  T) 
instead  of  such  jump  only  pecularity  may  be  displayed  as  it  was  observed,  for  example, 
in4He{see[16]). 

6.2  3He  superfluidity 

Many  experimental  results  speak  well  for  the  triplet  pairing  in  3He  with  angular 
momentum  /=  1  [74,75].  Such  pairing  of  3He  atoms  leading  to  the  formation  of 
composite  bosons  may  be  described  also  by  the  BCS-like  pairing  theory  [74,75]. 
Further  in  3He  just  as  in  4He  the  real  SF  state  most  probably  is  described  by  the  order 
parameter  AB  of  the  attracting  composite  bosons  (i.e.  paired  atoms  3He),  whereas  the 
BCS-like  order  parameter  AF  of  the  attracting  fermion  pairs  characterizes  only 
presuperfluid  (or  precursor)  state.  As  the  BCS-like  theory  cannot  explain  the  number  of 
features  of  SF  3He  and  HTSC  (including  also  LTSC,  HFSC  and  OSC)  observed  below 
the  critical  temperature  7C.  It  is  reasonable  to  assume  that  the  interboson  (i.e. 
intercooperon)  interaction  in  3He  probably  is  much  weaker  (yB  <  1  or  even  yB  «  yB  ^  2 
(for  cBA/TBEC  ^  10))  than  such  interaction  in  4He.  Therefore  the  condensate  fraction  in 
3He  must  be  larger  than  in  +He.  It  is  supposed  that  in  3He  just  as  in  LTSC  only  small 
fraction  of  fermions  may  take  part  in  pairing  and  pB/pF  «  1  (where  pp  is  the  density  of 
Fermi  particles  in  unit  volume).  Then  at  pF~  1023cm~3  and  pI3/pF~  10" 5  (just  as  in 
LTSC)  we  obtain  TBEC  c±  2, 52-10~3  K  that  is  very  close  to  the  observed  value  of  Tc  in 
3He,  where  in  all  probability  Tc  =  TF  =  TB  (see  below).  So,  the  gap  ASF  in  £B(/c)  most 
probably  vanishes  at  T  =  T|  ^  0, 7TC  and  there  some  features  of  3He  physical  proper- 
ties must  be  displayed.  Indeed,  such  features  are  observed  in  the  SF  phase  of  3He  as 
jump-like  increasing  of  the  critical  current  velocity  yc  three  times  under  pressure  and 
such  change  of  the  SF  density  just  at  T~  (0,6  -=-0,7)  Tc  [2,41-44].  This  jump-like 
increasing  of  the  vc  and  SF  density  is  caused  by  the  sharp  increasing  of  AB  (or 

uc =  •V/AB/MB)  at  ^B  when  yB  «  y|  (figure  3c).  Further,  it  may  be  assumed  that  under 
pressure  or  rotation  increasing  of  an  interaction  strength  between  composite  bosons 
occur  and  it  is  accompanied  by  appearance  of  the  gap  ASF  in  £B(/c)  at  a  more  low 
temperature.  Then  the  splitting  of  the  B-phase  into  two  phases  Bl  and  B2  as  well  as  the 
transition  between  them  observed  for  T~  0, 6TC  [41,43,44]  most  probably  is  caused 
by  these  circumstances.  Hence,  the  origin  of  the  A-  and  B-phases  as  well  as  transition 
between  them  in  3He  at  T~0,7TC  [2]  (cf.  with  the  above  value  of  T*^0,7TC)  is 
naturally  caused  by  the  phase  transition  SPCoPC  of  composite  bosons.  The  observed 
sharp  increasing  of  the  SF  density  in  3He  [2]  just  at  the  phase  transition  A  ->  B  speak  in 
favour  of  this  assumption.  While  the  observed  power  law  dependence  cu(T)  ~  T3  [2]  is 
caused  by  the  absence  of  the  gap  ASF  in  £B(/c)  at  T  ^  Tg  and  by  ASF  «  T  and  not  by  the 
presence  of  point  or  line  nodes  of  the  BCS-like  energy  gap  AF.  Step-like  jump  of  cv(T)  at 
Tc  in  3He  means  that  Tc  =  TF  =  TB  (as  in  this  case  near  Tc,  AF  ~  AB  «  ASF  takes  place). 
Finally  the  circulation  quantum  O0  =  /z/2m3  and  =  /i/4m3  (where  m3  is  the  mass  of  3He 
atoms)  observation  is  possible  in  B-  and  A-phase,  respectively,  i.e.  the  gap  ASF  vanishes 
and  appears  in  EB(fc)  at  T  ^  Tg  and  T>  T$  the  half-integral  values  of  the  circulation 
may  observe  when  the  gap  ASF  appears  in  £B(/c)  at  T  >  Tg .  Apparently  such  situation  is 
realized  at  the  transition  from  B-  to  A-phase  in  3He  (see  [41,42]). 
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6.3  Superconductivity  of  charged  bosons  in  superconductors 

To  establish  the  mechanism  of  superconductivity  in  LTSC,  HFSC,  OSC  and  HTSC 
first  of  all  it  should  elucidate  a  question  about  the  physical  essence  of  the  order 
parameter  AF  and  AB  as  well  as  the  energy  gap  ASF  in  different  cases.  As  the  relation 
between  them  undoubtedly  plays  an  important  role  in  determining  a  possible  super- 
conductivity scenarios  and  main  SC  properties  of  superconductors. 

6.3.1  Gap  features  of  superconductors:  We  consider  the  new  scenarios  according  to 
that  superconductivity  (superfluidity)  in  the  Fermi  systems  appears  under  the  coexist- 
ence of  two  order  parameter:  one  of  them  AF  characterizes  the  bond  strength  of  bound 
fermion  pairs  and  the  other  AB,  the  bond  strength  of  all  condensed  bosons  (i.e.  existence 
of  a  SF  state).  While  the  gap  ASF  characterizes,  as  stated  above,  the  binding  energy  of 
a  single  bound  boson  pairs  or  the  structure  of  the  SF  state.  In  other  words,  the  phase 
transition  to  the  SC  state  is  accompanied  as  a  rule  by  the  formation  of  bound  fermion 
pairs  (i.e.  composite  bosons)  with  their  subsequent  transition  to  the  SF  state  by  SPC 
and  PC  [70].  Here  it  should  be  distinguished  as  two  possible  cases:  a)  disappearance  of 
bound  fermion  pairs  (or  AF)  at  T=Tr  is  simultaneously  accompanied  by  vanishing 
their  SF  condensate  (or  AB)  and  b)  disappearance  of  a  SF  condensate  (or  AB)  of  such 
composite  bosons  at  T=  TB  does  not  lead  as  yet  to  the  break-up  of  bound  fermion 


'B   te          °k*>&  IF 

Figure  10.    Schematic  dependence  of  the  AF,  AB,//B  and  ASF  vs.  temperature,  for 
3d-FSC:  (a)  and  (b);  for  3d-BSC:  (b)  and  (c). 
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pairs  which  are  dissociated  at  more  higher  temperatures  (i.e.  at  T  =  TF  >  TK).  Sc 
can  exist  two  types  of  superconductors  in  which,  may  be  realized,  two  FBL  scena 
superconductivity  [49-53].  These  type  superconductors  are  called  fermion  sup 
ductors  (FSC)  (with  AF  <  AB  and  AF  >  AB,  Asc  =  AF,  Tc  =  TF  =  TB)  and  boson 
conductors  (BSC)  (with  AF  >  An,  Asr  =  AB,  TK  =  TB  <  7^),  respectively  (figure  10 
FBL  scenarios  of  superconductivity  realizing  in  these  FSC  and  BSC  may  be  des 
by  combined  Fermi-  and  SF  Bose-liquid  theories  which  are  given  above.  Then  01 
a  quasiparticle  spectrum  with  a  finite  coexisting  gaps  AF  and  AB  inside  which  wi 
collective  soundlike  modes  (at  yB  <  }>£)  caused  by  the  closing  of  the  gap  ASF.  He 
FSC  at  AF  <  AB  some  gap-like  features  associated  with  AB  and  ASF  at  AF  <  AF 
appear  outside  the  BCS-like  gap  ASC  =  AF  and  inside  it,  respectively.  Indee 
observed  in  LTSC  non-zero  DOS  inside  Asc  and  a  peak  precursor  its  outsic 
nature  of  those  remains  unknown  [3,24],  speak  well  for  our  predicts.  While,  ir 
both  the  above  gap-like  feature  AB  and  AS1.  will  appear  only  inside  the  gap  AF  w 


T 


Figure  11.    Schematic  dependence  of  the  ASF  and  AB  vs.  temperature  in 
3d-BSC  for  yB  =  y*  (a)  and  in  3d-BSC  for  y  <  7*  (b). 


a  sharp  maximum  DOS.  Further  at  }>B  <  y%  and  T  ^  T$  the  non-zero  DOS  must  exist 
also  near  the  bottom  of  the  gap  AF  (that  is  displayed  as  a  sharp  maximum  DOS  not  only 
in  the  SC  but  also  in  the  normal  stale).  Such  pictures  are  observed  in  HTSC  [23-28]. 
The  observed  large  gap  (or  often  called  as  pseudo-gap)  in  HTSC  unknown  nature  is 
most  probably  the  gap  AF  and  it  bears  no  relation  to  real  SC  order  parameter  Asc,  as 
this  gap  exists  also  in  the  normal  state  [23,  35-37].  In  BSC,  at  T  =  TF  >  TB  the  phase 
transition  in  electronic  subsystem  having  an  effect  on  the  lattice  state  must  also  occur. 
In  all  probability,  the  nature  of  the  phase  transitions  unknown  origin  observed  in 
YBa2Cu3O7_  v  at  ~  150  K  and  ~  230  K  [38,76]  are  the  same  and  are  caused  by  the 
destroying  of  two  type  bipolarons  at  T  >  Tc  .  Further  in  BSC,  identification  of  the  gap 
Asc  and  AF  must  lead  to  an  essential  deviation  of  the  temperature-dependence  of  such 
Asc  from  its  assumed  BSC-dependence  as  Tc  now  does  not  correspond  to  TF,  i.e.  at 
TF  ^  2TB  the  pseudo-gap  AF  almost  does  not  depend  on  the  temperature  up  to  Tc  =  TB. 
Such  gap  anomalies  are  observed  also  in  HTSC  [26,  35,  36].  Moreover,  in  BSC  curves 
of  the  temperature-dependence  of  AB  and  ASF  (figure  11)  with  the  raising  of  the 
temperature  it  must  shift  to  meeting  each  other  and  cross  at  T=  T*  <  Tc  =  TB.  These 
gap-like  features  were  observed  also  in  HTSC  [28,77].  Further,  the  gapless  supercon- 
ductivity in  LTSC,  HFSC,  OSC  and  HTSC  as  well  as  such  superfluidity  in  3He  is 
caused  by  the  vanishing  of  the  gap  ASF  at  T  ^  T|  for  yB  <  y%  and  not  by  the  point  or  line 
nodes  of  the  gap  AF  [52,  53].  Indeed,  the  experimental  results  show  that  often  assumed 
point  or  line  nodes  of  the  BCS-like  gap  AF  responsible  for  gapless  superconductivity  is 
absent  in  fact  [23,  25].  Therefore,  the  presence  of  the  DOS  near  the  bottom  of  the  Asc 
may  be  considered  as  one  of  the  direct  confirmation  gapless  superconductivity 
existence.  Present  theory  (see  also  [52,  53,  70])  predicts  that  the  gapless  superconduc- 
tivity can  exist  only  in  the  temperature  interval  0  <  T  <  T%  (which  can  be  observed 
experimentally). 

6.3.2  Ratio  2ASC/TC:  From  the  above-stated  it  follows  that  the  ratio  ysc  =  2ASC/TC  and 
Tc  in  case  of  FSC  may  be  determined  from  the  BCS-like  relation  (3.4)  and  (3.3), 
respectively.  However,  in  case  of  BSC  such  definition  of  Asc  is  not  applicable.  In  the 
present  case,  the  ratio  gsc  should  be  determined  as  gsc  =  2ASC(  =  AB)/TC(  =  TB)  and  not 
as  gsc  =  2ASC(  =  AF)/TC(  =  Tf)  that  leads  to  non-real  large  values  of  gsc  in  comparison 
with  its  BCS-values  (as  the  experimental  value  of  Tc  now  is  not  corresponding  to  TF). 
The  observed  values  of  gsc  ^  5  and  ^  8  in  HTSC  [23-27]  most  probably  is  caused  by 
these  circumstances.  So,  according  to  (4.9)  in  the  case  of  3d-BSC  at  yB  ^  }'B  ,  the  quantity 
of  gsc  is  equal  to  [71] 


/Tc  (6.1) 

and  in  the  case  of  2d-BSC  this  ratio  at  yB  <  yg  is  determined  as  [49,  50]  (Appendix  2) 

#sc=8exp(-2/yB)  (6.2) 

The  numerical  self-consistent  solution  of  (4.3)  after  going  to  the  integration  and  taking 
into  account  (4.4)  at  yB  =  yg  in  case  T  —  TB  for  3d-BSC  give  jTB/TBEC  ^  3  and  gsc  ^  1,  9. 
Such  solution  of  this  equation  at  0  <  yB  <  yB  gives  0  <  gsc  ^  1,  7  (figure  12).  These 
values  of  gsc  agrees  well  with  its  observed  values  in  HTSC  [24,25,27].  While  the 
observed  values  of  6fsc^3,  5-f-8  [24,25,27]  have  been  determined  according  to 
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Figure  12.    The  dependence  of  gsc  on  y,,  for  3d-BSC. 


irrelevant  BCS  scheme  as  0SC  =  2ASC(=  AF)/TC(  =  TF).  Since  the  temperature-dep< 
ences  of  the  gaps  <ysc  ^  5  and  ^  8  is  deviated  strongly  down  and  up  from  their  assu 
BCS  dependences,  respectively  [26, 78].  Moreover,  the  large  gap  asc  ^  8  exists  not  < 
below  Te  but  also  above  Tc  [24,35-37].  A  novel  two-stage  FBL  scenario 
superconductivity  [49, 52]  predicts  disappearance  of  the  BCS-like  gap  AF  in  BS< 
7=  rF>Tcoreven  T=TP»TC.  Indeed,  at  VFnDFtt  -0,5  and  | UCn \  +  hto0  ~  0,. 
from  (3.3)  it  follows  TfH  ~  530  K. 

6.4  Size  and  flux  Quantization  effect  for  fermion  and  boson  pairs 

The  above  observed  in  LTSC  anomalies  of  gap-like  features  indicate  [3]  that  < 
Cooper  pairs  are  real  composite  bosons  obeying  Bose-Einstein  statistics  and  kee; 
its  identity.  In  fact,  the  real  size  of  the  Cooper  pairs  af  ~  nh/^/4mK^f  (at  mB  ~  2mc 
AF^  10~4eV,  where  mc  is  the  mass  of  a  free  electron)  and  the  mean  distance  betv 
them  is  of  the  same  order  quantity,  i.e.  aF^  10~6cm.  So  they  are  not  strong  overk 
spite  of  the  prediction  of  BCS  theory  based  on  the  estimation  of  aF  ~  10~4cm  th 
too  overestimated.  As  in  BCS-theory,  the  coherence  length  is  usually  estim 
according  to  incorrect  assumption  z(k)  =  Ef  (i.e.  k  =  k¥)  that  contradicts  with  its  t 
supposition  s(/c)  <  htoD.  While  the  binding  energy  of  boson  pairs  in  their  interac 
collective  decreases  and  is  equal  to  zero  at  yB  ^  yg  (when  boson  pairs  begins  overl; 
ing  and  bosons  exchange  from  one  pair  to  another  becomes  dominant).  Therefore 
of  identity  or  strong  overlap  are  peculiar  to  the  attracting  boson  pairs,  and  nc 
fermion  pairs.  These  distinctive  features  of  fermion  and  boson  pairs  are  visi 
displayed  in  the  flux  quantization  effects  in  4He  and  3He,  LTSC,  respectively, 
example,  in  4He  are  observed  both  $  =  h/m4  and  h/2m4  (where  w4  is  the  mass  of 
atoms)  flux  quantum  [1, 15],  whereas  in  LTSC  and  3He  only  flux  quantum  h/2e 
h/2m3  (where  w3  is  the  mass  of  3He)  is  observed,  respectively,  and  not  h/e  and  h 
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Some  experimental  facts  in  3He  (see  [42] )  indicate  about  the  existence  of  quartets  with 
flux  quantum  <t>  =  /z/4m3  also.  This  corresponds  to  the  case  Asc  ^  0.  The  quartets  with 
flux  quantum  <D  =  h/4e  may  be  formed  also  in  superconductors  (e.g.  in  HTSC)  at 
T>  T*,  i.e.  in  the  temperature  interval  T*  <  T <  Tc. 

6.5  Specific  heat 

The  temperature-dependences  of  AB  and  |£B|  near  T|  at  1  <yB  ^  yg  and  0  <yB  <  1 
is  weak  and  strong,  respectively.  In  this  connection  the  cv(T)  curve  near  Tg  for  AB, 
lAfil «  ^B  (°r  Tu^yi?)  (tne  regi°n  I  on  figure  2)  just  as  near  TB  (where  the  condition 
AB,  |/ZBI  «  TB  anc*  P^  0, 5  may  be  well-satisfied  even  for  yB  ~  1)  may  undergo  A-like 
jump  and  at  1  <  yB  ^  yB  (the  region  II  on  figure  2)  probably  displayed  only  peculiarity. 
In  all  probability,  the  first  of  them  takes  place  in  HTSC  and  HFSC  [39,40],  and  the 
second  in  4He  (see  [16]).  As  in  HTSC  and  HFSC,  A-like  jump  of  c0(T)  are  observed 
not  only  at  Tc  [38,79]  but  also  at  T<  Tc  [39,40].  While  in  YBa2Cu3O7_;c  a  small 
step-like  jump  of  c\,(T)  at  7^2, 5TC  [76]  is  caused  by  destruction  of  bipolarons,  i.e. 
the  phase  transition  normal  Bose-liquid<=>  Fermi-liquid  (figure  9)  in  electronic  (and 
not  lattice)  subsystem  [49-53]  as  stated  above.  In  [76],  on  the  basis  of  the  experimental 
data,  analogous  assertion  have  also  been  made.  This  indicates  that  the  HTSC  are  BSC. 
The  HFSC  and  OSC  are  related  also  to  this  class  so  far  they  are  characterized  by 
ct.(T)~  T3  at  low  temperature  [8,80]  and  at  T=  Tc,  undergoes  A-like  jump  [5,79]. 
Such  gapless  superconductivity  is  most  probably  realized  also  in  LTSC  as  there 
c,(T)~  T3  is  observed  up  to  Tc  [3].  Indeed,  both  in  BSC  and  in  FSC  at  ASF<  Asc 
and  especially  at  ASF«ASC  (or  ASF  =  0)  the  main  contribution  to  cv(T)  comes  from 
the  excitation  of  the  composite  bosons  (e.g.  Cooper  pairs  in  LTSC)  as  a  whole  and 
not  from  excitation  of  their  Fermi  components.  The  number  of  such  bosons  in  LTSC  is 
much  smaller  (pB  ~  1017-^  1018cirT3)  than  their  number  in  HTSC(pB~  1021cm~3). 
Unlike  the  BSC  (where  pB  ~  const  up  to  TB  or  even  above  TB),  in  FSC  at  comparatively 
low  temperatures  pB  (or  TB)  ^  const  but  at  T-»  TF  fast  decreation  of  pB  (or  TB)  up  to 
BG  concentration  for  which  TB  -»•  TF  occurs.  For  inconstant  pB  (at  T  >  TF/2  and 
especially  at  T->  TF)  the  determination  of  the  quantitative  trend  of  /iB(T)  and  AB(T) 
changes  is  a  very  difficult  problem  as  it  requires  a  combined  self-consistent  solution 
of  the  equations  for  %(T),  AF(T),  p.B(T)  and  AB(T)  under  the  conservation  of  the 
total  particle  number  N  =  Nr  +  2WB.  In  this  case  we  have  to  limit  ourselves  to  predict 
the  qualitative  trend  of  /iB(T)  and  AB(T)  changes  which,  however,  may  be  found 
close  enough  to  the  real  situation.  So  the  value  of  TB  (including  T%  and  TBEC) 
determined  for  pB  ~  const  and  expected  above  TF  becomes  unphysical.  Probably 
in  FSC  with  a  weak  coupling  the  gap  ASF  in  ER(k)  may  be  absent  up  to  T=  Tc  =  TF 
or  appear  near  TF,  where  ASF>AF.  Then  near  Tc  the  cv(T)  undergoes  step-like 
(or  BCS-like)  jump  that  is  observed  in  LTSC.  Apparently  in  3He  interaction  between 
composite  bosons  is  several  times  stronger  than  in  LTSC  and  the  gap  opening  in  EB(k) 
occurs  at  T^O,  7TC.  Then  near  Tc  the  situation  ASF»  AF  takes  place  and  the  jump 
of  cc(T)  at  T=TC=TP  determined  by  BCS-theory  has  step-like  also.  So  LTSC 
and  3He  are  related  to  the  FSC.  Further  in  BSC,  for  Tg  that  is  close  to  TB,  the  cv(T) 
undergoes  two  jumps  near  Tc  =  TB  separated  by  a  small  temperature  interval.  Indeed, 
suchjumps  of  cv(T)  are  observed  in  HTSC  [81]  and  HFSC  [82].  Other  explanation  is 
given  below. 


vjn   uiiv 


Bose-liquid  models  [71]  (in  case  of  2d-holon  superconductors  this  questic 
considered  in  [20])  from 


where  xL(0)  =  (mEc2/4npKe2)l/2,  c,  is  the  light  velocity;  pB  =  pBs  +  pB)I;  pBs  and  pB 
density  of  the  so-called  'SF'  (nonexcited)  and  'normal'  (excited)  Bose-liqui 
a  3d-Bose-liquid  at  ASF(7X  T*  <  TJ  =  0,  pBn  ~  T4  and  from  (6.3)  it  follows  [7 


AL(0) 

that  has  been  only  empirically  obtained  earlier.  For,  a  2d-Bose-liquid  at  A, 
approximately  takes  place  at  pBll  ~  T3  and 


and  at  A    >  2T  one  obtains 


SF 


Many  experimental  data  both  in  LTSC  [3],  HFSC  [83]  and  in  HTSC  [7,  84; 
agreement  with  (6.4)  and  do  not,  with  exponential  dependence  of  the  BCS 
Further  in  HTSC,  OSC  are  observed  the  power  law  dependence  of  /1LH 
[7,  80,  85,  86]  and  that  it  follows  from  a  3d-Bose-liquid  model  also  (this  resul 
reported  separately).  In  recently  discovered  high  temperature  OSC  of  the  type 
the  dependence  /L(r)//lL(0)  is  observed  very  close  to  the  dependence  (6.5),  i.e.  the 
of  (T/TC)V  in  AL(T)/UL(0)  is  equal  to  v  =  3,2  (see  [87]).  Some  experiments  als 
v  =  1,  3  -4-  3,  2  in  accordance  with  our  results. 

6.7  Critical  magnetic  field  and  Meissner  effect 

As  is  generally  known,  the  critical  magnetic  field  Hc  for  each  type  of  SF  cat 
determined  from  the  difference  of  the  free  energies  of  normal  and  SC  states. 
difference  for  Bose-liquid  is  smaller  than  for  Fermi-liquid,  then  the  Hc  she 
determined  from  the  difference  of  the  free  energies  of  normal  and  SF  Bose 
Otherwise  Hc  is  determined  from  the  difference  of  the  free  energies  of  Fermi-li 
normal  and  SC  states  according  to  BCS  theory.  These  cases  is  most  probably  i 
in  BSC  and  FSC,  respectively.  Then  for  BSC  one  obtains  [70] 
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HFSC  (see  [29,30])  is  naturally  explained  by  this  relation  as  Hc  ~  W^,  i.e.  unusual 
upturn  of  the  low  temperature  part  of  Hc ,  ( T)  curve  is  caused  by  the  sharp  increasing  of 
the  AB(T)  at  T<  T$  when  yn  <  1  and  especially  when  y,, «  1  (figure  3). 

The  Meissner  effect  both  in  FSC  and  in  BSC  is  determined  by  superfluidity  of  the 
Bose-liquid  and  not  that  of  Fermi-liquid.  It  is  not  difficult  to  show  the  absence  of  the 
paramagnetic  part  of  current  for  SF  Bose-liquid  analogous  to  that  being  done  for  'such" 
Fermi-liquid  in  BCS  theory  (this  result  is  reported  separately). 

6.8  Existence  of  several  types  of  super  fluid  carriers 

Many  experimental  data  on  the  gap  features,  specific  heat,  London  penetration  depth, 
critical  magnetic  fields  and  fractional  Meissner  effect  is  indicative  of  the  existence 
(coexistence)  of  several  types  of  SF  carriers  in  HTSC,  HFSC  and  type  II  LTSC.  For 
example,  they  speak  well  for  the  existence  of  at  least  two  types  of  carriers  with  the 
different  fermion  and  boson  order  parameters  [4,7,38-40,76],  such  as  A,.-,,  A,,2,  and 
AB  j ,  AB2  respectively,  as  well  as  Fermi-liquid  [6, 9]  and  normal  Bose-liquid  [88, 89]  in 
HTSC  and  HFSC.  It  is  not  excluded  that  some  of  the  observed  maxima  of  c,,(T)  in 
HTSC  (including  HFSC)  below  Tc  are  caused  by  the  second  SC  transition  at 
T=  TC2  <  Tcl  =  Tc. 

Further,  in  BSC  at  TF2  <  Tcl  and  Tcl  <  TR  the  temperature  dependence  of  the  Asc 
determined  as  A^(T)  =  AK,(7)  and  A£J(T)  =  AF2(T)  should  deviate  down  and  up, 
respectively,  from  their  assumed  non-real  BCS-dependence(i.e.  when  Tc  =  Tcl  <  TF1  is 
assumed).  Indeed,  such  deviation  in  the  temperature-dependence  of  the  gap-like 
features  from  their  BCS-dependence  are  observed  in  HTSC  [26,35,36,78].  The 
observed  two  different  temperature  dependence  AL(T)  ~  T4  and  ~  T2  (in  HFSC  and 
HTSC  [7,80,83,85])  or  ;,L(T)~T3  and  ~  T2  "(in  OSC  [80,86,87]),  as  well  as 
jump-like  features  of  cv(T)  at  T>  Tc  [38]  and  fractional  Meissner  effect  or  type  III 
superconductivity  [90]  in  HTSC  may  be  explained  also  by  the  presence  of  several  types 
of  carriers.  At  different  pn,  mB  or  yB  the  crossing  of  the  different  order  parameters  AB(T) 
and  the  formation  of  a  new  mixed  state  in  the  temperature  interval  Tc2  <  T<  Tcl  as 
well  as  positive  curvature  near  Tc  is  possible.  Apparently,  such  situation  is  realized  in 
HFSC  and  HTSC,  where  two  jumps  are  observed  near  Tc  separated  by  the  temperature 
interval  60  -f-  70  mK  [81]  and  ~  3, 5  K  [82],  respectively.  Also,  a  new  mixed  state  in  the 
comparatively  wide  temperature  interval  may  be  formed  without  the  crossing  of  two 
AB(T). 

6.9  Other  confirmations 

The  following  are  the  experimental  facts:  (i)  the  small  value  of  the  entropy  (or  the  large 
single  particle  condensate  fraction  in  our  context)  observed  in  HTSC  at  T  <  Tc  (see 
[91])  in  comparison  with  its  values  expected  for  an  ideal  3d-BG;  (ii)  the  SF  density 
change  ns  ~  [1  -  (T/TC)4]  for  HTSC  [83]  and  HFSC  [92];  (iii)  the  unusual  upturn  of 
the  low  temperature  part  of  critical  current  JC(T)  and  sound  velocity  dependences 
[31,32]  (these  features  probably  is  caused  also  by  the  sharp  increasing  of  the  AB(T)  at 
T^  T%  for  yB  <  1).  These  observations  once  again  confirm  the  validity  of  the  above 
microscopic  theory  of  SPC  and  PC  of  an  attracting  BG  as  the  basis  for  superfluidity 


7.  Conclusion 

It  follows  from  the  above-stated  conceptions  and  arguments  that  the  SC  (SF)  phase 
transition  in  Fermi  and  Bose  system  is  two-  and  one-stage,  respectively,  where  the  SPC 
and  PC  of  an  attracting  BG  (as  distinct  from  the  often  discussed  SPC  of  a  repulsive  BG 
and  BCS-like  condensation  of  the  Cooper  pairs  reminiscent  of  the  BEC  of  an  ideal  BG) 
are  responsible  for  superfluidity  both  of  charged  and  uncharged  Bose-liquids  in 
superconductors,  3He  and  4He.  Further,  two  possible  FBL  scenarios,  such  as,  Fermi- 
liquid  ->SF  Bose-liquid  and  Fermi-liquid  ->•  normal  Bose-liquid -»SF  Bose-liquid  is 
realized  in  FSC  and  BSC,  respectively. 

In  all  probability,  the  SF  and  SC  properties  of  3He  and  LTSC,  being  related  to  the 
type  of  FSC  with  weak  coupling  (yB  <  1  or  even  probably  y „  «  1 )  have  determined  both 
the  Fermi-liquid  behaviour  (e.g.,  TC,ASC(T)  (or  tjsc)  and  Hc)  described  within  the 
BCS-like  theory  and  Bose-liquid  one  (e.g.,  c,.(T),AL(T),  and  gapless  superfluidity 
(superconductivity))  described  within  the  above  theory  of  SPC  and  PC  of  an  attracting 
BG  [49-53].  While  the  properties  of  4He,  HTSC,  HFSC  and  OSC  which,  most 
probably  are  related  to  the  BSC  with  weak  (}>B  <  1)  and  possibly  intermediate  (yB  ~  1) 
or  even  strong  (}>B  >  1  in  case  4He)  coupling  are  determined  almost  entirely  by 
Bose-liquid  behaviour  described  within  the  above  theory  of  SPC  and  PC  of  an 
attracting  BG.  In  these  superconductors  the  behaviour  of  the  BCS-like  Fermi-liquid 
will  be  displayed  as  the  pseudo  gap  AF  >  Asc  and  AF  7^  0  in  SC  and  normal  states, 
respectively  and  as  the  electronic  phase  transition  at  T  =  TF  >  Tc.  Many  experimental 
data  available  both  in  SF  4He,  3He,  different  classes  of  superconductors  simultaneously 
confirm  the  applicability  and  validity  of  the  above  proposed  microscopic  theory  of 
two-stage  FBL  scenarios  for  description  of  superfluidity  and  superconductivity 
phenomena  from  a  unified  position.  Thus,  SPC  and  PC  of  an  attracting  BG  serve  as 
a  key  to  superfluidity  and  superconductivity  of  both  the  composite  bosons  (e.g.  Cooper 
pairs  and  bipolarons)  and  noncomposite  ones  (e.g.  4He  atoms). 

Note  added  for  the  revised  version 

After  submission  of  the  first  version  of  the  present  paper  we  have  learnt  about  the 
work  [93-95].  In  accordance  with  our  results,  in  [93]  and  [94]  a  half  magnetic  flux 
quantum  <J>0  =  h/4e  and  a  T2  term  in  cv  has  been  observed,  respectively.  While  the 
results  of  the  work  [95]  obtained  on  the  basis  of  a  phenomenological  approach  and  its 
predictions  about  the  heat-flow  induced  anomalies  in  SF  4He  near  T;,  namely, 
a  sizeable  enhancement  of  the  cv(Q)  and  a  weak  depression  of  the  SF  density  ps(Q)  (where 
Q  is  a  stationary  heat  current)  has  any  qualitative  likeness  with  our  microscopic  ones. 
In  our  opinion,  all  these  results  once  again  demonstrates  that  the  above  proposed 
microscopic  theory  are  adequate  and  valid  for  description  of  superfluidity  and  super- 
conductivity. 
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Appendix  1 

Calculating  the  basic  parameters  of  an  attracting  BG  for  T  =  0. 
Using  the  BCS-like  approximation  (4.4)  in  (4.3)  we  have 


AB1  =   - 


where 

1          feA  J  fcR 

PBi=n  Z  wu(fc)>     pB2=-   X    nB(/c),     nB(/c)  =  [exp(EB(/c)/T-  I]'1, 


= 
B 


=   _          (l+2nB(fc>)) 
B     Qfc4A      2£B(/c') 

From  (A  1.1)  we  obtain 


As  almost  all  Bose  particles  have  energies  smaller  than  £BA,  then  going  in  (Al.l)  and 
(A  1.2)  from  summation  to  integration  on  e  we  have 


2yM-i/g)2-AB2 

For  CBA»  I^B^BI,  AB2,  T  one  gets  for  3d-BG  and  2d-BG 


^-^-IJ* 

(A1.4) 
de  (A  1.5) 

ds  (A  1.6) 


and    /R2£ln, 

Z  <=BA 

respectively.  Further  at  VBR/R  »  1  from  (Al.l)  it  follows 


that  at  pB2  «  pB1  together  with  the  (A  1.4)  determines  the  value  of  |/IB|  =  —  /^B  +  2pB 
(KBR  —  FBA)  and  AB  =  AB1  .  So  the  BCS-like  case  approximation  (4.4)  becomes  equival- 
ent to  the  following  simple  BCS-like  one 

V3(k  -  k')  =  -  KB,    if  e(/c),  B(k')  <  £BA 
0,     otherwise 
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In  the  case  7  =  0  and  |  /7B  |  =  AB  from  (A  1 .4)  and  (A  1 .8)  at  large  £BA  for  a  3d-BG  one  obtains 


^5=    lim 

#B          6,,-** 


(A1.10) 


From  (A  1.1  1)  it  follows  (4.7)  and  its  substitution  into  (A  1.10)  gives  (4.6).  Further  from 
(A1.10)  it  follows  (4.9).  Now  separating  the  term  with  k  =  0  in  the  summation  (4.3) 
according  to  procedure  [14]  we  have 


(A1.12) 

Now  going  in  (A  1  .1  2)  from  summation  to  integration  on  f,  taking  into  account  (A  1  .9)  we 
obtain 


B  -  PBO)  =  D* 


VV  +  2|juB|(:. 

From  (A1.13)  and  (A1.14)  one  obtains  (4.10)  and  (4.11),  respectively.  For  a  2d-BG  in 
(A  1.1 3)  and  (A  1.1 4)  the  multiplier  ^/e  under  the  integral  is  absent. 

Appendix  2 

Calculating  the  basic  parameters  of  an  attracting  Bose-gas  for  7^0. 
For  7^  0,  (A1.4)  and  (A1.5)  may  be  written  as 

r\  l*f  . 

de 


21      V*-  77- 

0  V(£ 


-  AB  [ex 


)2  -  A|/r)  - 


(All) 


+  2    J        N/£ 
0 


(A2.2) 
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For  T«  TB  and  £B  =  AB  the  second  integral  in  (A2. 1 }  and  (A2.2)  at  a  small  t:  is  nearly  equal 
to  (7rT)2/3,/2/<3/2  and  (nT)2/3^2p~n.  Further,  according  to  (A  1.1 2)  in  (All)  and  (A2.2), 
terms  2/;BO//ZBD,J  and  pKQ/p.KDK,  must  be  tolerated  respectively.  Then,  taking  into  account 
(A1.10)  and  (Al.l  1),  it  is  easy  to  obtain  (4.12)  and  (4.13).  In  case  of  ASF  ^  0  (or  pBO  =  0)  the 
first  integral  in  (A2.1),  and  (A2.2)  can  be  evaluated  approximately  using  the  expansion 


i    r ,     A2    i 


(A2.3) 


X/lK-l-l/'BlJ 

and  after  integration  also  using  the  expansion 

r- — TT-      5i 
arctg^/euA///,,^-- 

for  these  integrals  we  have 

\     \2       /  f,      ~\ 

and 


TtAfj 


(A2.4) 


respectively.  While  the  second  integral  in  (A2.1)  and  (A2.2)  is  calculated  by  substitution 

t  =  J(i:/\fitt\}2+2i:/\fttt\,a2}t2+a22  =  [(<;  +  |/IB|)2  -  A2]/T2  and  using  the  method  [96], 

where  al  =  |jCB|/T,  a2  =  v/|/IB|2  -  A,2}/T. 
Then  the  second  integral  in  (A2.2)  has  the  form 


(A2.5) 


It  is  reasonable  to  assume  that  near  TB  takes  place  a{  «  1  and  ct2  «  1  as  well  as  AB  « /1B. 
For  such  «i  and  «2  the  integral  (A2.5)  may  be  calculated  by  using  the  method  [96].  Here 
we  give  the  final  results  which  has  the  form 


2,612-727:     ^  +  -^f 


2    2,612- 


1  - 


|        (A2.6) 


for  (A2.2) 


TtT 


(A2.7) 


2V/1B 

for(A2.1). 
After  expansion,  the  expressions 


and 
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by  the  degrees  of  AB//ZB  and  assuming  1, 46^/2  ^  2  from  (A2.1),  (A2.2),  (A2.3),  (A. 
(A2.7)  we  obtain  (with  an  accuracy  to  ~/!B(T)) 

5 


8j5 


-2 


2pB_  n&l 

From  (A2.8)  for  T/^BA  >  £B/T  and  T/£BA  ^  1/27T  it  follows  (4.16).  While  in  (A2.9 
into  account  2/?B/AB  =  2, 612N/7rTB/E2c  and  making  some  transformation  we  he 


* 


+  2,612-2    -^    1-- 


from  which  it  follows  (4.15).  Now  the  quantity  of  the  /IB(  T)  and  AB(  T)  near  TB 
found  by  elimination  of  AB/8/IB  in  (4.15)  and  (4.16).  After  some  algebra  we  ha1 


2,612 
2N/7r/iB(T) 


+ 


/^B(^B)^B  =  I_A^ 


MT)    T 


8/25 


'MT)  TV 


-a   _  i    .     ^B 

"""       8/I2 


from  which  (at  T~  TB)  it  follows 
'/^(T1)         //ZB(TB) 


1,306     fTll2-T312 


T 

~2T  =  ° 


The  solution  of  this  equation  has  the  form 


Further  taking  into  account 


T3/2_T3/2         T3 


we  have 


2Tj>/2 


3,918    &_2|CTB-T) 
2,7^  V£B 


/A»(T)  =   T 
V^B(TB)      TB 

from  which  at  TB//IB  »  1  after  the  determination  of  £B(TB)  from  (4.16)  approxin 
follows  (4. 19).  Near  TB ,  (4. 16)  may  be  written  as  - 


414 
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j  ana  ^.iyj  u  IOHOWS  i<f.zuj.  IN  ow  we  determine  me  /.in(l  )(or  AB(  i  ))  ana 
nBO(T)  near  T=T*<  TB  assuming  /2B(T)/T*«  1.  Then  for  T<  T*  going  in  (A1.12) 
from  summation  to  integration  on  e.  and  for  T*  taking  into  account  (4.  1  5)  and  (4.  1  6)  the 
equations  determining  the  juB(T)  and  pBO(T)  (or  nBO(T))  may  be  written  as 


2  -  7t7\/2|/ZB(T)| 


(A2.13) 
(A2.14) 


from  which  after  their  combined  solution  and  some  algebra  it  follows  the  equation  for 
|/1B(T)|  analogous  as  (A2.1  1)  and  the  solution  this  equation  near  Tg  has  the  form  (4.21). 
Further  substituting  |/tB(T)|/|/ZB(Tg)|  from  (4.21)  in  (A2.13)  gives  (4.22).  In  the  case  of 
2d-BG  the  first  equation  in  (4.3)  after  going  to  the  integration  and  substitution 


I /1B  I)2  -  A2/2Thas  the  form  [49] 
TB  ~  J  -x,  J~y2  +  (A*)2 


where    xl=^Sf/2T>    x2  = 


+  \J1B\)2  - 


*  =  AB/27:    In    the    interval 


Xj  <  y  <  1  and  1  <  y  <  x2  assuming  cthy  ~  1/x  and  ~  1,  respectively,  from  (A2.15)  we 
get 


A* 


}-?+  (A*) 


l+yi+(A*) 


At  low  temperature  Ag  »  1,  y2»  A£  and  Ag  »}'1.  Then  \ny1  is  small  and  it  can  be 
neglected.  In  this  case  from  (A2.15)  approximately  it  follows 


from  which  after  some  algebra  we  obtain  (4.27).  While  near  TB,  Ag  «  1  and  from  (  A2.  1  6) 
we  have  (with  an  accuracy  to  ~  (Ag)2) 


lny 


Further,  assuming  y*n  ~  1  and  A|/yB  «  1  from  (A2.18)  it  follows 

2A* 


(A2.18) 


1+A* 
from  which  it  follows 
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It  is  reasonable  to  assume  that  the  ASF(T)  near  TB  is  changed  as  ~  y.(2T}q  (whe 
determined  at  T  =  TB  from  the  condition  AB(TB)  =  0)  with  the  variation  param 
Then  the  AB(T)  and  \fiH(T)\  is  determined  by  (4.28)  and  (4.29). 

Now  we  estimate  the  integrals  (5.2)  for  ASF<2r.  After  substitution  x  =  £, 
assuming  shf  approximately  to  be  equal  to  x  and  l/2exp(x)  accordingly  at  x  < 
x  >  1  one  may  obtain  from  (5.2) 


v/X2+(A*)2 

We  have  done  approximate  estimations  of  the  second  integral  ass 
^/x2  +  (A|)2  ~  Ag  (at  low  temperature)  as  the  main  contribution  comes  from  a 
near  lower  limit  of  the  integral,  where  .x  «  Ag  .  As  a  result  one  obtains  (5.5).  Fin 
T  =  0  and  ASF  =  0  the  integral  (A2.15)  is  equal  to 


}'B  AB(0) 

While  at  T=  TB  the  approximate  estimation  of  the  integral  (A2.15)  according 
above  scheme  gives 

dy        2T 
+        -     = 


2TB 

where  fc,  =  /ZB/2T,  b2  =  (CBA  +  /Z)/2T.  According  to  (4.29)  at  T=  TB,  /2B(TB)  = : 
(2  +  yB).  Then,  from  (A2.21)  and  (A2.22)  it  follows  (6.2). 
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istract.  ESR  spectra  of  a  laboratory  synthesized  kerosene  base  magnetic  fluid  containing 
rafine  magnetic  particles  (average  diameter  of  100  A)  of  Zn0.jFe0.9Fe2O4  are  recorded  at 
ferent  temperatures.  A  narrow  signal  was  observed  above  the  melting  point  of  the  carrier 
uid  (200  K)  which  can  be  attributed  to  a  very  small  volume  fraction  of  superparamagnetic 
rticles  in  the  system.  The  peak-to-peak  line  width  for  both  low  and  high  field  cooled 
rrfigurations  show  an  increase  with  decreasing  temperature.  This  observed  behaviour  has  been 
plained  by  considering  various  energy  terms  which  contribute  to  the  line  width. 

ywords.    Magnetic  fluid;  ESR;  temperature  dependence. 
LCS  Nos    76-50;  75-50;  75-60;  75-30 

Introduction 

erically  stabilized  colloidal  dispersions  of  ultra-fine  single  domain  ferro-  or  ferri- 
agnetic  particles  resemble  superparamagnetic  materials  at  room  temperature.  These 
lloids  called  magnetic  fluids  or  ferrofluids  do  not  exhibit  hysteresis  behaviour  or 
>ercivity  [1].  Under  the  action  of  externally  applied  gradient  magnetic  fields  such 
Huid  can  develop  structural  order  similar  to  those  in  crystalline  solids  [2]  or  dipolar 
asses  [3]  and  hence  it  can  serve  as  a  model  system  to  study  characterization  of 
•ndensed  materials.  Magnetic  and  Mossbauer  measurements  have  been. extensively 
ed  to  study  the  superparamagnetic  behavior  of  particles  dispersed  in  liquid  carriers 
-8].  It  has  been  demonstrated  that  electron  spin  resonance  (ESR)  is  a  complimentary 
ol  for  the  investigation  of  ultra-fine  colloidally  dispersed  magnetic  particles  [8-11]. 
sually,  ESR  spectra  of  magnetic  fluids  are  recorded  at  room  temperature  and 
:ak-to-peak  linewidth  (AHpp)  and  the  resonance  field  (Hr)  are  also  reported  for 
iperparamagnetic  particles  dispersed  in  solidified  kerosene  [12].  A  magnetic  fluid 
:hibits  both  superparamagnetism  at  one  temperature  (above  the  melting  point  of  the 
irrier  liquid,  Tm)  and  may  exhibit  magnetic  anisotropy  at  lower  temperatures 
spending  upon  the  particle  size  [1].  Hence  it  will  be  of  interest  to  investigate  the 
mperature  dependence  of  AHpp  from  room  temperature  to  well  below  the  melting 
)int  of  the  carrier. 

In  this  paper  we  report  the  results  of  our  investigation  of  ESR  spectra  of  a  laboratory 
rnthesized  magnetic  fluid,  Zn0.1Fe0.9Fe2O4  for  different  temperatures. 
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2.  Experimental 

There  are  a  number  of  preparation  methods  available  for  the  synthesis  of  fine 
particles  [13-17].  In  the  present  case  we  have  used  the  co-precipitation  techni 
prepare  ultra-fine  particles  of  Z^^Pe^Fe-jC^.  Analytical  reagent 
FeCl36H,O,  ZnCl,  and  FeSO47H,O  were  used  to  obtain  Fe3+,  Zn2+  and  Fe2 
in  the  aqueous  solution.  An  aqueous  solution  containing  these  ions  in  the  appn 
molar  proportions  (Fe3  + :  Fe2  +  :Zn2  +  =  2:0-9:0-1)  was  added  to  8M  NH4OH  sc 
at  room  temperature  under  constant  stirring.  The  pH  was  adjusted  to  10-5.  A 
hour,  aliquot  of  the  reaction  mixture  was  collected  for  X-ray  and  EDAX  studies 
acid  was  added  to  the  mixture  and  the  fluid  was  again  stirred  for  one  hour, 
which  time  the  pH  was  kept  at  10-5.  The  fluid  was  then  heated  to  90°C  for  5m 
cooled  to  room  temperature  (2°C/min).  The  sample  was  washed  a  number  of  tim 
distilled  water  and  finally  washed  with  acetone.  This  acetone  wet  slurry  was  dis 
in  kerosene  and  heated  to  65°C,  in  order  to  remove  acetone.  The  sample  wa 
below  200  K  and  liquid  above  this  temperature. 

The  percentage  substitution  of  Zn  was  verified  by  EDAX  and  found 
9-8  +  0-3%.  To  characterize  the  formation  of  single  phase  spinel  structure, 
diffraction  pattern  was  recorded  using  Philips  X-ray  diffractometer  with  CuK 
ation.  The  structure  was  found  to  be  single  phase  fee  spinel  structure  with  a 
parameter  of  8-36  A  (figure  1).  The  average  particle  size  was  calculated  using  Set 
formula,  using  the  full  width  at  half  maximum  intensity  of  the  plane  (3  1 1) 
pattern.  The  value  thus  obtained  (after  accounting  for  the  instrumental  line  br 
ing)  was  100  A.  The  specimen  was  also  examined  under  AEI  Cornith  Transr 
Electron  Microscope  (TEM).  The  particles  were  found  to  be  almost  spheric; 
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Figure  1.     X-ray  diffractogram  of  Zn-substituted  ferrite  sample  (ZFO). 
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ESR  spectra  of  a  magnetic  fluid 


Figure  2a.     Electron  micrograph  of  ZFO  sample. 
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Figure  2b.     Particle  size  distribution  for  ZFO  sample.  9  Expt.  points.  — Log- 
normal  distribution. 


average  median  diameter  (Dm)  of  90  A  and  standard  deviation  (<r)  0-36.  Figure  2b 
shows  the  particle  size  distribution  pattern  using  these  values. 

Room  temperature  magnetization  measurements  were  carried  out  using  search  coil 
method  [18].  Figure  3  shows  the  reduced  magnetization  (M/MJ  versus  applied  field 
(H)  for  the  Zn0.1Fe0.9Fe2O4  magnetic  fluid  (ZFO)  at  room  temperature.  To  describe 
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Figure  3.    Room  temperature  M/MS  vs  H.  Line  through  the  data  point  is  a  th< 
cal  fit  using  Dm  =  98  A  and  a  =  0-38  [19]. 

the  magnetization  of  the  magnetic  fluids  several  theoretical  models  are  being  usec 
but  the  simplest  one  is  Langevin's  model.  Using  this  model  and  our  earlier  m 
[20,21]  M/MS  was  calculated  for  each  value  of  H.  Fit  to  the  data  points  in  fi{ 
yielded  the  values  of  median  diameter,  standard  deviation,  and  Ms  (saturation  mi 
ization  of  the  fluid)  as  98  A,  0-38  and  1 50  G,  respectively. 

ESR  spectra  were  recorded  using  an  X-band  Bruker  ESP-200  spectrometer. . 
temperature  cryogenic  system  attached  with  Bruker  ER  41 1 1  temperature  conl 
model  was  used  to  vary  the  temperature  from  100  K  to  room  temperature  (300  K 
original  fluid  of  150G  was  diluted  10  times  in  kerosene  and  two  sets  of  data 
recorded.  In  one  set  the  sample  was  cooled  to  77  K  in  the  residual  field  of  100  Oe  I 
while,  in  the  second  set  the  fluid  was  cooled  in  1-1  kOe  from  room  temperature  to 
In  both  cases,  the  spectra  were  recorded  during  the  heating  cycle.  Finally,  ESR  s] 
of  coated  particles  were  recorded  at  300  K  and  77  K. 


3.  Results  and  discussion 

3. 1  Lineshape  analysis 

The  conventional  approach  in  analyzing  the  width  and  lineshape  of  a  reso 
absorption  curve  is  to  use  either  the  Lorenzian  or  Gaussian  lineshape.  The 
difference  between  these  two  is,  Lorenzian  lineshape  is  slightly  sharper  in  the  cent 
decreases  much  more  slowly  in  the  wings  beyond  the  half  amplitude  or  first  deri 
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Figure  4.     ESR  curve  of  125  K. Lorentzian  fit. Gaussian  fit.  Q  expt.  points. 


points.  This  can  be  confirmed  by  using  the  value  of  peak  amplitude,  Ap,  AHpp  and  HT 
deduced  from  experimental  resonance  curve.  Figure  4  shows  a  resonance  curve  at 
125K  with  two  different  lineshapes.  It  is  evident  from  the  figure  that  the  Gaussian 
lineshape  given  by  [14], 


dP/dH  =  ApXexp[-  1/2 (X2  -  1)] 


(1) 


agrees  well  with  the  experimental  points,  where  X  =  (H  —  Hr)/(l/2A//pp)  and  H  =  ap- 
plied field.  Throughout  this  work  we  have  deduced  values  of  H,.,  A//pp  and  Ap  from  the 
best  fit  to  the  data. 

3.2  Resonance  spectra  T>  Tm 

Resonance  spectra  of  the  coated  powder  show  an  asymmetric  line  broadening  (figure  5) 
both  at  77  K  and  300  K.  The  observed  line  broadening  in  the  sample  may  arise  due  to 
random  orientation  of  the  easy  axes,  the  interparticle  dipolar  interactions  and  to 
unresolved  magnetostatics  modes.  But  the  spectrum  becomes  less  asymmetric  when  the 
same  particles  are  dispersed  in  liquid  carrier  like  kerosene.  This  indicates  that  the 
deviation  of  easy  axes  from  the  applied  field  direction  are  reduced  (this  is  because  of 
Brownian  motion  of  the  particles).  It  was  also  observed  that  the  line  width  decreases 
with  dilution  (varies  from  857  Oe  for  the  most  concentrated  fluid  to  650  Oe  for  ten  times 
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Figure  5.    Resonance  spectra  of  coated  powder  at  300  K  and  77  K. 
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Figure  6.    Typical  ESR  spectra  for  low  field  cooled  sample  ( T  >  Tm). theoreti- 
cal points  using  eq.  (1).  Q  is  expt.  points  (a)  214  K,  (b)  235  K,  (c)  245  K. 
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Figure  7.     Typical  ESR  spectra  for  high  field  cooled  sample  (T >  Tm)  (a)  235 K, 
(b)265K,(c)297K. 


diluted  fluid).  This  decrease  in  line  width  may  be  due  to  decrease  in  interparticle  dipolar 
field.  Similar  behaviour  was  also  observed  by  Sharma  and  Waldner  [9].  Figures  6  and  7 
show  some  typical  ESR  spectra  for  LFC  and  HFC  samples,  respectively,  for  T  >  Tm.  In 
both  the  cases,  an  additional  narrow  line  was  observed  in  the  vicinity  of  the  minimum. 
A  notable  feature  in  the  above  spectra  is  a  systematic  shift  of  the  kink  from  left  to  right 
of  the  minima.  This  kink  disappeared  below  the  melting  point  of  the  carrier  (figures  8 
and  9). 

Sharma  and  Waldner  [9]  observed  a  narrow  line  for  Fe3O4  ferrofiuid  in  a  non- 
aqueous  medium.  They  found  [9]  that  the  width  of  this  narrow  line  is  dilution-sensitive. 
The  reason  for  this  observed  dilution  sensitive  width  is  due  to  the  reduction  in 
interparticle  dipolar  interaction  upon  dilution.  From  the  present  spectra  (figures  6  and  7) 
it  is  very  difficult  to  say  whether  the  width  of  the  narrow  line  is  temperature  sensitive  or 
not.  There  are  two  possible  sources  which  can  give  a  narrow  line  in  ESR  spectra;  (i) 
presence  of  free  radicals  either  in  the  coating  liquid  or  paramagnetic  impurities  in 
carrier  liquid  and  (ii)  superparamagnetic  resonance  because  of  the  presence  of  very  fine 


Pramana  -  J.  Phys.,  Vol.  45,  No.  5,  November  1995 


425 


FIELD  (kOe) 
Figure  8.     ESR  spectra  of  LFC  sample  ( T  <  TJ  T  =  121  K. 


FIELD  (kOe) 
Figure  9.    ESR  spectra  of  HFC  sample  ( T  <  TJ  T  =  155  K. 


particles  in  the  system.  The  ESR  spectra  of  surfactant  and  carrier  show  the  absence  of 
•a  narrow  line  which  supports  that  this  line  is  not  due  to  free  radicals  or  any 
paramagnetic  impurities.  Therefore,  the  only  possible  reason  for  this  narrow  line  will  be 
due  to  very  small  fraction  of  very  fine  particles,  which  gives  superparamagnetic 
resonance  effect  at  a  given  temperature  and  field. 

In  a  magnetic  fluid  two  relaxation  mechanisms  can  occur  (i)  Brownian  rotational 
diffusion  characterized  by  relaxation  time  [22], 


(2) 
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and  (ii)  by  Neel  relaxation  with  relaxation  time  [23-24], 

tN  =  t0(KV/kT)l/2exp(KV/kT)    for     KV/kT>2 


=  t0(KV/kT) 


for     KV/kT«l. 


(3) 
(4) 


where,  Kh  and  Kare  the  effective  hydrodynamic  and  magnetic  volumes  of  the  particles, 
respectively,  q  is  the  viscosity  of  the  fluid,  K  is  the  anisotropy  constant,  k  is  the 
Boltzmann  constant  and  the  pre-exponent  factor  r0  has  a  value  of  approximately 
10~9s.  Therefore,  the  dominant  mode  of  magnetization  process  of  the  particle  will  be 
that  which  has  the  shortest  relaxation  time.  In  fact,  large  particles  relax  via  Brownian 
rotation  while,  small  particles  through  Neel  rotation  [1].  As  ferrofluids  contain 
a  distribution  of  particle  sizes  both  mechanism  will,  in  general,  contribute  to  the 
magnetization  for  T  >  Tm  with  an  effective  relaxation  time  [25] 

'eff  =  W(fN  +  fB)  ^  (5) 

and  superparamagnetic  (SP)  behavior  will  be  exhibited  by  those  for  which  fN  is  much 
less  than  the  characteristic  Larmor  precession  time.  Aharoni  [26]  has  considered  the 
effect  of  external  magnetic  field  on  SP  relaxation  time,  because  the  field  polarizes  the 
large  particles  and  reduces  the  size  for  SP  behavior.  However,  the  extremely  small 
particles  cannot  be  quenched  unless  a  very  large  magnetic  field  is  applied.  Therefore, 
one  expects  a  narrow  magnetic  resonance  signal  if  a  sample  contains  extremely  small 
particles  which  are  SP  state  in  an  applied  field,  H.  Of  course  the  width  of  the  line  may 
further  reduce  depending  upon  the  viscosity  of  the  fluid  (equations  3  and  4). 
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Figure  10.    Variation  of  peak-to-peak  line  width  for  LFC  (D)  and  HFC  (A) 
samples,  Line  is  fit  to  the  eq.  (14).  The  values  used  are  i)  L(LFC)=  l-45kOe, 


L(HFC)=l-OkOe. 

v)    £d=l-4xl(r18e 
2-62xl(T14erg[27]. 


'  =  5  x  10~19erg.  iii)  F  =  0-5K.  iv)  KV=3-9  x  10   14erg. 
g.    vi)    JmH    (LFC)  =  5-2  x  l(T17erg.    JmH    (HFC)  = 
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in  me  case  01  very  small  particles,  anisotropy  energy  (A  K,  wnere  A.  is  an  anisoiropy 
constant  and  V  is  the  volume  of  the  particle)  is  much  smaller  than  the  thermal  energy 
(kT).  Therefore  the  response  time  of  the  magnetic  moments  to  random  thermal  force 
will  be  [23] 

t^M.V/gkT  (6) 

where,  Ms  is  the  saturation  magnetization,  y  =  gyromagnetic  ratio.  If  this  fluctuation  is 
faster  than  Larmor  precession  time  tL,  effects  of  magnetocrystalline  anisotropy  are 
motionally  averaged  out  roughly  by  a  narrowing  factor,  /  =  fsp/f  L  which  is  equal  to  [9] 

f  =  (M,VHt)/(2*kT).  (7) 

From  (7),  it  is  clear  that  line  width  of  SP  resonance  would  be  proportional  to  the 
volume  of  the  particles.  This  is  confirmed  by  Aharoni  and  Litt  [27].  Adopting  the 
procedure  of  Sharma  and  Waldner  [9]  in  the  present  case,  the  particle  size  correspond- 
ing to  the  observed  line  width  will  be  41  A  (neglecting  the  interparticle  dipolar  fields)  for 
coated  particles  which  is  around  1-2%  volume  fraction  of  the  particles  in  the  system 
(this  is  obtained  from  particle  size  distribution  curve  for  the  present  sample,  figure  2b). 
This  agrees  with  the  observed  1  %  magnitude  of  SP-phase  in  comparison  with  the 
broad  signal.  All  the  above  observations  are  consistent  with  the  presence  of  small 
fraction  of  very  fine  magnetic  particles  in  the  system. 

3.3  Line  width  analysis 

Experimental  points  of  peak-to-peak  line  width  (AHpp)  for  LFC  and  HFC  configur- 
ations are  shown  in  figure  10.  The  line  width  increases  with  decrease  in  temperature. 
The  observed  variation  can  be  explained  as  follows. 

Let  us  first  consider  a  model  of  regularly  spaced  magnetic  centres  dispersed  in 
a  non-magnetic  matrix  and  they  are  SP  particles,  i.e.  net  spin  A/iS  »  0,  where  An  is  the 
net  population  of  spin  aligned  along  the  easy  axis  of  magnetization  and  S  is  the  effective 
spin  of  the  magnetic  centres.  Assuming  the  presence  of  two  level  system,  one  can  write 
the  ESR  line  width  as  [19] 

AHpp  =  L  tanh  [A£/2fcT]  (8) 

and 

R3,  (9) 


where  g  —  ^-factor,  ft  =  Bohr  magneton,  n  =  total  number  of  magnetic  centres,  R  = 
distance  between  the  adjacent  particles  and  E  is  the  energy  barrier.  Therefore  the  line 
width  depends  upon  R  and  A£,  which  in  turn  depends  on  the  particular  system  under 
study.  In  the  present  investigation,  we  are  studying  the  effect  of  temperature  on  the 
system.  Therefore,  R  remains  unchanged  and  the  main  contribution  to  line  width 
will  be  A£  term.  This  makes  us  to  consider  the  effect  of  various  terms  which  contributes 
to  A£. 

As  the  system  is  in  liquid  state  above  the  melting  point  of  the  carrier  (  Tm  «  200  K), 
one  can  write  AE  as 

A£  =  £v  +  £a  +  £m+£d.  (10) 
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The  first  term  in  (10)  is  the  energy  of  Brownian-orientational  motion  i.e.  viscosity 
energy  (Ev),  which  can  be  written  as 

£v  =  3fl;?[/h,  (11) 

where  0  is  a  coupling  constant.  As  T  approaches  Tm,  the  magnetic  particles  can  relax 
via  Brownian  rotation  in  addition  to  Neel  which  makes  viscosity  of  the  liquid  an 
important  parameter.  The  viscosity  study  of  this  fluid  (ZFO)  indicates  that  fluid  is 
Newtonian  and  the  change  in  viscosity  with  temperature  can  be  described  by  the 
Andrade's  equation 

^  =  5exP[F/(/yT-TJ}].  (12). 

Here,  B  is  a  constant.  F  and  /?y  are  apparent  activation  energy  for  the  flow  and  universal 
gas  constant,  respectively.  Substituting  (12)  into  (11),  the  viscosity  energy  is 

Ev  =  fl'exp[F/(T-  TJ],  (13) 

where  B'  =  30VhB  and  F  =  F/RS. 

The  second  term,  Ea,  is  simply  KV  while  Em  is  JmH,  where  J  is  a  coupling  constant 
and  m  is  the  magnetic  moment.  Substituting  all  these  values  in  (10),  we  can  write  (8)  as, 

AHpp  =  Ltanh[(l/2/<T){Fexp(F/(7-  TJ)  + KV  +  JmH  +  Ed}].     (14) 

The  line  through  the  data  points  in  figure  10  is  the  best  fit  to  (12).  It  was  found  that  K  V 
remains  constant  for  LFC  and  HFC  samples,  but  JmH  changes  dramatically  (nearly 
500  times).  This  is  due  to  the  fact  that  as  the  system  is  frozen  in  the  field  of  1-1  kOe,  the 
large  number  of  particles  is  oriented  in  the  direction  of  applied  field  and  hence  large 
value  of  JmH.  The  decrease  in  the  value  of  AHpp  also  supports  this  concept,  because  foi 
more  ordered  state  the  thermal  energy  required  to  create  a  disorder  is  large.  Therefore 
the  line  width  will  remain  constant  for  larger  temperature  region.  This  is  true  foi 
multidomain  structure,  where  the  AHpp  remains  nearly  constant  till  the  Curie  tempera- 
ture is  reached  [24].  The  observed  discontinuity  in  the  curve  indicates  the  effect  ol 
viscosity  on  the  line  width.  A  remarkable  feature  in  figure  10  is  that  above  the  melting 
point  of  the  carrier,  the  variation  of  line  width  is  the  same  for  LFC  and  HFC  samples 
which  shows  the  validity  of  eq.  (5). 

4.  Conclusion 

The  present  study  reveals  that  (i)  A  narrow  line  observed  in  room  temperature  anc 
above  melting  point  of  the  carrier  is  attributed  to  a  very  small  volume  fraction  o 
particles  present  in  the  system,  (ii)  The  percentage  of  very  small  particle  volume  fractior 
agrees  well  with  particle  size  obtained  from  electron  micrograph,  (iii)  Line  widtl 
increases  with  decreasing  temperature  for  both  low  and  high  field  cooled  samples. 
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Abstract.  The  dielectric  properties  of  nano-particles  of  ZnS  have  been  studied  over  a  tempera- 
ture range  from  300  to  525  K.  The  dielectric  constant,  dielectric  loss  and  ac  conductivity  of  the 
samples  are  larger  than  those  of  bulk  ZnS  crystals.  Dielectric  properties  of  composites  consisting 
of  nano-particles  of  Ag  of  different  concentrations  dispersed  in  nano-particles  of  ZnS  have  also 
been  studied. 

Keywords.    Nano-particles;  quantum  size  effect;  semiconductors. 
PACS  Nos    82-70;  73-20;  72-20 

1.  Introduction 

The  synthesis  and  characterization  of  small  particles  of  materials  is  currently  an  area  of 
intense  theoretical  and  experimental  research.  The  revelation  that  physical  properties 
of  such  systems  are  bound  to  be  appreciably  different  from  those  of  the  bulk  crystals 
excited  much  interest  in  the  study  of  micro-materials  termed  nano-particles',  small 
particles,  clusters,  microclusters  or  microcrystals  [  1  -3].  Quantum  size  effect  of  confine- 
ment of  electron  in  small  particles  has  been  theoretically  predicted.  It  is  expected  that 
quantum  size  effect  on  electronic  properties  will  be  pronounced  in  mesoscopically 
small  crystals  and  that  electrical  conductivity  of  small  particles  may  be  affected  by 
a  three  dimensional  quantum  size  effect  [4-6].  Experimental  studies  on  the  electrical 
properties  of  nano-particles  are  limited  [7].  The  present  paper  reports  the  dielectric 
properties  of  nano-particles  of  ZnS  and  those  of  composites  of  nano-particles  of  Ag  of 
different  concentration  dispersed  in  the  former,  over  a  temperature  range  of  300  to  525  K. 

2.  Experimental 

Nano-particles  of  ZnS  were  prepared  by  chemical  method  taking  into  advantage  its 
very  small  solubility  in  water.  All  the  chemicals  used  were  of  analytical  grade  and  were 
used  without  further  purification.  Freshly  prepared  aqueous  solutions  of  chemicals 
were  used  for  synthesis.  Nano-particles  of  ZnS  were  prepared  at  room  temperature  by 
dropping  simultaneously  10ml  of  1  M  solution  of  zinc  sulphate  and  10ml  of  1  M 
solution  of  sodium  sulphide  into  80ml  of  distilled  water  containing  10ml  0-1M 
solution  of  ethylene  diamine  tetraacetic  acid  (EDTA)  which  was  kept  stirred  vigorously 
using  a  magnetic  stirrer.  The  high  insolubility  of  ZnS  formed  out  of  the  chemical 
reaction  caused  the  formation  of  a  number  of  new  nuclei  while  preventing  the  growth  of 
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Table  1.    Composition  of  pellets  mixed  with  nano- 
particles  of  Ag. 

Sample  Wt%  of  Ag  Sample  code 


ZnS 

0 

ZnS-0 

5 

ZnS-1 

15 

ZnS-2 

20 

ZnS-3 

already  existing  ones,  thus  limiting  the  particle  size.  The  role  of  EDTA  was  to  stabilize 
the  particles  against  aggregation  which  may  lead  to  an  increase  in  the  particle  size. 
Nano-particles  of  silver  were  prepared  according  to  Lee  and  Meisel's  [8]  method. 

Composite  samples  were  prepared  by  dispersing  in  distilled  water  by  ultrasonication 
different  weight  percentages  (wt%)  of  nano-particles  of  Ag  and  ZnS.  The  suspensions 
were  concentrated  by  evaporating  off  the  liquid  phase.  The  particles  were  washed 
repeatedly  using  distilled  water  with  the  help  of  an  ultrasonic  disintegrator  to  remove 
any  adsorbed  ions.  The  particles  were  finally  dried  at  about  100°C.  Pellets  of  nano- 
particles  of  ZnS,  and  composite  pellets  with  nano-particles  of  Ag  dispersed  in  ZnS,  of 
diameter  13mm  and  thickness  1-2  mm  were  made  by  applying  a  pressure  of 
4  tonnes/cm2  in  a  hand  operated  hydraulic  press.  Both  the  faces  of  pellets  were  then 
coated  with  silver  electrodes  for  electrical  measurements.  For  the  purpose  of  reference, 
sample  codes  were  assigned  to  the  pellets  as  in  table  1. 

The  crystal  structure  of  particles  was  determined  by  X-ray  powder  diffraction  using 
a  Shimadzu  XD  610  powder  diffractometer.  The  average  size  of  the  particles  was 
determined  from  the  broadening  of  the  diffraction  lines  and  making  use  of  Scherrer 
equation  [9].  The  diffraction  pattern  of  ZnS  nano-particles  displayed  three  intense 
lines,  corresponding  to  the  wurtzite  (8H)  structure,  confirming  the  crystallinity  of  the 
particles  [10].  The  X-ray  diffraction  pattern  of  the  silver  particles  revealed  three  strong 
characteristic  peaks  of  Ag.  The  average  sizes  of  the  particles  determined  from  X-ray  line 
broadening  were  8  and  12nm  respectively  for  ZnS  and  Ag  nano-particles.  The 
diffraction  peaks  of  the  nano-particles  were  found  to  be  shifted  to  slightly  larger  angles 
than  expected  for  the  corresponding  bulk  crystals,  suggesting  a  possible  lattice 
contraction  [11]. 

Dielectric  constant  e,  dielectric  loss  (tan  <5),  and  ac  conductivity  cra(.,  of  the  samples  were 
determined  using  a  Hewlett  Packard  Model  41 92A  LF  impedance  analyzer.  Dielectric 
constant,  dielectric  loss  and  ac  conductivity  measurements  at  different  frequencies  from 
10  kHz  to  13  MHz  were  carried  out  over  a  temperature  range  of  300  to  525  K. 

3.  Results  * 

The  variations  of  dielectric  constant «,  dielectric  loss  (tan  6}  and  ac  conductivity  aac  of 
the  samples  as  a  function  of  frequency  and  temperature  are  shown  in  figures  1  to  3.  The 
numerical  values  of  dielectric  constant  of  the  samples  at  different  temperatures  and 
frequencies  are  given  in  table  2.  It  is  seen  that  £  of  ZnS-0  is  almost  independent  of 
frequency  at  low  temperatures  (figure  1  a),  showing  a  very  slow  decrease  with  frequency 

432  Pramana  -  J.  Phys.,  Vol.  45,  No.  5,  November  1995 


3.5 


4.0 


4.5 


5.0  5.5 

log  10  f 


6.0 


6.5 


7.0 


Figure  la.     Variation  of  dielectric  constant  at  343  K  with  frequency  of  nano- 
particles  of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 
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Figure  Ib.    Variation  of  dielectric  constant  at  1  MHz  with  temperature  of  nano- 
particles  of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 


from  10-9  at  10 kHz  to  9-54  at  3  MHz.  The  general  trend  in  the  logs  vs  frequency  graph 
at  higher  temperature  is  similar  but  the  rate  of  decrease  in  the  value  of  £  with  frequency 
is  larger  at  high  temperatures.  £  of  ZnS-1  is  slightly  lesser  than  that  of  ZnS-O,  but 
increases  with  increase  in  the  wt%  of  Ag.  At  a  frequency  of  10  kHz,  s  of  ZnS-2  was  as 
high  as  about  127  and  7220  respectively  at  343  and  473  K,  the  values  for  ZnS-3  being 
459  and  12400  respectively.  £  of  all  the  samples,  at  low  as  well  as  high  frequencies, 
increased  with  temperature,  the  rate  of  increase  being  strongly  temperature  dependent 
for  ZnS-2  and  ZnS-3  samples  (figure  Ib).  The  numerical  value  of  e  of  ZnS-O  at  1  MHz 
increased  at  a  low  rate  from  10-1  at  343  to  1 1-4  at  473  K.  The  large  values  of  £  for  ZnS-2 
and  ZnS-3,  although  appear  to  be  anomalous  at  first  glance,  may  be  considered  to  be 
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Figure  2a.    Variation  of  dielectric  loss  factor  at  423  K  with  frequency  of  nano- 
particles  of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 
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Figure  2b.    Variation  of  dielectric  loss  factor  at  10  kHz  with  temperature  of 
nano-particles  of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 

representing  the  actual  behaviour  of  the  samples  from  the  fact  that  they  vary  regularly 
with  frequency  and  temperature. 

The  value  of  tan  <5  decreased  with  frequency  (figure  2a),  the  rate  of  decrease  being 
larger  at  larger  frequencies.  Tan  5  of  ZnS-0  and  ZnS-1  had  low  numerical  values  but 
increased  with  wt%  of  Ag  in  the  samples,  attaining  a  value  of  about  2  to  3  at  lower 
frequencies.  Tan<5  increased  only  slightly  with  temperature  for  ZnS-0  and  ZnS-1,  but 
for  ZnS-2  and  ZnS-3  it  was  strongly  frequency-  and  temperature-dependent.  At  lower 
frequencies,  it  increased  with  temperature  slowly  from  a  large  value  of  about  2, 
attaining  a  maximum  value  of  about  3,  then  decreased  and  again  showed  a  tendency  to 
increase  (figure  2b). 


434 


Pramana  -  J.  Phys.,  Vol.  45,  No.  5,  November  1995 


Dielectric  properties  of  nano-particles  of  ZnS 


bs-4 1- 

OJ 

O     cr 
—    "O 


-6 


-7 

3.5 


4.0 


4.5 


5.0  5.5 

log  10  f 


6.0 


6.5 


7.0 


Figure  3a.    Variation  of  ac  conductivity  at  343  K  with  frequency  of  nano-particles 
of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 
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Figure  3b.    Variation  of  ac  conductivity  at  1  MHz  with  temperature  of  nano- 
particles  of  ZnS  and  composites  with  different  wt%  of  nano-particles  of  Ag. 

The  ac  conductivity  (crac)  increased  with  both  frequency  and  temperature  (figures  3a 
and  3b).  crac  of  ZnS- 1  was  slightly  larger  than  that  of  ZnS-0,  but  its  rate  of  increase  with 
frequency  was  the  same  for  both  ZnS-0  and  ZnS-1.  <rac  of  ZnS-2  and  ZnS-3  increased 
almost  linearly  with  frequency.  As  regards  variation  with  temperature,  <rac  of  ZnS-0 
increased  at  a  low  rate  up  to  about  415  K  and  thereafter  the  increase  was'rapid.  The 
numerical^ value  of  crac  at  1  MHz  increased  from  about  4  x  lO^S-cnr1  at  343  K  to 
4-2  x  10  S-cm  '  at  473  K.  At  a  constant  frequency,  o"ac  increased  with  temperature  at 
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temperature  from  about  3  13  to  473  K 
with  increase  in  wt%  of  Ag.  „  of  ZnS 
larger  than  that  of  ZnS-0  and  ZnS  " 

4.  Discussion 
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Unlike  the  ionic  crystals  where  the  effective  field, 
for  semiconductors  Ec{f  should  be  essentially  equal  to  the  macroscopic  field  £,  i.e., 

This  leads  to  an  expression  for  E  of  the  form 
e-1  =47i(a/K), 

where  Kis  the  volume.  The  large  value  of  E  of  pellets  of  nano-particles  of  ZnS  compared 
to  the  corresponding  bulk  value  may  be  attributed  to  the  small  value  of  V  in  the  above 
expression.  The  large  values  of  e,  tan  6  and  <rac  of  nano-particles  of  ZnS  compared  to  the 
corresponding  bulk  values  may  be  due  to  the  small  volume  of  the  particles  [15]  and  the 
disorder  in  their  structure  due  to  excess  surface  stress  [18].  But  the  large  increase  in  the 
values  of  c,  tan  <5  and  erac  when  the  nano-particles  of  ZnS  are  mixed  with  nano-particles 
of  Ag  must  be  expected  to  be  caused  by  some  other  factor  which  causes  a  large  increase 
in  the  electronic  polarizability.  The  static  dielectric  constant  E  of  heteropolar  semicon- 
ductors like  CdS  and  ZnS  can  be  written  [15]  as 


where  £a  is  the  high  frequency  or  electronic  contribution  to  the  dielectric  constant 
(dielectric  constant  in  the  absence  of  lattice  vibrations)  and  g,  is  the  lattice  contribution 
to  the  dielectric  constant.  Samara  [15]  has  shown  that  for  compound  semiconductors, 
£  is  dominated  by  the  electronic  contribution  ea  and  the  lattice  contribution  £,  is 
considerably  smaller.  The  large  value  of  e  in  the  case  of  ZnS,  on  the  addition  of 
nano-particles  of  Ag  must  be  attributed  to  the  large  increase  in  £a,  and  the  enhanced 
magnitude  of  tan  8  must  be  due  to  the  disorder  in  the  structure  of  the  particles  [18]. 

5.  Conclusion 

The  dielectric  properties  of  nano-particles  of  ZnS  and  composites  with  nano-particles 
of  Ag  of  different  concentrations  dispersed  in  ZnS  have  been  studied  over  a  temperature 
range  of  300  to  525  K.  The  large  values  of  dielectric  constant,  dielectric  loss  and  ac 
conductivity  of  nano-particles  of  ZnS  compared  to  the  corresponding  values  of  bulk 
single  crystals  may  be  due  to  the  small  volume  of  the  particles  and  the  disorder  in  their 
structure. 
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Generation  of  picosecond  optical  pulses  from  single  heterostructure 
GaAs  diode  laser  and  their  emission  characteristics 
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Abstract.  Emission  characteristics  of  a  single  heterostructure  GaAs  diode  laser  are  reported 
using  a  simple  driver  circuit.  It  provides  a  single  picosecond  time  duration  optical  pulse,  a  pulse 
train  or  a  broad  optical  pulse  depending  on  the  amplitude  and  time  duration  of  the  electrical 
pump  pulse.  Results  show  that  relaxation  oscillation  frequency  depends  on  the  amplitude  of 
pumping  current  pulse  as  well  as  on  some  inherent  property  of  diode  laser,  which  seems  to  be  the 
level  of  impurity  in  lasing  medium.  Variation  of  relaxation  oscillation  frequency  with  amplitude 
of  current  pulse  shows  only  the  qualitative  agreement  with  the  reported  theoretical  predictions. 
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1.  Introduction 

Diode  lasers  are  a  low  cost,  compact  and  efficient  optical  source  which  can  provide 
optical  pulses  of  picosecond  time  duration  at  high  repetition  rate.  These  devices  play  an 
important  role  in  the  studies  of  ultrafast  processes,  optical  communication  as  well  as 
the  testing  of  temporal  behaviour  of  fast  opto-electronic  devices.  It  is  therefore 
important  to  study  in  detail  the  emission  characteristics  of  these  lasers. 

Picosecond  pulse  generation  by  semiconductor  lasers  has  been  investigated  by 
several- authors  [1-13].  Various  methods  have  been  used  to  obtain  these  ultra  short 
pulses,  such  as  mode  locking  using  external  optical  resonator  [2]  or  by  simpler  gain 
switching  methods.  Two  methods  of  gain  switching  have  been  demonstrated  as 
injection  of  a  short  current  pulse  [1,4, 1 1]  or  a  sinusoidal  RF  wave  in  the  presence  of 
DC  bias  [5-10, 12].  Many  types  of  diode  lasers  emitting  in  visible  [13]  as  well  as  in 
infrared  [1-12]  wavelength  regions  have  been  used  for  this  purpose.  A  large  number  of 
investigations  related  to  picosecond  optical  pulse  generation  have  been  performed  by 
RF  current  modulation  technique  or  with  an  ultra  short  (picosecond)  time  duration 
electrical  pulses  [9]  in  the  presence  of  a  DC  bias  voltage.  Paulus  et  al  [9]  have  reported 
that  the  optical  pulse  width  from  laser  diode  first  decreases  and  then  remains  constant 
at  a  certain  minimum  with  an  increase  in  the  amplitude  of  the  pumping  current  pulse. 
However,  the  pulse  width  (optical)  first  decreases  up  to  a  certain  value  and  then 
increases  with  an  increase  in  the  DC  bias  current.  These  observations  have  been 
supported  by  numerical  analysis.  On  the  basis  of  theoretical  calculations,  MacFarlane 
[10]  has  reported  that  the  time  duration  of  the  optical  pulse  first  decreases  and  then 
attains  a  constant  value,  when  time  duration  of  electrical  pulse  was  decreased. 
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However,  the  nature  of  variation  remains  same  with  a  new  value  of  optical  time  duration 
when  the  DC  bias  current  was  changed.  According  to  Baker  [11]  the  optical  pulse 
generation  combining  the  relaxation  oscillation  phenomenon  of  laser  diode  with  large 
current  pulse  capacity  of  avalanche  transistors  could  lead  to  optical  pulses  of  <  lOOps 
pulse  width.  It  has  been  reported  that  if  the  current  impulse  of  the  avalanche  transistor 
returns  to  zero  before  the  occurrence  of  the  second  oscillation  of  the  laser  diode,  an 
optical  impulse  will  be  generated  and  if  a  current  step  is  applied  to  the  laser  diode  a  train 
of  pulses  with  decreasing  optical  amplitude  will  be  generated.  In  another  investigation 
Yariv  [8]  and  Lau  et  al  [7]  have  presented  an  analysis  about  the  relaxation  oscillation 
frequency  observed  in  the  laser  diode,  which  was  found  dependent  on  the  amplitude  of 
the  pumping  electrical  pulse  as  well  as  on  the  power  emitted  from  the  laser  diode 
respectively.  The  latter  relation  fits  better  with  the  experimentally  observed  data. 

The  aim  of  this  study  is  to  verify  the  results  of  Paulus  et  al  [9],  Baker  [11],  Yariv  [8] 
and  Lau  et  al  [7]  when  the  laser  diode  is  operated  using  a  pumping  pulse  of 
subnanosecond  (~800ps)  to  15ns  time  duration.  For  this  purpose,  a  single  hetero- 
structure  GaAs  laser  diode  was  used  to  investigate  the  detailed  emission  characteristics 
of  the  laser  diode  with  the  amplitude  and  pulse  width  of  the  pumping  electrical  pulse. 

2.  Experimental  details 

Short  duration  optical  pulses  were  generated  [1 4, 1 5]  by  a  single  heterostructure  GaAs 
laser  diode  (LD-62)  obtained  from  Laser  Diode  Incorporated,  USA.  Laser  diode  was 


ft  LASER 

"  DIODE: 


EXTERNAL         °-luF 

TRIGGER 

INPUT 


LASER   DIODE  DRIVER 
1.      firrnit  Hiaoram  nfHinHp  lacpr  Hrii;<»i- 


driven  by  a  simple  driver  (figure  1)  where  current  pulse  was  generated  by  discharging 
a  capacitor  C  (charged  up  to  ~600V)  by  switching  the  stack  of  two  transistors 
(2N5551)  connected  in  series.  Two  transistors  were  used  to  increase  the  amplitude  of 
the  current  pulse.  These  transistors  act  as  a  switching  element  when  operated  in 
avalanche  mode  and.were  selected  carefully  such  that  they  had  same  rise  time  ( ~  1  -5  ns) 
and  breakdown  voltage  (~300V).  It  was  also  noticed  that  the  best  transistors  [16] 
were  those  having  hFE  value  ~  120.  However,  the  correlation  of  transistor  parameter  /i,,,.: 
with  the  avalanche  mode  breakdown  was  not  known.  The  actual  peak  current  passing 
through  the  laser  diode  was  adjusted  by  changing  the  charging  voltage  or  the  value  of 
the  capacitor  C.  This  follows  from  the  fact  that  the  injection  current  to  the  diode  is 
decided  by  the  product  of  C  and  the  charging  voltage  K  whereas  the  duration  of  the 
current  pulse  is  decided  by  the  product  of  C  and  the  resistance  in  the  discharging  path. 
The  rise  time  of  the  pulse  is  decided  by  the  inductance  of  the  circuit  which  was 
minimized  in  this  case  using  a  specially  designed  PCB.  The  laser  diode  was  mounted  on 
an  aluminium  plate  which  acts  as  a  heat  sink  for  the  laser  diode.  The  above  ci  rcuit  could 
be  triggered  by  a  pulse  of  ~  5  V  amplitude,  but  we  used  ~  24  V  amplitude  trigger  for 
a  reliable  triggering  with  a  reduced  time  jitter.  The  trigger  pulse  was  applied  through 
a  fast  transmission  line  pulse  transformer  to  the  base  of  one  of  the  transistors  in  the 
stack.  The  pulse  transformer  was  made  by  wrapping  four  turns  of  co-axial  cable  of  50  Q 
impedance  where  the  outer  shield  acts  as  primary  and  the  central  wire  as  secondary  of 
the  transformer.  It  was  operated  either  in  manual  single  shot  mode  or  at  ~  10  Hz 
repetition  rate.  The  output  emission  from  laser  diode  was  detected  using  MRD  510 
photodiode  in  combination  with  100  MHz  digital  storage  oscilloscope  L  and  T  Gould 
model  4074  (400  MS/S  sampling  rate)  and  a  newly  developed  picosecond  optical  streak 
camera  [17]  having  51  photocathode.  This  streak  camera  provides  a  streak  speed  of 
~  15  mm/ns  with  a  time  resolution  ^  1 7  ps.  The  schematic  diagram  of  the  experimental 
set  up  to  record  the  optical  pulses  from  laser  diode  using  streak  camera  is  shown  in 
figure  2.  The  image  from  the  intensifier  screen  was  captured  using  a  CCD  camera  where 


SYNCRONOUS  TRIGGER    PULSE 

SCHEMATIC   OF  OPTICAL  STREAK   CAMERA 
Figure  2.    Schematic  diagram  of  experimental  set  up. 
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an  intensity  pronie  was  obtained  on  a  PC  monitor  screen  witn  trie  neip  01 

(Profile  Measurement  of  Image  Size  and  Edge  Location)  software  programme  [18]. 

3.  Results  and  discussion 

3. 1  Electrical  properties  of  diode  laser  driver 

A  simple  fast  rising  current  pulse  is  used  to  pump  the  diode  laser  to  get  a  short  duration 
optical  pulse.  These  fast  electrical  pulses  were  obtained  by  the  circuit  given  in  figure  1  as 
described  in  previous  section.  However,  before  the  study  of  emission  characteristics  of 
diode  laser,  study  of  electrical  pulse  is  necessary.  Two  combinations  of  avalanche 
transistors  were  used  as  a  switching  element  to  drive  the  diode  laser.  In  the  first 
combination  only  two  transistors  connected  in  series  acts  as  the  switching  element  for 
discharging  the  capacitor  C.  Actual  shape  of  current  pulse  was  obtained  by  recording 
the  shape  of  voltage  pulse  across  1-5  Q  resistance  connected  as  a  dummy  in  place  of  diode 
laser.  Electrical  pulses  were  recorded  by  a  digital  storage  oscilloscope  L  and  T  Gould 
model  7074  (400  MS/S)  which  has  low  temporal  resolution  ~  3-5  ns.  Figure  3  shows  the 
shape  of  electrical  pulse  obtained  across  1-5Q  when  C  =  120pF  capacitor  charged  to 
600  V  was  discharged  by  triggering  one  of  the  transistors  in  the  stack.  This  provides  an 
electrical  pulse  of  ~10ns  time  duration  along  with  some  of  the  oscillating  pulses 
having  decreasing  amplitude.  An  increase  in  the  value  of  C  as  60, 1 00  and  270  pF  shows 
corresponding  increase  in  time  duration  of  the  electrical  pulse  as  ~  6, 8  and  1 5  ns  which 
is  expected.  It  was  also  noted  that  any  change  in  the  charging  voltage  does  not  show  any 
effect  on  the  FWHM  of  the  electrical  pulse.  Minimum  rise  time  of  ~  1  ns  and  pulse 
duration  ~  3  ns  were  measured  for  the  capacitors  of  value  ^  20  pF.  However,  20  pF 
capacitor  in  two  transistor  circuit  did  not  provide  detectable  intensity  from  diode  laser. 
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Figure  3.  Oscillogram  of  the  electrical  pump  pulse  when  series  stack  of  two 
transistors  (2N5551/2N5550)  were  used  as  switching  element  and  the  optical  pulse 
obtained  from  diode  laser. 
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Figure  4.  Oscillogram  of  the  pump  pulse  when  series  stack  of  nine  transistors 
(2N5551/2N5550)  were  used  as  switching  element  in  the  driver  circuit.  Capacitor 
was  charged  up  to  (a)  2-7  kV,  (b)  2-5  kV. 


It  was  observed  that  for  low  value  of  capacitor  higher  charging  voltage  is  required 
which  is  possible  only  by  increasing  the  number  of  transistors  in  the  stack. 

In  order  to  improve  the  rise  time  and  time  duration  of  the  pump  electrical  pulse  another 
combination  of  switching  element  was  used  which  has  a  series  stack  of  nine  transistors 
(2N5551/2N5550).  This  circuit  uses  20  pF  capacitor  charged  up  to  2-7  kV.  Electrical 
pulses  obtained  from  this  circuit  for  the  charging  voltage  of  2-7  kV  (figure  4a)  and 
2-5  kV  (figure  4b)  were  recorded  by  Tektronix  model  7104  oscilloscope.  Figure  4a  has  a 
rise  time  and  time  duration  of  the  pulse  as  ~  600  ps  (10%-90%)  and  ~  800  ps  respectively 
whereas  figure  4b  shows  the  rise  time  as  ~  800  ps  and  time  duration  as  ~  900  ps.  These 
results  indicate  that  rise  time  and  duration  of  the  pulse  get  improved  by  increasing  the 
number  of  transistors  in  the  switching  stack.  A  small  decrease  in  charging  voltage  increases 
the  rise  time  and  the  pulse  duration  whereas  no  variation  was  observed  in  the  rise  time 
and  pulse  duration  of  the  electrical  pulse  in  the  case  of  two  transistors  stack.  However, 
similar  variations  are  expected  in  both  the  cases  but  the  small  variation  in  two  transistor 
case  may  not  be  observable.  This  improvement  in  the  time  duration  of  the  pulse  in  the 
case  of  increased  number  of  transistors  in  the  stack  seems  to  be  due  to  change  in  the 
value  of  capacitance  as  a  result  of  series  connection  of  charging  capacitor  and  junction 
capacitances  of  the  switching  transistors.  However,  rise  time  of  the  electrical  pulse  got 
improved  due  to  the  combined  effect  of  decrease  in  effectivejunction  capacitance  of  the 
switching  transistors  and  decrease  in  the  inductance  of  discharging  path  in  the  circuit. 

3.2  Emission  characteristics  of  LD-62  diode  laser 

The  emission  characteristics  of  the  diode  laser  was  investigated  to  understand  the 
relationship  between  the  driving  current  pulse  at  high  amplitude  and  the  shape  of  the 
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Figure  5.     Variation  of  optical  intensity  emitted  from  laser  diode  with  pumping 
current  pulse  (/0//Th). 
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Figure  6.     Variation  of  optical  pulse  width  with  the  amplitude  of  the  pump  current 
(/O//TH)- 


light  pulse.  Figure  3  shows  the  shape  of  the  current  pulse  obtained  after  discharging 
a  capacitor  of  ~  120pF  and  the  optical  pulse  detected  using  photodiode  MRD  510. 
Time  duration  of  the  electrical  and  optical  pulse  was  measured  as  ~  10ns  and  ~  5  ns, 
respectively.  Here  one  can  notice  that  the  oscillations  observed  in  the  electrical  current 
pulse  did  not  generate  corresponding  optical  pulses.  However,  at  a  higher  current 
value,  the  amplitude  of  oscillations  in  current  pulse  crosses  the  lasing  threshold  and 


Picosecond  optical  pulses 

provides  corresponding  optical  pulses.  Figure  5  shows  the  variation  of  the  optical  pulse 
amplitude  with  an  increase  in  the  ratio  of  injection  current  and  the  threshold  lasing 
current  /0//Th  for  discharging  capacitor  of  value  ~  120  and  270  pF.  When  charging 
voltage  on  the  capacitor  decreases,  then,  as  a  result  of  decrease  in  the  current  pulse 
amplitude,  the  pulse  width  (figure  6)  as  well  as  intensity  of  the  optical  pulse  decreases 
(figure  5).  It  was  found  (figure  5)  that  for  getting  a  similar  photodiode  signal  with  both 
the  charging  capacitors,  required  current  (charging  voltage)  is  more  in  the  case  of 
smaller  value  of  charging  capacitor  (~  120pF).  Even  the  threshold  for  lasing  is  also 
high  in  the  case  of  lower  value  of  capacitor.  An  increase  in  the  duration  of  the  optical 
emission  with  large  amplitude  and  ns  time  duration  pumping  current  pulse  (figure  6) 
may  be  due  to  two  reasons:  firstly,  due  to  an  increase  in  the  temperature  as  a  result  of 
large  current  passing  through  the  diode  which  is  in  agreement  with  the  earlier  reported 
results  [1,4],  and  secondly,  due  to  the  occurrence  of  multiple  pulsation  or  relaxation 
oscillations  [3,7-13]  which  will  be  discussed  in  detail  afterwards.  The  observation  of 
figure  6  is  contrary  to  the  experimental  and  numerical  results  reported  by  Paulus  et  al 
[9]  where  picosecond  pulses  were  used  for  pumping  the  laser  diode.  It  seems  that  the 
small  time  duration  pumping  pulses  are  not  producing  multiple  pulsation  or  producing 
it  at  very  large  amplitude  of  exciting  current  pulse.  However,  first  decreasing  (transition 
from  spontaneous  to  stimulated  emission)  and  then  increasing  (due  to  multiple 
pulsation)  nature  in  optical  pulse  duration  has  been  reported  with  an  increase  in  the 
DC  bias  voltages  [9]  in  the  presence  of  a  picosecond  pump  pulse.  We  are  getting  only 
increasing  nature  in  optical  pulse  width  with  an  increase  in  the  amplitude  of  the  current 
pulse.  However,  a  small  increase  in  optical  pulse  width  at  lower  amplitude  of  the 
current  pulse  is  possible  near  the  threshold  of  lasing  (near  breakdown  voltage  of 
transistor  stack)  in  our  case  also.  Because  a  decrease  in  charging  voltage,  below 
breakdown  voltage  of  transistor  stack  (to  decrease  the  current)  increases  the  rise  time 
and  pulse  duration  of  the  current  pulse  (figure  4)  and  consequently  the  time  duration  of 
the  optical  pulse.  However,  this  change  may  not  be  observable  due  to  the  band-width 
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Figure  8.     Single  optical  pulse  from  laser  diode  recorded  using  streak  camera. 

limitation  of  the  recording  oscilloscope.  A  comparison  of  our  results  with  those  of  the 
Paulus  et  al  [9]  indicates  that  the  large  amplitude  of  the  pumping  pulse  combined  with 
its  large  time  duration,  has  similar  effect  as  the  DC  bias  has  in  the  presence  of 
picosecond  time  duration  electrical  pulses.  Figure?  shows  the  photodiode  signal 
recorded  in  equivalent  time  sampling  mode  at  an  injection  current  ratio  /0//Th  ~  1'32 
which  provides  a  broad  optical  pulse  along  with  a  modulation  of  intensity  at  the  top  of 
the  pulse.  The  small  time  duration  pulsation  occurs  at  nearly  equal  time  intervals  which 
is    ~~  400  ps  in  the  present  case.  Further  increase  (decrease)  in  the  current  pulse 
amplitude  shows  corresponding  decrease  (increase)  in  the  time  interval  between  the 
pulsations.  A  single  optical  pulse  (figure  8)  of  ~  180  ps  duration  was  observed  using 
a  streak  camera  when  laser  diode  was  operated  just  above  the  threshold  current  value. 
However,  smallest  optical  pulse  duration  of  ~  120ps  was  recorded  by  discharging 
a  20  pF  capacitor  through  diode  laser  (figure  4).  At  J0//Th  ~  1-32  a  pulse  train  having 
pulses  of  duration  ~  1 80  ps  was  observed  on  the  streak  camera  separated  with  the  time 
interval  of  ~  400  ps  (figure  9a).  Here  again  we  note  that  the  time  period  between  the 
pulses  decreases  with  an  increase  in  /0//Th.  It  was  observed  that  at  large  injection 
current  the  optical  pulse  gets  broadened  (pulses  merge  together)  in  such  a  way  that  the 
train  of  the  pulses  got  converted  into  a  single  large  time  duration  pulse  with  intensity 
modulation  at  the  top.  The  broad  optical  pulse  has  a  similar  shape  as  that  of  the  current 
pulse.  These  observations  indicate  that  emission  of  a  short  duration  single  pulse,  a  pulse 
train  or  a  broad  pulse  with  intensity  modulation  are  the  function  of  ratio  I0/Irb.  The 
difference  in  the  shape  of  pulses  at  70//Th  ~  1-32  in  figures  7  and  9a  is  due  to  the 
difference  in  the  temporal  resolution  of  the  recording  systems.  Experiments  were 
repeated  many  times  and  results  were  found  reproducible.  The  time  duration  of  these 
optical  pulses  were  changed  using  different  values  of  the  capacitors.  Figures  9a  and  9b 
show  the  intensity  profile  of  two  similar  laser  diodes  connected  in  parallel  and  operated 
with  the  same  current  pulse  in  single  shot  operation  by  pumping  the  diodes  simulta- 
neously. A  slight  change  was  noticed  in  the  intensity,  time  period  of  pulsation  and  time 
duration  in  the  case  of  single  pulse  or  in  the  pulse  train  when  different  laser  diodes  of  the 
same  specifications  were  tested  using  the  same  current  pulse.  In  order  to  see  the  effect  of 
improvement  in  the  rise  time  and  shortening  the  time  duration  of  the  pumping 
electrical  pulse  on  the  emitted  optical  pulse,  we  used  a  short  time  duration  electrical 
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Figure  9.    Multiple  pulsation  emitted  from  laser  diode  (for  /0//Th~  1-32)  and 
recorded  using  streak  camera,  (a)  Laser  diode  I;  (b)  Laser  diode  II. 
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Figure  10.  Contact  photograph  of  multiple  pulsation  when  diode  laser  was 
pumped  by  discharging  a  20  pF  capacitor  charged  at  2-7  kV  using  a  series  stack  of 
nine  transistors  (2N5551/2N5550)  as  a  switch. 

pulse  (figure  4)  for  pumping  the  diode  laser.  Figure  1 0  shows  the  contact  photograph  of 
the  laser  diode  emission  taken  using  a  streak  camera  on  the  polaroid  film,  which  has 


and  rise  time.  Here  again  we  observed  that  duration  of  optical  pulses  increases  when 
charging  voltage  was  decreased  from  2-7  k  V  to  2-5  k  V.  This  behaviour  is  similar  to  what 
has  been  observed  in  the  two-transistor  case.  It  was  however  noted  during  these 
experiments  that  multiple  optical  pulses  of  picosecond  time  duration  were  obtained 
always  whenever  the  laser  diode  was  driven  by  a  large  amplitude  current  pulse  obtained 
by  switching  the  transistors  in  the  avalanche  mode  (figure  9)  whereas  it.  provided 
a  single  optical  pulse  of  picosecond  time  duration  with  a  slowly  rising  intensity  at  the 
leading  edge  of  the  pulse  (figure  8),  when  transistors  were  operated  below  avalanche 
breakdown  voltage  whether  a  two-transistor  or  nine-transistor  stack  was  used.  The 
slow  rise  in  the  intensity  at  the  beginning  of  the  optical  pulse  may  be  due  to 
spontaneous  emission,  which  occurs  as  a  result  of  an  increase  in  the  rise  time  and 
decrease  in  the  amplitude  of  the  electrical  pulse. 

3.3  Comparison  with  theoretical  results 

It  is  generally  accepted  that  the  modulation  band-width  of  the  semiconductor  laser  is 
equal  to  vrel,  the  relaxation  oscillation  frequency  [7, 8]  which  varies  from  laser  to  laser. 
However,  this  frequency  limits  the  useful  band-width  of  the  diode  laser  below  relax- 
ation oscillation  frequency.  Figure  1 1  shows  the  variation  of  relaxation  oscillation 
frequency  with  the  pumping  current  for  some  of  the  typical  parameters  of  the  diode 
laser.  Frequencies  have  been  calculated  using  a  widely  accepted  formula  [8]. 
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Figure  11.    Variation  of  measured  and  calculated  relaxation  oscillation  frequencies 
with  pump  current  (I0/^h). 
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where  /0  and  7Th  are  the  pumping  and  threshold  current  for  the  occurrence  of  laser,  ts 
and  TP  are  the  spontaneous  carrier  and  photon  life  time  respectively,  F  is  the  optical 
confinement  factor,  A  is  the  optical  gain  co-efficient  and  ntr  is  the  carrier  density  at 
which  the  material  becomes  transparent,  v  is  the  group  velocity  of  the  light,  a  is  the 
distributed  loss,  L  is  the  length  of  the  cavity  and  R  is  the  mirror  reflectivity. 

The  numerical  values  for  these  parameters  have  been  obtained  from  references  7  and 
8.  The  experimentally  measured  relaxation  frequency  data  with  pumping  current  has 
also  been  plotted  on  the  same  graph  for  comparison  purpose.  It  was  found  that 
experimentally  as  well  as  theoretically  obtained  data  are  in  qualitative  agreement,  that 
is,  frequency  increases  with  an  increase  in  the  pumping  current.  But  experimental  data 
do  not  fit  correctly  with  any  of  the  theoretically  obtained  curve.  Figure  12  shows  the 
variation  of  experimentally  observed  relaxation  oscillation  frequencies  with  the  square 
root  of  the  emitted  light  intensity,  which  is  proportional  to  the  photo  diode  signal. 
A  linear  variation  of  data  points  indicates  better  agreement  with  the  relation  given  by 
Lau  et  al  [7]  for  relaxation  oscillation  frequency  as 


(2) 


where  P0  is  the  steady  state  photon  density  in  the  active  region.  However  a  small 
deviation  can  be  seen  at  the  higher  value  of  current  in  this  experiment.  The  experimen- 
tal observations,  reported  by  Lau  et  al  [7]  for  a  burried  heterostructure  GaAs  laser 
operated  by  current  modulation  at  0-1-8-5  GHz  show  a  close  agreement  wfth  (2).  The 
data  obtained  from  other  laser  diode  (figure  9b)  also  show  similar  variation  but  provide 
entirely  different  frequencies  for  similar  pumping  current.  The  observed  pulsation 
behaviour  in  both  the  diode  lasers  (figures  9a  and  9b)  were  compared  with  the 
theoretical  simulation  results  reported  by  Chik  et  al  [3]  who  have  taken  into  account 
the  presence  of  defects  (saturable  absorber)  in  the  cavity.  Emission  in  figure  9b  has 
a  broad  profile  with  intensity  modulation  on  the  top  whereas  figure  9a  shows  a  higher 
order  of  modulation  depth  in  intensity  which  seems  to  be  due  to  the  presence  of  large 
density  of  defects  in  figure  9a.  The  relaxation  oscillation  frequency  obtained  in 
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Figure  12.     Variation  of  measured  relaxation  oscillation  frequency  with  the  square 
root  of  intensity  emitted  from  diode  laser. 
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of  defects  in  the  laser  cavity  plays  an  important  role  in  providing  self-sustained 
pulsation  behaviour  in  the  semiconductor  diode  laser. 

4.  Conclusion 

In  summary,  a  short  picosecond  time  duration  pulse  was  obtained  from  a  single 
heterojunction  GaAs  laser  diode  by  injection  of  a  current  pulse  of  time  duration 
ranging  from  subnanosecond  to  15ns  obtained  from  a  simple  circuit.  However  time 
duration  of  the  single  optical  pulse  depends  on  the  time  duration  of  the  pumping 
electrical  pulse.  The  shortest  optical  pulse  obtained  was  of  ^  120ps.  One  can  obtain 
a  single  pulse,  pulse  train  or  a  broad  pulse  with  intensity  modulation  on  the  top 
depending  on  the  amplitude  of  the  current  pulse  which  is  independent  of  the  electrical 
pulse  duration.  However,  threshold  of  lasing  is  dependent  on  pump  pulse  time  duration 
(value  of  capacitor,  C).  It  was  found  that  occurrence  of  relaxation  oscillation  or 
self-sustained  pulsation  does  not  depend  only  on  the  duration  of  electrical  pump  pulse 
as  is  reported  by  Baker  [11].  It  has  been  shown  that  the  high  amplitude  of  electrical 
pump  pulse  with  large  pulse  duration  in  these  experiments  and  the  electrical  pulse  of 
ultrashort  time  duration  with  a  DC  bias  in  the  case  of  Paulus  et  al  [9]  provide  similar 
results,  that  is,  an  increase  in  optical  time  duration.  Variation  in  optical  pulse  duration 
with  electrical  pulse  duration  was  found  in  agreement  with  the  results  of  MacFarlane 
[10].  Experimental  observations  further  indicate  the  need  for  modification  in  the 
theory  (consideration  of  impurity  factor)  to  explain  the  observed  variations  in  relaxa- 
tion oscillation  frequencies  with  the  amplitude  of  pumping  electrical  pulse.  However, 
the  study  of  laser  diode  (GaAs)  emission  with  a  better  time  resolution  may  provide 
more  important  information. 
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Abstract.  A  dispersion  relation  for  the  perpendicular  propagation  of  the  electromagnetic  ion 
cyclotron  wave  around  the  second  harmonic  of  the  deuterium  ion  gyrofrequency  in  a  mildly 
relativistic, anisotropic  Maxwellian  plasma  with  hydrogen  as  the  majority  species  and  deuterium 
as  the  minority  component  has  been  derived.  The  work  has  been  carried  out  in  the  frame  of 
reference  of  the  majority  hydrogen  ions;  to  these  ions  the  waves  at  2QD  would  be  at  its  own 
gyrofrequency. 

Using  a  small  quantity  t:  to  order  all  relevant  parameters  of  the  plasma,  it  was  possible  to  derive 
the  dispersion  relations  in  a  simple  form.  To  the  lowest  order  the  relativistic  factors  do  not  enter 
the  dispersion  relation.  The  plasma  can  now  support  two  modes — one  above  and  the  other 
below  the  hydrogen  gyrofrequency  in  agreement  with  the  assumptions.  This  was  also  verified 
numerically  using  a  standard  root  solver  thereby  justifying  the  correctness  of  the  ordering  scheme. 

In  the  next  higher  order,  the  dispersion  relation  is  a  quartic  equation  and  is  sensitively 
dependent  on  the  relativistic  factors.  The  plasma  can  now  support  four  modes,  both  above  and 
below  the  hydrogen  gyrofrequency  and  consistent  with  the  ordering  scheme  used.  However  the 
modes  can  now  coalesce  resulting  in  complex  conjugate  roots  to  the  dispersion  relation  thereby 
indicating  an  instability. 

The  advantage  of  such  a  scheme  is  that  two  dispersion  relations  -  one  of  which  is  independent  of 
the  relativistic  factors  and  the  other  which  is  sensitively  dependent  on  them  can  be  separated  out. 

Keywords.    Ion  cyclotron  waves;  two-ion  relativistic  plasma;  dispersion  relation. 
PACSNo.    52-40 


1.  Introduction 

Ohmic  heating  alone  is  not  sufficient  to  bring  a  plasma  to  its  ignition  conditions. 
Among  the  many  options  available,  supplementary  heating  by  waves  in  the  ion 
cyclotron  range  of  frequencies  (ICRF)  is  probably  the  most  common  scheme  at  present 
and  is  being  used  in  a  large  number  of  tokamaks  and  mirror  machines.  These  heating 
experiments  have  concentrated  on  the  use  of  RF  waves  at  frequencies  CD  both  above  and 
below  the  ion  gyrofrequency  Q{.  The  method  is  inherently  simple,  technologically 
attractive  and  has  proved  to  be  quite  successful  in  raising  plasma  temperatures. 

In  this  method  heating  is  accomplished  by  1C  damping  where  the  wave  energy  is 
absorbed  and  converted  to  particle  energy  and  is  therefore  a  hot  plasma  effect.  In  a 
pure  hydrogen  plasma  for  waves  around  the  first  harmonic  of  Qj,  the  lowest  order 
thermal  corrections  to  the  cold  plasma  terms  cancel  and  consequently  wave  absorption 
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physics  of  wave  damping  at  co  w  2QD  in  a  deuterium  plasma  was  found  to  be  dominated 
by  the  presence  of  a  small  amount  of  hydrogen  ions  (to  which  this  wave  is  at  its 
fundamental  gyrofrequency).  These  ions  played  a  dominant  role  in  the  propagation  and 
absorption  of  wave  energy.  Starting  with  modest  powers  ICRF  heating  is  at  present  an 
important  scheme  with  which  impressive  powers  of  a  few  tens  of  MW  have  been  achieved 
[2].  With  larger  and  larger  power  inputs  leading  to  higher  and  higher  energies,  relativistic 
effects  can  be  expected  to  play  an  important  role  in  ICRF  heating  experiments  in  future. 

As  mentioned  above,  fusion  plasmas  generally  have  various  types  of  ions  in  differing 
concentrations.  Of  the  many  scenarios  possible,  we  consider  a  mildly  relativistic  plasma 
containing  hydrogen  (H)  as  the  majority  species  and  deuterium  (D)  as  the  minority 
constituent  and  study  1C  propagation  at  the  second  harmonic  of  deuterium  ion 
gyrofrequency  in  it  (that  is  co  «  2QD).  Since  we  propose  to  work  in  the  frame  of  the 
majority  hydrogen  ions  this  wave  at  CD  %  2QD  would  be  at  its  own  gyrofrequency  QH. 

Past  analyses  of  dispersion  relations  for  1C  propagation  in  multi-ion  component 
plasmas  have  been  essentially  numerical  probably  because  of  the  complexity  involved 
in  an  analytic  calculation.  Unfortunately,  a  fully  numeric  solution  has  the  drawback 
that  the  cold  plasma  terms  dominate  the  relativistic  terms  of  the  dielectric  tensor  thus 
masking  many  interesting  features  of  wave  propagation  in  relativistic  plasmas.  Guided 
by  the  experimental  parameters  of  an  early  mirror  experiment  and  the  numerical 
solutions  of  the  dispersion  relations  we  have,  by  means  of  an  ordering  parameter 
£  derived  two  dispersion  relations.  To  the  lowest  order  we  arrive  at  a  quadratic 
equation  which  is  independent  of  the  relativistic  terms.  This  relation  supports  two 
modes,  one  above  and  the  other  below  the  hydrogen  gyrofrequency  QH.  The  next 
higher  order  dispersion  relation  is  a  fourth  order  equation  which  is  sensitively 
dependent  on  the  relativistic  terms.  This  relation  yields  four  modes  on  either  side  of 
QH  which  can  coalesce  to  make  the  plasma  unstable. 

The  advantage  of  such  an  analytic  scheme  is  that  we  have  been  able  to  separate  out 
two  dispersion  relations,  one  of  which  is  independent  of  the  relativistic  factors  while  the 
other  is  sensitively  dependent  on  them.  Such  a  separation  is  not  possible  in  a  fully 
numerical  calculation. 


2.    The  dielectric  tensor  elements 

We  are  interested  in  the  perpendicular  propagation  of  1C  waves  around  the  second 
harmonic  of  the  deuterium  ion  gyrofrequency  (CD  x  2QD),  in  a  mildly  relativistic  plasma 
with  hydrogen  as  the  majority  species  and  deuterium  as  a  minority  constituent.  The 
electrons  form  a  neutralising  background.  The  expressions  for  the  relevant  dielectric 
tensor  elements  have  been  derived  earlier  for  a  single  ion  plasma  [3];  we  extend  these  to 
a  two-ion  plasma.  We  too  choose  /0  to  be  a  bi-Maxwellian  which  is  given  by  [3] 
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Ion  cyclotron  instabilities  in  H-D  relativistic  plasma 

The  thermal  energies  u^  and  v*  are  related  to  the  temperatures  TL  and  7!,,  perpen- 
dicular and  parallel  to  the  magnetic  field  B0  respectively  by 

IT  2T 

"i  JL,    1     i  •)  £*    JL    t\ 

5>=-^and5J=-^,  (2, 

where  M  is  the  rest  mass. 

On  substituting  (1)  into  the  expressions  for  the  dielectric  tensor  elements  and 
carrying  out  the  various  integrations  we  get  the  following  expressions  for  Kxx,  Kvy  and 
K  .  A  detailed  derivation  of  these  elements  has  been  given  earlier  [3];  we,  however, 
give  only  the  final  expressions  to  facilitate  our  discussions. 

The  relevant  expressions  are, 
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In  the  above  the  first  summation  is  over  the  species  (H  indicating  hydrogen, 
D  deuterium  and  e  electrons). 
The  definitions  of  the  other  parameters  are 
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where  c  is  the  velocity  of  light,  N,  the  plasma  density  and  y  the  relativistic  factor. 

The  Ts  arise  from  the  di^  integrations  and  are,  in  general,  functions  of  the  modified 
Bessel  function  /M(/±)  [3].  The  argument  /_,_  of  /„  is  defined  by 
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The  explicit  functional  forms  of  the  Ts  are  given  in  [3].  Also  the  prime  on  the 
summation  in  Kyy  indicates  that  the  n  =  0  contribution  has  to  be  divided  by  2. 

3.    The  ordering  procedure 

We  are  interested  in  the  perpendicular  propagation  of  1C  waves  around  a)  %  2QD  in 
a  hydrogen-deuterium  plasma.  As  mentioned,  to  the  hydrogen  ions  these  waves  at 
(o  %  2QD  would  be  at  its  own  gyrofrequency  OH.  We  shall  thus  work  in  the  frame  of  the 
hydrogen  ions. 

The  tensor  elements  (3a)  to  (3c)  are  all  infinite  summations  and  hence  if  used  as  such, 
the  dispersion  relation  would  become  very  complicated.  Therefore,  to  arrive  at  a 
dispersion  relation  in  a  simple  form,  the  dielectric  tensor  elements  Ku  must  contain 
only  a  finite  number  of  terms  to  begin  with.  Such  a  truncation  is  possible  only  if  the 
individual  members  of  the  tensor  elements  become  progressively  lesser  than  1.  We, 
therefore,  define  a  small  parameter  e  to  scale  all  the  relevant  parameters  in  the 
individual  tensor  elements  and  the  final  dispersion  relation. 

A  quantity  which  is  small  and  therefore  convenient  for  the  definition  of  £  is  the  ratio 
of  the  rest  mass  of  the  electron  to  that  of  the  proton;  thus  c%(Mc/MH)1/2.  Having 
defined  e  we  next  characterize  the  wave  in  terms  of  this  small  parameter,  conforming 
with  experiments  over  a  wide  range  of  parameters.  Theoretically,  for  a  resonance,  the 
frequency  to  must  be  equal  to  OH;  experimentally,  however,  the  waves  usually  have 
a  small  range  of  frequencies  around  QH.  It  is  this  deviation  from  the  exact  value  of 
QH  that  we  scale  by  e. 

In  the  presently  available  ICRF  heating  experiments  the  parameter  (5)  is  much  less 
than  unity  [4]  and  hence  we  shall  scale  this  factor  also  in  terms  of  e;  that  is  /iD  «  £.  Such 
an  ordering,  while  agreeing  with  experiment,  also  has  the  advantage  that  the  exponen- 
tial and  modified  Bessel  functions  that  occur  in  (3a)  to  (3c)  can  be  expanded  as  a  power 
series.  We  also  make  the  simplifying  assumption  that  the  two  ionic  species  are 
approximately  in  temperature  equilibrium  while  the  electrons  are  relatively  colder  than 
the  ions.  Thus  T±H/T±D  %  1  (so  that  /1H  ^  e)  while  Tle/T1H  %  £.  However  the  electrons 
in  spite  of  being  colder  will,  because  of  their  smaller  mass,  be  more  energetic  and  thus 

-1  -t  --> 
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we  shall  assume  that  -^  ^  -^-  ^  &2  while  -^  =  s.  And  finally  in  many  typical  fusion 
c~       c~  c 

experiments  the  ion  plasma  frequency  is  much  larger  than  the  ion  gyrofrequency  so  that 
(l/tOpH)  %  a  Summarising,  we  have 
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We  shall  use  this  ordering  scheme  in  the  derivation  of  our  dispersion  relations. 

4.    The  tensor  elements  and  the  dispersion  relation 

The  expressions  for  the  dielectric  tensor  elements  Xxx,  KX).  and  Ar)T  can  be  derived  using 
the  orderina  scheme  of  (6)  and  the  relevant  exoressions  from  (3a)  to  (3d.  In  fact  thev  are 


lengthy  and  hence  we  shall  not  give  them  here;  however,  their  derivation  is  straight 
forward. 

Substituting  the  tensor  elements  into  the  formula  for  dispersion  relation  (equation 
(21)  of  [3])  and  simplifying  we  arrive  at  our  dispersion  relation  which  we  shall,  for  the 
moment,  write  as 

D(co,/cJ  =  D1(w,/c1;  KE2)  +  D2((o,k^  %s4)  =  0.  (7) 

Relation  (7)  indicates  that  our  dispersion  relation  has  to  be  written  as  a  sum  of  two 
parts  —  the  first  one,  namely  D  l  ,  containing  terms  of  order  e2  and  D2  terms  of  order  e4. 
Obviously  the  two  have  to  be  separately  equated  to  zero  and  solutions  obtained. 
The  dispersion  relation,  to  order  «2,  has  the  form 


« 

where  NcH  =  Ne/NH  etc,  and 


CO, 


1/1  2 

I    H  H.  r  7  ft.  T  \T  \T 

2         p1H      3  3 


/IH      4 
and 

ATT\!       T  1 

with 


P\H 

As  a  check  on  (8)  we  note  that  for  ND  =  0,  it  reduces  to  the  dispersion  relation  in  [3];  the 
small  differences  being  due  to  our  slightly  different  ordering.  Inspecting  (8),  we  find  that 
for  the  first  three  terms  to  be  of  order  e2  we  need  to  set  [6] 


lH 

With  this  the  last  term  and  a  factor  similar  to  (9)  in  8  in  (8)  is  of  the  order  e3  and  hence 
drops  out  of  it. 
The  dispersion  relation,  to  order  «2,  is  thus  given  by 
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•-/TH  =  o  (10) 

I  P.1H         *       ~"  J          2   ' 

where 


i  '     i /y  DH  ~  ->  JYDH         'YeH'vDH 

*1H         J  -5 


1  2 

3~2/VcH  +  3Nl 

Adding  the  higher  order  terms  in  (8)  and  in  6  to  D2(w,/c1:  «r4)  the  dispersion 
relation,  to  order  ;;4,  is  given  by 

a4  r4  +  «3  r3  -f  a2  r2  +  «,  r  +  «0  =  o.  ( 1 1 ) 

The  expressions  for  the  coefficients  in  ( 1 1 )  are  lengthy  and  are  therefore  given  in  an 
appendix,  namely  Appendix  A.  As  a  check  on  these  expressions  we  note  that  for  /VD11  —  0 
they  reduce  to  the  corresponding  expressions  for  a  single  ion  plasma  [5].  Here  too  a  term 


drops  out  of  (1 1)  as  it  is  now  of  order  t:5. 

5.    Discussion 

In  the  derivation  of  our  dispersion  relations  (10)  and  (11),  the  scaling  adopted  was  that 
of /1H  « /1D  «  K  as  the  two  ionic  species  were  assumed  to  be  in  temperature  equilibrium. 
It  would,  however,  be  interesting  to  also  consider  the  case  where  the  two  ionic  species 
have  appreciably  different  temperatures  and  the  consequent  changes  in  our  dispersion 
relation.  Such  a  situation,  though  algebraically  tedious,  can  easily  be  studied  by 
changing  the  ordering  of  /1H  and  /1D.  We  discuss  below  the  cases  where  the  majority 
hydrogen  ions  are  much  colder  than  deuterium  and  vice  versa. 

We  first  consider  Ta  „  ->0  so  that  /1H  «  «2  and  v^H/c2  %  K3.  However,  /1H//?iH  is  finite 
as  it  is  independent  of  temperature.  The  quadratic  equation  (10)  now  has  one  solution 
at  ZH(=o0/QH)=  1-0.  This  can  be  anticipated  physically  as  the  majority  species 
hydrogen  is  now  effectively  cold.  The  other  solution  is  still  dependent  on  the  finite 
temperature  effects  introduced  by  the  second  species  deuterium.  On  the  other  hand  the 
quartic  equation  (11)  reduces  to  a  cubic  equation,  the  characteristics  of  which  now 
depend  on  the  deuterium  density  and  temperature.  A  solution  of  this  dispersion 
relation,  for  the  parameters  given  below,  gives  one  real  root  and  a  pair  of  complex 
conjugate  roots  around  ZH  =  1  -0.  No  major  changes  in  the  form  of  these  equations  were 
introduced  when  TIM->0. 

No  major  changes  in  (10)  and  (11)  were  found  for  the  other  limiting  case  of  T1D->0 


6.     Results 

The  dispersion  relations  have  been  plotted  for  conditions  of  a  typical  mirror  experi- 
ment [7]. 
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Figure  1.  A  plot  of  ZH  versus  k±  rLD  for  T±J  TIH  =  0-023  and  T1D/  T1H  =  0-5  and 
7^JH  =  SOkeV.  The  dotted  lines  are  the  solutions  of  (10)  for  NH  =  2-67*  1013  cm"3 
and  a  deuterium  density  30%  of  that  of  hydrogen.  The  solid  lines  are  for  a  single  ion 
plasma  (NH  =  3-76318*  10l3cm~3  and  ND  =  0-Q).  The  dots  on  the  curves  are  the 
solutions  obtained  numerically. 


The  ions  in  the  experiment  had  a  temperature  T1H  =  50  keV  and  to  conform  to  our 
ordering  scheme  we  assume  a  slightly  tower  electron  temperature  of  T±e=  1-15  keV 
compared  to  the  observed  value  of  9  keV.  Since  the  electron  contributions  to  (10)  and 
(11)  are  only  minimal  this  assumption  does  not  greatly  alter  our  results.  The  other 
relevant  parameter  is  BQ  =  4*  104  G. 

Figure  1  is  a  plot  of  ZH  versus  k±rLD(rLD  is  deuterium  ion  Larmour  radius)  for 


Tie/  TIH  =  °'023  and  TI 


=  °'5-  Smce  'i  ~  £(  =  0-023),  we  are  restricted  to  a  small 


range  of  kLrir>  from  0-205  to  0-225.  Corresponding  to  the  midpoint  of  this  interval, 
/J1H  was  calculated  using  (9)  and  finally  NH  from  the  definition  of  jS1H.  The  dotted  lines 
are  the  solutions  of  (10)  for  JVH  =  2-67*  1013cm~3  and  a  deuterium  density  of  30%  of 
that  of  hydrogen.  We  find  that  there  are  two  modes  on  either  side  of  the  hydrogen  ion 
gyrofrequency  QH  in  agreement  with  our  assumptions.  The  lower  frequency  (LF)  mode 
approaches  QH  while  the  other  high  frequency  (HF)  mode  deviates  away  from  it  for 
increasing  k^rLD.  For  a  comparative  study  we  also  plot  these  modes  for  a  single  ion 
plasma  (NH  =  3-76318  *1013  cm"3  and  ./VD  is  0-0)  and  these  are  depicted  as  the  pair  of 
solid  lines.  We  find  that  the  curves  are  similar;  however  the  point  of  their  closest 
approach  decreases  with  increasing  deuterium  density. 
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Extensive  numerical  calculations  were  also  performed  to  confirm  the  above  results. 
The  routines  for  the  calculation  of  the  modified  Bessel  functions  in  the  tensor  elements 
(3a)  to  (3c)  were  obtained  from  the  well  known  text  numerical  recipes  [8].  The  returned 
values  of  e~  /J7n(/x )  were  then  rechecked  for  their  accuracy  against  their  values  available 
in  literature  [9].  The  dispersion  relations  were  then  set  up  using  (3a)  to  (3c)  with  the 
relativistic  terms  equal  to  zero  ((10)  is  a  non-relativistic  dispersion  relation)  and 
solved  iteratively  using  the  standard  root  solver  ZANLYT.  The  dots  on  the  curves 
in  figure  1  are  the  solutions  obtained  numerically.  We  find  the  agreement  to  be 
good  especially  for  the  LF  mode  where  they  are  almost  identical  with  the  analytic 
solutions.  Very  slight  variations  occur  for  the  HF  mode.  However,  on  the  whole  the 
numerical  solutions  confirm  the  soundness  of  our  ordering  scheme  and  the  correctness 
of  relation  (10). 

Figure 2  is  a  plot  of  our  quartic  relation  (11)  for  NH  =  2-9563 *1013 cm"3  (with 
a  deuterium  density  20%  of  this  value  and  depicted  by  solid  lines)  and  2-67  *  101 3  cm  ~ 3 
(deuterium  density  is  equal  to  30%  of  this  value;  depicted  by  dotted  lines),  the 
temperature  TXH  being  =  50keV  and  T±r)/T±H  =  0-5.  As  can  be  seen,  the  plasma  can 
now  support  four  modes  on  either  side  of  QH.  However  there  is  only  a  small  region  of 
wavelength  where  there  are  four  distinct  modes.  The  LF  and  HF  modes  then  coalesce 
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Figure  2.    A     plot 


of    quartic    equation    (11).    The    solid    lines    are    for 
3  and  a  deuterium  density  20%  of  that  of  hydrogen.  The 
dotted  lines  are  for  NH  =  2-6700*  1013  cm~3  and  a  deuterium  density  30%  of  that  of 
hydrogen.  The  other  parameters  are  the  same  as  in  figure  1. 
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mstaointy  lor  tne  nr  moae  ror  tne  latter  case,  untortunately  we  are  not  able  to 
demonstrate  any  numerical  confirmation  of  figure  2,  due  to  the  reasons  mentioned  in 
§  1.  The  non-relativistic  terms  "interfere"  with  and  even  dominate  the  relativistic  terms 
in  a  fully  numeric  calculation;  whereas  analytically  we  have  been  successful  in  isolating 
the  relativistic  from  the  non-relativistic  dispersion  relation. 

We  now  briefly  compare  these  results  with  that  obtained  for  a  single  ion  plasma  [5]. 
A  weakly  relativistic  single  ion  plasma  can  support  three  modes — a  pair  of  LF  modes 
which  coalesce  to  produce  an  instability  and  a  HF  mode.  However,  for  a  two  ion 
plasma  we  find  that  in  addition  to  the  three  modes,  the  second  ion  component 
introduces  another  HF  mode. 

7.    Conclusions 

We  have  in  this  paper  studied  the  propagation  and  stability  of  1C  modes  in  weakly 
relativistic  two-ion  component  plasma  with  hydrogen  as  the  majority  species  and 
deuterium  as  the  minority  species.  We  have,  by  means  of  an  ordering  parameter,  been  able 
to  separate  out  two  dispersion  relations  -  one  of  which  is  independent  of  the  relativistic 
factors  (and  having  stable  roots)  and  the  other  which  depends  on  the  relativistic  terms.  The 
non-relativistic  relation  has  two  solutions — a*HF  mode  and  a  LF  mode,  with  frequencies 
respectively  higher  and  lower  than  the  ion  gyrofrequency.  The  latter  dispersion  relation 
supports  four  modes — a  pair  of  HF  and  LF  modes  which  can  coalesce  to  make  the  plasma 
unstable.  The  relativistic  terms  are  responsible  for  the  instability.  The  results  for  the 
non-relativistic  case  have  also  been  verified  numerically  using  a  standard  root  solver. 
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Abstract.  In  this  paper  we  deal  with  the  measurement  of  parameters  of  the  gravitational  wave 
signal  emitted  by  a  coalescing  binary  system  of  compact  stars.  We  present  the  results  of  Monte 
Carlo  simulations  carried  out  for  initial  LIGO,  incorporating  the  first  post-Newtonian  correc- 
tions to  the  waveform.  Using  the  parameters  so  determined,  we  estimate  the  direction  to  the 
source.  We  stress  the  use  of  the  timc-of-coalescence  rather  than  the  time-of-arrival  of  the  signal  to 
determine  the  direction  of  the  source.  We  show  that  this  can  considerably  reduce  the  errors  in  the 
determination  of  the  direction  of  the  source. 

Keywords.    Gravitational  waves;  coalescing  binaries;  data  analysis. 
PACS  Nos    04-30;  04-80 


1.  Introduction  and  summary 

A  major  source  for  the  planned  interferometric  gravitational  wave  detectors  like  LIGO 
[1]  and  VIRGO  [2]  is  the  radiation  emitted  by  a  binary  system  of  compact  stars,  e.g. 
neutron  stars  or  black  holes,  during  the  last  few  minutes  of  its  evolution  just  before  the 
two  stars  coalesce.  The  interferometers  are  expected  to  have  high  enough  sensitivity  to 
detect  sources  at  cosmological  distances.  Since  the  waveform  emitted  during  a  binary 
coalescence,  often  called  the  chirp,  can  be  modeled  fairly  accurately  it  is  possible  to 
enhance  the  visibility  or  the  'signal-to-noise'  ratio  henceforth  denoted  as  SNR,  (defined 
later  in  the  text  in  (eq.  9))  of  the  signal  in  noisy  data  by  employing  powerful  data 
analysis  techniques  such  as  Weiner  or  matched  filtering  [3,4].  A  filter,  in  the  Fourier 
domain,  is  a  copy  of  the  expected  signal  weighted  by  the  noise  power  spectral  density. 
The  process  of  filtering  involves  obtaining  correlations  of  the  detector  output  with  a  set 
of  templates  which  together  span  the  relevant  range  of  the  parameters  of  the  waveform. 
On  filtering  the  detector  output  for  signal  of  known  shape  one  enhances  the  raw  SNR 
by  comparing  the  signal  energy  with  that  of  the  noise  as  opposed  to  the  case  of  a  burst 
signal  where  one  has  to  contend  with  the  signal  power  vis-a-vis  noise  power  [4].  The 
fact  that  the  signal  waveform  can  be  predicted  very  well  also  implies  that  the 
parameters  of  the  waveforrn  can  be  estimated  to  a  high  accuracy.  For  instance,  by 
tracking  the  radiation  emitted  by  a  neutron  star-neutron  star  (In  this  paper  a  neutron 
star  is  considered  to  have  a  mass  of  1-4  Mo,  and  a  black  hole  will  be  considered  to  have 
a  mass  of  10  M o.)  (NS-NS)  binary  starting  from  10  Hz  to  say  1  kHz,  the  masses  can  be 
determined  to  an  accuracy  better  than  a  few  per  cent  [5-9].  Observation  of  several 
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coalescence  events  could  be  potentially  used  to  determine  the  Hubble  parameter  to  an 
accuracy  of  -~  10%  [10].  Once  in  a  while,  when  an  event  produces  a  very  high  SNR 
( ^  40)  it  would  be  possible  to  infer  the  presence  of  such  non-linear  effects  as  tails  of 
gravitational  waves  in  the  signal  waveform  and  test  general  relativity  in  the  strongly 
non-linear  regime  [11,12]. 

When  a  signal  is  present  in  the  detector  output,  amongst  all  filters  the  filter  matched 
to  that  particular  signal  will,  on  the  average,  obtain  the  largest  possible  correlation  with 
the  detector  output,  thus  enabling  the  detection  as  well  as  the  estimation  of  parameters. 
The  parameters  of  the  template  that  obtains  the  maximum  correlation  are  unbiased 
estimates  of  the  actual  signal  parameters.  Of  course,  the  detector  output  also  contains 
noise  which  can  affect  the  correlation  thereby  giving  rise  to  spurious  maxima  even 
when  the  parameters  of  the  template  and  those  of  the  signal  are  mismatched.  Thus,  the 
measured  parameters  are  in  error.  These  errors  reduce  with  increasing  SNR.  In  the 
limit  of  high  SNR  the  covariance  matrix  gives  errors  in  the  estimation  of  parameters 
and  the  covariances  among  them.  Based  on  the  computation  of  the  covariance  matrix 
(for  the  advanced  LIGO),  it  is  now  generally  agreed  that  the  chirp  mass,  which  is 
a  combination  of  the  masses  of  the  binary  stars  (see  below  for  its  definition),  can  be 
determined  at  a  relative  accuracy  —  0.1%  —  1%  at  a  SNR  of  10.  The  reduced  mass,  as  it 
turns  out,  will  have  a  much  larger  error  (few  —  50%)  especially  if  the  spins  of  the  bodies 
are  large  [6].  The  Cramer-Rao  theorem  asserts  that  the  covariance  matrix  is  only 
a  lower  bound  on  the  errors  in  the  estimation  of  parameters  [13].  Consequently,  a  more 
rigorous  or  a  different  method  of  computing  the  variances  and  covariances  is  in  order. 
Alternatively  one  can  determine  the  errors  through  the  Monte  Carlo  simulation 
method.  The  basic  idea  here  is  to  mimic  the  actual  detection  problem  on  a  computer  by 
adding  numerous  realizations  of  the  noise  to  the  signal  and  then  passing  the  resultant 
time  series  through  a  bank  of  filters  each  matched  to  a  gravitational  wave  signal  from 
a  coalescing  binary  system  with  a  particular  set  of  parameters.  The  errors  involved  in 
the  measurement  can  be  computed  by  using  the  ensemble  of  measured  values  of  the 
signal  parameters.  Such  a  method  is  robust  in  the  sense  that  it  does  not  make  any 
assumptions  about  the  SNR  and  is  therefore  much  closer  to  the  actual  detection 
problem.  It  also  squarely  addresses  the  difficulties  faced  in  a  realistic  data  analysis 
exercise  such  as  the  finite  sampling  rate,  the  discreteness  of  the  set  of  filters,  etc.  For 
details  on  the  Monte  Carlo  method  see  Balasubramanian  et  al  [14]. 

In  this  communication  we  summarize  the  outcome  of  the  first  in  a  series  of  Monte  Carlo 
simulations  which  suggests  that  the  covariance  matrix  grossly  underestimates  the  errors  in 
the  estimation  of  the  parameters  by  over  a  factor  of  3  at  an  SNR  of  10.  The  standard 
method  to  deduce  the  direction  to  the  source  of  the  gravitational  radiation  has  been  to  use 
the  differences  in  the  times-of-arrival  in  a  network  of  three  or  more  detectors.  We  point  out 
that  since  the  instant  of  coalescence  of  a  binary  system  can  be  measured  to  a  much  greater 
accuracy  than  the  time-of-arrival,  the  former  can  be  employed  to  infer  the  direction  of  the 
source  at  a  much  lower  uncertainty  than  has  been  so  far  thought. 

2.  Basic  equations  and  formalism 

In  the  point  mass  approximation  the  restricted  first-order  post-Newtonian  gravi- 
tational radiation  emitted  by  a  binary  system  of  stars  induces  a  strain  in  the  detector 
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which  can  be  written  as 

h(t)  =  A[7tf(t)~]2l3cos(<p(t)'],  (1) 

where  f(t)  is  the  instantaneous  gravitational  wave  frequency,  which  at  this  level  of 
approximation  is  equal  to  twice  the  orbital  frequency  and  is  implicitly  given  by 

/)"] 

.J  a/         J 


i        r    /7vii 

+  47t/aT1  i  -  -7      +$ 


and  <p(t)  is  the  phase  of  the  signal  given  by 

y\-5/3 

7, 

where  fa,  the  time-of-arrival,  is  the  time  at  which  the  signal  reaches  a  frequency  /a;  and 
$  is  the  phase  of  the  signal  at  ra  a  constant  depending  on  the  relative  orientations  of  the 
binary  orbit  and  the  arms  of  the  interferometer;  TO  and  T:  are  constants,  having 
dimensions  of  time,  depending  on  the  masses  m^  and  m2  of  the  binary  system.  They  are 
referred  to  as  Newtonian  and  post-Newtonian  chirp  time,  respectively,  and  are  given 

by, 

«;  «;         fiA"\      11  ,,  \ 

(4) 


Here  /*  is  the  reduced  mass,  M  is  the  total  mass,  and  Jf  =  /i3/5  M2/5  is  the  chirp  mass.  In 
this  level  of  approximation  the  post-Newtonian  waveform  is  characterized  by  a  set  of 
five  parameters:  The  amplitude  parameter  A,  the  time-of-arrival  ta,  the  phase  at  the 
time  of  arrival  O,  and  the  two  masses  m  l  and  m2  .  We  shall  find  below  a  more  convenient 
parametrization  of  the  waveform. 

The  Fourier  transform  of  the  waveform  in  the  stationary  phase  approximation  for 
positive  frequencies  is  given  by 


[4 
'  I 
H=l 


where  ,tf  is  a  normalization  constant  and  is  fixed  by  specifying  the  SNR,  and  i//M(/)  are 
functions  only  of  frequency  given  by 

~5/3  //"X"1 

,    ^4  =  2;u/a    f       .  (6) 

a/  Va/ 

Here  A"  are  related  to  the  set  of  parameters  introduced  earlier:  ^  =  {tc^c^o^\}-: 
where 

.  (7) 


For  /<0,  the  Fourier  transform  can  be  obtained  by  the  relation  h(—f)  =  h*(f\ 
obeyed  by  real  functions  h(t). 

Central  to  the  computation  of  the  covariance  matrix  of  the  parameters  is  the  scalar 
product  defined  over  the  space  of  the  signal  waveforms.  Given  waveforms  /i(f;A)  and 
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h(t;  A)  their  scalar  product  is  defined  by, 

Ti  f.  ;jM/T*/  f-  l't'\ 

"(/'/'"/-A   V+c.0.  (8) 


where  S,,(/)  is  the  two-sided  detector  noise  power  spectral  density.  The  scalar  product 
defines  a  norm  on  the  vector  space.  The  SNR  obtained  for  a  signal  fc(t;A)  when  it  is 
passed  through  the  matched  filter  is  the  norm  of  the  signal  [4] 

p  =  </i,/i>1/2.  (9) 

The  covariance  matrix  C    is  the  inverse  of  the  Fisher  information  matrix  F  v  [  1  3]  given 

by, 


The  diagonal  elements  of  the  covariance  matrix  provide  us  an  estimate  of  the  variances 
to  be  expected  in  the  measured  values  of  the  parameters  in  a  given  experiment.  In 
figure  1  we  have  shown  as  a  solid  line  the  behaviour  of  the  computed  la  uncertainties 
(square  root  of  the  variances)  in  the  estimation  of  parameters  ta,T0,Tj  and  TC,  as 
a  function  of  the  SNR.  As  is  well-known  the  covariance  matrix  predicts  that  the  errors 
fall  off  in  inverse  proportion  to  the  SNR. 

3.  Simulations 

In  the  actual  detection  problem  the  detector  output  x(t)  is  filtered  through  a  host  of 
search  templates  corresponding  to  test  parameters,  ,A  and  the  template  that  obtains  the 
maximum  correlation  with  the  output  gives  us  the  measured  values  mA  of  the  signal. 
These  measured  values  will  in  general  differ  from  the  actual  signal  parameters.  With  the 
aid  of  a  large  number  of  detectors  one  obtains  an  ensemble  of  measured  values  WA.  Such 
an  ensemble  provides  us  with  an  estimate  CA,  the  errors  a}  and  correlation  coefficients 


where  the  overbar  denotes  the  average  over  an  ensemble.  In  reality  we  will  have  only 
a  few  detectors  and  hence  a  numerical  simulation  needs  to  be  carried  out  to.  deduce  the 
errors  in  the  estimation  of  parameters.  We  have  carried  out  such  a  numerical 
simulation  by  using  in  excess  of  5000  realizations  of  detector  noise.  Thus,  the  results  of 
our  simulations  are  statistically  significant  and  the  errors  in  the  estimation  of  various 
statistical  quantities,  such  as  the  mean  and  the  variance  (cf.  figure  1),  are  negligible.  In 
figure  1  the  dotted  line  corresponds  to  the  behaviour  of  the  errors  in  the  estimation  of 
various  parameters  computed  at  several  values  of  the  SNR.  At  low  values  of  the  SNR 
(p  ~~  10)  there  is  a  significant  departure  of  the  observed  errors  from  those  predicted  by 
the  covariance  matrix.  However  at  an  SNR  of  ~  30  the  two  curves  merge  together 
indicating  the  validity  of  the  covariance  matrix  at  high  enough  SNRs.  Since  most  of  the 
events  which  the  interferometric  detectors  will  observe  are  expected  to  have  an  SNR 
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Figure  1.  Dependence  of  the  errors  in  the  estimation  of  parameters  of  the 
post-Newtonian  waveform  i.e.  {"'t(1>^TI,o'(a»ffrc}  as  a  function  of  SNR.  The  solid  line 
represents  the  theoretically  computed  errors  whereas  the  dotted  line  represents  the 
errors  obtained  through  Monte  Carlo  simulations.  The  simulations  have  been 
carried  out  for  10M0  —  1-4M0  binary  system.  The  errors  in  the  parameters  are 
expressed  in  ms. 


less  than  1 0,  we  conclude  that  the  accuracy  in  the  determination  of  the  parameters  is  not 
as  high  as  was  thought  to  be.  Detailed  analysis  suggests  that  this  discrepancy  is  larger 
when  higher  post-Newtonian  corrections  are  taken  into  account.  Consequently,  a  more 
exhaustive  analysis  has  been  reported  here  or  elsewhere  [14]  which  is  in  order.  At  SNRs 
>  30  numerically  estimated  errors  are  somewhat  lower  than  the  covariance  matrix, 
especially  in  the  case  of  fc,  as  opposed  to  what  is  expected  from  the  Cramar-Rao  bound. 
The  reason  for  this  discrepancy  is  that  we  do  not  have  a  high  enough  resolution  (in 
particular,  in  the  case  of  tc)  to  determine  the  errors  when  the  SNR  is  high.  This  is 
indicated  by  the  increase  in  the  error  bars  after  the  two  curves  cross  over  in  figure  1. 
Moreover,  the  covariance  matrix  is  computed  using  the  stationary  phase  approxi- 
mation to  the  Fourier  transform  of  the  signal  while  numerical  estimation  uses  Fast 
Fourier  Transforms. 

With  reference  to  figure  1  we  point  out  that  the  instant  of  coalescence  tc  can  be 
determined  to  an  accuracy  much  better  than  the  time-of-arrival  ta.  Typically  ah.  is 
a  factor  of  50  less  than  cu.  Consequently,  with  the  aid  of  tc  we  can  fix  the  direction  to  the 
source  at  a  much  higher  accuracy  than  with  fa.  Since  the  parameters  TO,  rt  and  ta  have 
negative  covariances  their  sum,  rc,  is  more  accurately  determined.  A  possible  physical 
interpretation  of  this  result  is  that  at  the  time~of-arrival  the  frequency  is  lower  and 
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Figure  2.  This  figure  illustrates  the  network  of  three  identical  detectors  and  the 
coordinate  system  employed.  The  detectors  lie  on  the  X-Y  plane  and  0  and 
$  represent  the  angles  determining  the  direction  of  the  source  (S). 


hence  allows  for  a  larger  variation  in  ra  without  incurring  a  substantial  error  in  the 
phase  of  the  template  relative  to  that  of  the  signal.  While  close  to  coalescence  the  signal 
has  a  greater  frequency  and  hence  even  a  slight  mismatch  in  fc's  of  the  signal  and  the 
template  leads  to  a  greater  error  in  the  phase.  It  is  to  be  noted,  however,  that  as  of  now 
we  do  not  know  the  orbit  of  the  binary  accurately  enough,  to  predict  the  exact  instant  of 
the  coalescence.  In  fact  the  frequency  cut  off  imposed  by  the  onset  of  the  plunge  orbit 
will  make  it  difficult  to  calculate  tc.  In  addition,  for  a  realistic  detector  the  noise 
increases  with  the  frequency  beyond  about  1 50  Hz  and  the  frequency  which  will  be  of 
more  interest  to  us  is  the  one  where  the  power  spectrum  of  the  signal  divided  by  that  of 
the  noise  reaches  a  maximum.  This,  of  course,  assumes  that  the  detectors,  used  in  the 
measurement  of  the  direction  of  the  source,  are  all  identical. 

4.  Determining  source  direction 

Let  t  denote  a  convenient  time  parameter  which,  if  measured  at  three  detectors,  gives 
the  direction  to  the  source.  For  our  purpose  we  take  this  parameter  to  be  either  tc  or  ra. 
We  assume  a  simple  configuration  of  three  identical  detectors  placed  at  the  vertices  of 
an  equilateral  triangle  on  the  equatorial  plane  of  the  earth.  We  take  the  separation 
between  any  two  detectors  as  L.  Thus,  the  maximum  delay  in  arrival  times  of  the  signal 
at  the  detectors  is  A  =  L/C,  where  c  is  the  speed  of  light.  Figure  2  illustrates  such 
a  detector  network  and  the  coordinate  system,  used.  Given  t1 ,  t2  and  t3  as  the  measured 
values  of  nn  the  three  detectors,  respectively,  we  can  deduce  the  direction  to  the  source 
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L      3)'  J 


(12) 


assumption  of  identical  detectors  means  that  the  errors  in  t  in  different  detectors 
11  the  same.  As  the  noises  in  different  detectors  are  uncorrelated,  the  fluctuations  in 
leasurements  of  tl  ,  t2  and  r3  are  uncorrelated;  however,  y  and  6  have  non-zero 
fiances.  Thus, 


errors  in  the  measurement  of  time  parameter  will  induce  errors,  ff(jl  and  cr0,  in  the 
urement  of  angles.  Assuming  the  errors  in  time  parameter  to  be  small  we  can  write, 

°<,  =  -£94,(0,<l>)    and    c0  =  -g0(0,(j)),  (15) 

e, 

ss      and 


'actors  g^O,  0)  and  g0(0,  $)  are  typically  of  order  unity  for  most  directions.  For 
based  detectors  the  value  of  A  is  «  15  ms.  This  certainly  rules  out  the  use  of  ta  to 
mine  the  direction  as  the  error  in  this  parameter,  even  for  high  SNRs  ~  1  5,  is  much 
than  A.  Though  for  the  initial  LIGO  we  can  use  tc,  the  errors  at  an  SNR  of  10  will 
ound  two  degrees  which  is  too  large  to  make  an  optical  identification  of  the  source. 
der  to  determine  the  direction  to  arcsecond  resolution  one  needs  the  sensitivity  of 
dvanced  LIGO  or  the  VIRGO. 

lowledgements 

authors  would  like  to  thank  Biplab  Bhawal  and  S  D  Mohanty  for  useful 
ssions.  One  of  the  authors  (RB)  would  like  to  acknowledge  CSIR,  India!  for 
cial  support  through  the  senior  research  fellowship. 

•ences 

A  Abramovici  et  al,  Science  256,  325  (1992) 

C  Bradaschia  et  al,  Nucl.  lustrum.  Methods,  A289,  518  (1990) 

K  S  Thome,  in  300  Years  of  gravitation,  edited  by  S  W  Hawking  and  W  Israel  (Cambridge 

Univ.  Press,  1987) 

B  F  Schutz,  in  The  detection  of  gravitational  radiation,  edited  by  D  Blair  (Cambridge  Univ. 

Press,  Cambridge,  1989)  pp  406-427 

Pramana  -  J.  Phys.,  Vol.  45,  No.  5,  November  1995  L469 


[5]  L  S  Finn  and  D  F  Chernoff,  Phvs.  Rev,  D47,  2198  (1993) 
[6]  C  Cutler  and  E  Flanagan,  Phvs.  Rev.  D49,  2658  (1994) 
[7]  E  Poisson  and  C  M  Will,  Phvs.  Rev.  D52,  848  (1995) 
[8]  A  Krolak,  K  D  Kokkotas  and  G  Schafer,  Phys.  Rev.  D52,  2089  (1995) 
[9]  P  Jaranowski,  K  D  Kokkotas,  A  Krolak  and  G  Tsegas,  report  (unpublished) 
[10]  D  Markovic,  Phys.  Rev.  D48,  4738  (1993)  and  references  therein 
[11]  L  Blanche!  and  B  S  Sathyaprakash,  Class.  Quantum  Gravil.,  11,  2807  (1994) 
[12]  L  Blanchet  and  B  S  Sathyaprakash,  Phys.  Rev.  Lett.  74,  1067  (1995) 
[13]  C  W  Helstrom,  Statistical  theory  of  signal  detection,  (Pergamon,  Cambridge,  1 96( 

edition 

[14]  R  Balasubramanian,  B  S  Sathyaprakash  and  S  V  Dhurandhar,  Gravitational  wa 
coalescing  binaries:  detection  strategies  and  Monte  Carlo  estimation  of  parameters 
ted  for  publication  (1995) 
[15]  P  Jaranowski  and  A  Krolak,  Phys.  Rev.  D49  1723  (1994) 


L470  Pramana  _ 


Regd.  No.  KRN-A  64 

Pramana  journal  of  physics  Vol.  45  (No.  5),  November  1995 

Contents 


Nonlinear  dynamics  of  a  two-dimensional  lattice 

Nabonita  D  Chowdhury  and  Sankar  P  Sanyal         377 

Microscopic  theory  of  single  particle  and  pair  condensation  of  an  attracting 

Bose-gas  as  the  basis  for  superfluidity  and  superconductivity 

S  Dzhumanov  and  P  K  Khabibullaev        385 

Temperature  dependence  electron  spin  resonance  spectra  of  a  magnetic  fluid 
.R  VUpadhyay.D  Srinivas,  R  V  Mehtaand  P  M  Trivedi         419 

Dielectric  properties  of  nano-particles  of  zinc  sulphide 

Binny  Thomas  and  M  Abdulkhadar         431 

Generation  of  picosecond  optical  pulses  from  single  heterostructure  GaAs 

diode  laser  and  study  of  their  emission  characteristics 

VN  Rai,  M  Shukla  and  H  C  Pant         439 

Ion  cyclotron  instabilities  in  a  mildly  relativistic  hydrogen-deuterium  fusion 
plasma Chandu  Venugopal,  P  J  Kurian  and  G  Renuka        453 

Rapid  Communication 

Estimation  of  parameters  of  gravitational  waves  from  coalescing  binaries 
R  Balasubramanian,  B  S  Sathyaprakash  and  S  V  Dhurandhar      L463 


Indexed  in  CURRENT  CONTENTS  ISSN  0304-4289 

Edited  and  published  by  V  K  Gaur  for  the  Indian  Academy  of  Sciences,  Bangalore  560080.  Typeset  and 
printed  at  Thomson  Press  (I)  Ltd.,  Faridabad  121 007. 


PRAMANA 

journal  of 

physics 


Vol.  45  No.  6 
December  1995 


NDIAN  ACADEMY  OF  SCIENCES 

ndian  National  Science  Academy 
ndign  Physics  Association 


Editor 
R  Nityananda 

Riimcin  Research  Institute, 


Associate  Editor 
H  R  Krishnamurthy 

Indian  Institute  of  Science. 

Editorial  Board 

G  S  Agarwal,  Physical  Research  Laboratory.  Ahmedahad 

V  Balakrishnan,  Indian  institute  of  Technology,  Madras 

M  Burma,  Tula  Institute  of  Fundamental  Research,  Bombay 

P  Chaddall,  Centre  for  Advanced  Technoloay,  Indore 

B  M  Deb,  Panjuh  University,  Chandii/arh 

P  K  Kaw,  Institute  for  Plasma  Research,  dandlnmiiiur 

H  S  Mani,  Indian  Institute  of  Technology.  Kanpur 

N  Mukunda,  Indian  Institute  of  Science.  Baiujalore 

T  Padmanabhan,  Inter-University  Centre  for  Astronomy  and 

Astrophysics.  Pune 

J  C  Parikh,  Physical  Research  Laboratory,  Ahmedahad 
G  Rajasekaran,  Institute  of  Mathematical  Sciences,  Madras 
V  S  Ramamurlhy,  Department  of  Science  and  Technology.  New  Delhi 
R  Ramaswamy,  Jawaharlat  Nehru  University,  New  Delhi 
K  R  Rao,  Bhahha  Alomic  Research  Centre,  Bombay 
Probir  Roy,  Tata  Institute  of  Fundamental  Research.  Bombay 
A  K  §ood,  Indian  Institute  of  Science,  Bangalore 


Editor  of  Publications  of  the  Academy 

V  K  Gaur 

C-MMACS.  NAL.  Banaalore 


Subscription  Rates 

(1996) 

All  countries  except  India  1  year 

(Price  includes  AIR  MAIL  charges)  US$  200 

India  1  year 

Rs.  200 

Annual  subscriptions  for  individuals  for  India  and  abroad  are  Rs.  75/-  and  $50  respectively. 

All  correspondence  regarding  subscription  should  be  addressed  to  the  Circulation  Department 
of  the  Academy 

Editorial  Office 

Indian  Academy  of  Sciences,  C  V  Raman  Avenue,  Telephone:  334  2546 

P.B.  No.  8005,  Bangalore  560080,  India  Telex:  845-2178  ACAD  IN 

Telefax:  9 1-80-334  6094 

©  1995  by  the  Indian  Academy  of  Sciences.  All  rights  reserved. 

"Information  fnr  rnntrihiitnre"  ic  nrinfprl  in  the*  l-ict  iccno  nf  /..,/^r-i,  ,/,^1,.,,,,, 


PRAMANA  (©  Printed  in  India  Vol.  45,  No.  6, 

— journal  of  December  1 995 

physics  pp.  471-497 


The  real  symplectic  groups  in  quantum  mechanics  and  optics 

ARVIND,  B  DUTTA1,  N  MUKUNDA1-2  and  R  SIMON3 

Department  of  Physics,  Indian  Institute  of  Science,  Bangalore  560  01 2,  India 

1  Jawaharlal  Nehru  Centre  for  Advanced  Scientific  Research,  Jakkur,  Bangalore  560064,  India 

2Centre  for  Theoretical  Studies  and  Department  of  Physics,  Indian  Institute  of  Science, 

Bangalore  560  01 2,  India 

Institute  of  Mathematical  Sciences,  C.  I.  T.  Campus,  Madras  600  1 13,  India 

MS  received  13  June  1995 

Abstract.  We  present  a  utilitarian  review  of  the  family  of  matrix  groups  Sp(2n,;^),  in  a  form 
suited  to  various  applications  both  in  optics  and  quantum  mechanics.  We  contrast  these  groups 
and  their  geometry  with  the  much  more  familiar  Euclidean  and  unitary  geometries.  Both  the 
properties  of  finite  group  elements  and  of  the  Lie  algebra  are  studied,  and  special  attention  is  paid 
to  the  so-called  unitary  rnetaplectic  representation  of  Sp(2«,  M).  Global  decomposition  theorems, 
interesting  subgroups  and  their  generators  are  described.  Turning  to  n-mode  quantum  systems, 
we  define  and  study  their  variance  matrices  in  general  states,  the  implications  of  the  Heisenberg 
uncertainty  principles,  and  develop  a  U(n)-invariant  squeezing  criterion.  The  particular  proper- 
ties of  Wigner  distributions  and  Gaussian  pure  state  wavefunctions  under  Sp(2/v#)  action  are 
delineated. 

Keywords.  Symplectic  groups;  symplectic  geometry;  Huyghens  kernel;  uncertainty  principle; 
multimode  squeezing;  Gaussian  states. 

PACSNos    3-65;  42-50 


1.  Introduction 

The  symplectic  groups  form  one  of  the  three  major  families  of  classical  semisimple  Lie 
groups,  the  other  two  being  the  real  orthogonal  family  and  the  complex  unitary  family 
[1].  Apart  from  the  groups  describing  nonrelativistic  and  relativistic  space-time 
geometries,  namely  the  Galilei,  Lorentz  and  Poincare  groups,  most  of  the  Lie  groups 
encountered  in  physical  problems,  for  example  as  symmetry  groups,  belong  to  either  the 
real  orthogonal  or  the  unitary  families  [2].  These  are  multidimensional  as  well  as 
complex  generalisations  of  the  rotation  group  of  Euclidean  geometry  characterising 
physical  three-dimensional  space.  As  a  result,  the  intuitive  geometrical  ideas  that  go  with 
real  orthogonal  or  complex  unitary  geometries  are  quite  familiar  to  most  physicists. 

It  has  been  realised  recently,  however,  that  in  several  problems  both  in  quantum 
mechanics  and  in  optics,  the  real  symplectic  groups  play  an  important  role  [3].  In  the 
latter  context,  this  is  so  in  both  classical  and  quantum  theories.  More  generally,  these 
groups  come  in  very  naturally  through  the  canonical  formalism  of  classical  dynamics, 
and  its  counterpart  in  quantum  mechanics. 

Symplectic  geometry,  on  the  other  hand,  differs  profoundly  from  the  real  or 
complex  Euclidean  variety  [4].  Here  the  intuitively  familiar  concepts  of  length,  angle, 


perpendicularity  and  the  Pythagoras  Theorem  are  all  absent.  In  their  place  we  have 
typically  new  concepts  characteristic  of  canonical  mechanics. 

The  purpose  of  this  informal  and  utilitarian  survey  is  to  introduce  methods  based  on 
the  real  symplectic  groups  to  those  who  are  otherwise  familiar  with  the  structures  of 
quantum  mechanics  and/or  the  theory  of  partial  coherence  in  optics.  The  intention  is  to 
"inform  rather  than  astonish",  and  to  describe  the  main  features  of  the  symplectic  point 
of  view,  Our  account  will  not  contain  complete  proofs  of  all  statements  presented,  but 
a  motivated  reader  should  easily  be  able  to  supply  additional  detail  and  proceed  to 
make  practical  use  of  these  methods. 

The  material  of  this  article  is  organised  as  follows.  Section  2  introduces  the  group 
Sp(2n,i$)  as  the  group  of  linear  transformations  preserving  the  classical  Poisson  Brackets 
as  well  as  the  quantum  commutation  relations  among  n  pairs  of  canonical  variables.  In  the 
quantum  case,  the  Stone-von  Neumann  theorem  allows  us  to  infer  that  these  transform- 
ations are  unitarily  implementable.  Section  3  develops  some  ideas  related  to  real  symplectic 
linear  vector  spaces,  specially  the  concepts  of  symplectic  complement  and  symplectic  rank 
of  a  subspace,  in  order  to  contrast  symplectic  geometry  with  real  Euclidean  and  complex 
unitary  geometries.  Some  useful  properties  of  the  matrices  occurring  in  the  defining 
representation  of  Sp(2n,  ^?),  and  their  complex  form,  are  then  explained  in  §  4.  Here  we  also 
list  several  useful  subgroups  of  Sp(2rc,  @l\  and  describe  four  global  decomposition  the- 
orems -  the  polar,  Euler,  pre-Iwasawa  and  Iwasawa  decompositions.  Section  5  studies  the 
Lie  algebra  Sp(2n,^g),  first  in  the  defining  representation  and  then  in  a  general,  possibly 
unitary,  representation.  Convenient  ways  of  breaking  up  the  generators  into  subsets,  and 
generators  of  various  subgroups,  are  described.  In  §  6  we  set  up  and  study  the  special 
unitary  metaplectic  representation  of  Sp(2n,£?)  and  relate  it  to  the  generalised  Huyghens 
kernel  in  any  number  of  dimensions.  The  characteristic  differences  between  the  compact 
and  the  noncompact  generators  of  Sp(2n,^)  are  seen  in  their  dependences  on  mode 
annihilation  and  creation  operators.  The  former  (latter)  conserve  (do  not  conserve)  the  total 
number  operator.  Section  7  studies  the  relationship  between  the  metaplectic  unitary 
representation  of  Sp(2rc,^?),  and  two  often  used  descriptions  of  quantum  mechanical 
operators,  namely  the  Wigner  function  representation  and  the  diagonal  coherent  state 
representation.  While  the  former  is  covariant,  i.e.,  transforms  simply,  under  the  full  group 
Sp(2/7,  t%),  the  latter  is  covariant  only  under  the  maximal  compact  subgroup  K(n)  =  U(n)  of 
Sp(2«,  0t\  Section  8  takes  up  the  questions  of  defining  the  noise  or  variance  matrix  for  any 
state  of  an  «-mode  quantum  system,  both  in  real  and  complex  forms,  and  their  behaviours 
under  Sp(2n,$?).  In  §9  we  carry  this  analysis  further  to  show  that  the  Heisenberg 
uncertainty  principles  for  any  number  of  modes  can  be  given  in  explicitly  Sp(2o,^?) 
covariant  ibrms;  a  key  role  here  is  played  by  Williamson's  Theorem  relating  to  normal 
forms  of  quadratic  Hamiltonians.  This  study  leads  in  §10  to  the  setting  up  of  an 
U(n)-invariant  squeezing  criterion  for  n-mode  systems.  This  is  the  maximal  physically 
reasonable  invariance  one  could  ask  for  in  these  systems,  and  it  can  be  stated  very  elegantly 
in  terms  of  the  general  variance  matrix  set  up  in  §8.  Section  11  describes  and  motivates 
some  interesting  classes  of  variance  matrices  with  distinctive  group  theoretic  properties, 
and  §12  is  devoted  to  a  study  of  general  centred  and  normalized  pure  Gaussian 
wavefunctions  for  H-mode  systems.  The  transitive  action  of  Sp(2«,^?)  on  these  wave- 
functions,  via  the  metaplectic  representation,  and  the  emergence  of  a  matrix  form  of  the 
Mobius  transformation,  are  described.  Section  13  contains  some  concluding  remarks. 
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Symplectic  groups  in  quantum  mechanics  and  optics 

2.  The  real  symplectic  groups  Sp(2«,  ^) 

We  consider  a  classical  or  quantum  canonical  system  with  n  degrees  of  freedom,  that  is, 
n  pairs  of  mutually  conjugate  canonical  variables.  In  the  classical  case  these  are 
numerical  variables  written  as  qr,  pr,  r  —  1, 2, . . . ,  n.  In  quantum  mechanics  we  have  an 
irreducible  set  of  hermitian  operators  qr,pr  acting  on  a  suitable  Hilbert  space  ;#* .  The 
basic  kinematic  structure  is  given  by  Poisson  brackets  (PB)  in  one  case  and  by  the 
Heisenberg  commutation  relations  (CR)  in  the  other.  To  express  them  both  compactly 
and  elegantly,  we  introduce  the  following  notation.  We  assemble  the  g's  and  JP'S  into 
2rc-component  vectors  <;,  £ 

£  =  (O  =  (4r'-4,JVPn)T> 

f=(O  =  (4r"4,A-A,)r,     a=l,2,...,2n.  (2.1) 

Then  the  classical  PB's  and  the  quantum  CR's  are,  respectively 


0       ,  (2.2) 

nxn/ 

The  usual  relations  stated  separately  in  terms  of  g's  and  p's  are  all  contained  here;  and 
the  even-dimensional  real  antisymmetric  matrix  ft  will  play  an  important  role. 

We  assume  always  that  all  the  £a,  cffl  are  of  the  Cartesian  type:  the  natural  range 
(spectrum)  for  each  of  them  is  the  entire  real  line  3ft. 

Changes  of  £,  <f  to  new  quantities  £',  <f  given  as  functions  of  the  old  ones  such  that  the 
basic  kinematic  relations  are  preserved  may  be  called  canonical  transformations  in 
both  situations 

%  =  numerical  functions  of  £:{^,<^,}  =  ftab; 

I'  =  operator  functions  of  |: [£'a,  fjj  =  ihftab.  (2.3) 

Apart  from  C-number  translations  (shift  of  origin)  the  simplest  such  transformations 
are  the  linear  homogeneous  ones.  Each  such  transformation  may  be  specified  by  a  real 
2n-dimensional  matrix  S,  the  actions  being 


In  either  case,  the  requirements  (2.3)  lead  to  a  matrix  condition  on  S  [1] 

SpST  =  p.  (2.5) 

This  is  the  defining  condition  for  the  real  symplectic  group  in  In  dimensions 

Sp(2n,^)  =  [S  =  rea!2n  x  In  matrix \SflST  =  ft}.  (2.6) 

The  matrix  ft  is  real,  even-dimensional,  antisymmetric  and  nonsingular.  It  is  a  "sym- 
plectic metric  matrix".  As  we  see  explicitly  later,  Sp(2n,^)  transformations  preserve 
svmolectic  scalar  oroducts  and  the  svmolectic  metric. 


in  quantum  mecnanics  me  nnoeri  space  <#,  u 
described  in  many  ways.  The  most  familiar  is  the  Schrodinger  description  using  w 
functions  on  #",  that  is,  elements  of  L2(@").  The  4r  act  multiplicatively  while  the  pr 
differential  operators 


Since  the  <ffl  are  hermitian  and  irreducible,  and  since  for  any  SeSp(2n,.^)  the  tra 
formed  £'a  are  also  hermitian  and  irreducible  and  obey  the  same  CR's,  by  the  Stone-^ 
Neumann  theorem  [5]  the  change  <f-»c?  is  unitarily  implementable.  Thus  for  e; 
^)  it  is  definitely  possible.  to  construct  a  unitary  operator  <&($)  acting 
such  that 


This  <%(S)  is  arbitrary  up  to  an  S-dependent  phase  factor.  The  general  composition  i 
that  follows  from  the  irreducibility  of  the  <fa  is 

S',SGSp(2/i,#):<&(S')#(S)  =  (phase  factor  dependent  on  S',S)%(S'S).  (. 

We  shall  discuss  in  §  6  the  maximum  simplification  that  can  be  achieved  in  this  ph 
factor  by  exploiting  the  phase  freedom  in  each  ^(S). 

3.  Aspects  of  symplectic  geometry 

In  this  Section  we  develop  a  few  basic  concepts  related  to  symplectic  vector  spaces 
that  the  contrast  with  Euclidean  and  unitary  geometries  can  be  clearly  seen  [4]. 

Let  Kbe  a  real  In-dimensional  vector  space,  with  vectors  x,y,  ..  ..  Suppose  a  n< 
degenerate  bilinear  antisymmetric  form  (•,•)  -  a  "scalar  product"  -  is  given  on  K  Tl 
for  any  vectors  x,  ye  V,  (x,  y)  is  a  real  number  separately  linear  in  x  and  y;  and  in  addit 
the  following  hold: 

antisymmetry:  (x,  y)  —  —  (y,  x\ 
nondegeneracy:  (x,  y)  =  Q    for  all    yox  =  0.  (! 


Then  K  is  a  symplectic  vector  space,  and  (.  ;  .)  is  a  symplectic  scalar  product. 

We  have  stated  the  properties  of  the  bilinear  form  in  a  basis  independent  way.  It  < 
be  shown  that  if  in  a  general  basis  we  express  (x,y)  in  terms  of  components  of  x  and  j 
the  form 
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involving  an  antisymmetric  nonsingular  matrix  r\,  we  can  always  change  to  more 
convenient  bases  in  which  r\  takes  on  particularly  simple  canonical,  or  normal,  forms. 
Two  such  forms  are  worth  mentioning.  In  one,  rj  becomes  the  matrix  (3  of  (2.2) 


•••+Xny2n-Xn  +  iyi-Xn  +  2y2  -----  *2«3V  (3-3) 

Here  the  first  and  (n  +  l)th  components  belong  to  one  canonical  pair;  the  second  and 
(n  +  2)th  to  the  second  pair;  and  so  on.  Another  normal  form  disposes  the  canonical 
pairs  one  at  a  time 

r\  —  block-diag(icr2,  i<72,  .  .  .  ,  ia2}\ 
(x,  y)  =  xiy2-  x2yl  +  x3y4  -  *4y3  +  •  •  •  (3.4) 


With  the  normal  form  (3.3)  the  meaning  of  the  defining  condition  (2.5)  for  symplectic 
matrices  becomes  geometrically  clear 

SeSp(2n,@),x'  =  Sx,    y'  =  Sy=*(x',y')  =  (x,y).  (3.5) 

In  this  sense  the  symplectic  scalar  product  is  preserved.  Of  course  this  means  that  we 
could  have  replaced  the  condition  (2.5)  by  another  entirely  equivalent  one,  using  in 
place  of  {3  the  matrix  in  (3.4). 

Now  let  us  look  at  linear  subspaces  Vl  £  V.  In  both  Euclidean  and  unitary 
geometries,  it  is  well  known  that  all  subspaces  of  the  same  dimension  are  basically 
similar,  and  cannot  be  distinguished  from  each  other  in  any  intrinsic  sense.  With 
symplectic  geometry  there  is  a  difference,  as  a  new  concept  comes  in.  Given  a  subspace 
V1,  we  consider  the  bilinear  form  (x,y)  defined  over  V,  restrict  both  arguments  to  V1  , 
and  regard  the  result  as  a  bilinear  form  on  Vt  .  Now  the  nondegeneracy  property  may 
well  fail!  Thus,  there  may  exist  a  vector  xe  Vl  such  that  (x,  y)  =  0  for  all  ye  V:  .  So  we 
define  the  rajik  of  this  restricted  form  as  the  symplectic  rank  of  Vl 

Symp.  rk.     Vt  =  rank  (x,y),    x  and  ye  Vv.  (3.6) 

Thus,  if  xr,  r  =  1,  .  .  .  ,  k  is  a  basis  for  Vl  ,  where  k  is  the  dimension  of  Vl  ,  then  Symp.  rk. 
F!  is  the  rank  of  the  k  x  k  antisymmetric  matrix  ((xr,xj).  The  symplectic  rank  is 
necessarily  an  even  integer.  We  have  the  obvious  limits 

0  ^  symp.  rk.     Vl  ^  k.  (3.7) 

But  nondegeneracy  of  (.  ,  .)  over  V  leads  to  another  nontrivial  lower  bound,  which  is 
effective  if  k>  n 

(0,  2(k  -  n))>  <  symp.  rk.     V,  ^  k.  (3.8) 

Basically  we  can  say  that  the  symplectic  rank  of  a  subspace  Vl  is  twice  the  number 
of  complete  canonical  pairs  contained  in  Vi.  Clearly  this  concept  is  symplectic 
invariant.  In  particular  two  subspaces  Vi  and  V\  of  the  same  dimension  cannot  be 
mapped  on  to  one  another  by  any  Sp(2n,^)  element  if  they  have  unequal  symplectic 
ranks. 

As  in  the  Euclidean  case,  we  can  pass  from  Fx  to  its  complement  written  for 
convenience  as  V^.  But  the  geometrical  significance  is  quite  different.  We  call  V\  the 
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symplectic  complement  of  Vl  and  define  it  as  a  subspace  of  V  by 

V{  =  {xeV\(x,y)  =  Q    for  all    yeV^. 
Taking  the  complement  twice  gives  back  ^ 

(W  =  JV 

This  is  as  in  the  Euclidean  case.  Even  the  dimensions  follow  the  same  rule 
Dim  Vi  =  2n-  k. 

This  can  be  shown  by  using  the  nondegeneracy  of  (.  ,  .)•  But  there  the  similarity  en 
can  well  happen  that  V^  and  V\  have  nontrivial  intersection,  and  to  that  extent 
sum  does  not  give  back  all  of  V.  In  general, 


The  two  symplectic  ranks  can  be  related: 

Symp.  rk.  F}  =  2(n  -k)  +  symp.  rk.  V^  .  i 

The  extreme  case  of  (3.1-2),  which  is  very  nonintuitive  on  the  basis  of  Eucli 
geometric  notions,  is  when  symp.  rk.  Vt  vanishes.  For  this  case  we  give  a  special  : 
and  find 

Symp.  rk.     Vl=QoVi    is  an  isotropic  subspace  of  Vo(x,  y)  =  0 

for  all    x,yeVloVl^V^=>k^n.  i 

So  in  this  case  Vl  is  contained  in  V\.  The  opposite  can  also  happen  and  then  w 
Vl  a  co-isotropic  subspace 

71  is  a  co-isotropic  subspace  of  V<=>  V\  is  isotropicoKj  2  V\=>k^- 

So  if  one  of  the  pair  Vl,  V\  is  isotropic,  the  other  is  co-isotropic. 

An  isotropic  subspace  has  dimension  k  ^  n,  while  a  co-isotropic  one  has  dime: 
k^n.  When  they  coincide,  we  have  a  special  situation  and  name.  A  n-dimens 
subspace  Fj  of  K  which  has  vanishing  symplectic  rank  is  both  isotropic  am 
isotropic,  and  coincides  with  V±.  It  is  called  a  Lagrangian  subspace.  This  noti 
important  in  Hamilton-Jacobi  theory  in  classical  dynamics;  it  is  also  relevant  i 
choice  of  complete  commuting  sets  of  operators  in  quantum  mechanics. 

These  properties  and  notions  give  a  feeling  for  symplectic  geometry,  and  for  the 
in  which  it  differs  from  orthogonal  and  unitary  geometries.  In  particular  the  notic 
length,  angle  and  perpendicularity  are  no  longer  available. 

4.  Properties  of  Sp(2/i,  J1)  matrices,  complex  form,  subgroups,  decompositions 

The  matrices  in  the  defining  representation  of  Sp(2n,^?)  obey  (2.5).  From  here  i 
useful  consequences  follow,  and  we  list  them: 

(i)  Sp(2n,^)  is  of  dimension  n(2n  +  1). 
(ii) 


\1\)    UCLO  =   -f  1. 

(v)  SeSp(2n,^?)=>  eigenvalue  spectrum  of  S  is  invariant  under  reflection  about  the  real 
axis,  and  through  unit  circle  (reie-^^elB);  eigenvalues  +  1  have  even  multiplicities. 

(4.1) 

Property  (i)  can  be  seen  from  the  number  of  conditions  contained  in  (2.5),  and  will  be 
confirmed  at  the  Lie  algebra  level.  While  properties  (ii)  and  (iii)  are  easily  checked,  (iv)  is 
rather  subtle;  an  indication  will  be  given  later  to  obtain  it.  Property  (v)  is  a  consequence 
of  S  and  (S~  1)T  being  real  and  related  by  a  similarity  transformation. 

Sometimes  it  is  convenient  to  write  S  in  n  x  n  block  form,  and  then  (2.5)  becomes  a  set 
of  conditions  on  the  blocks 


symmetric,     ADT  -  BCT  =  1WXJI 
STpS  =  P<*ATC,BTD     symmetric,     ATD  -  CTB=  1BXII.  (4.2) 

While  it  is  easy  to  check  (as  mentioned  above)  that  SeSp(2n,£#)  implies  STeSp(2n,^) 
as  well,  it  is  not  so  easy  to  pass  directly  from  the  first  set  of  conditions  above  to  the 
second  set,  aside  from  reconstituting  A,B,  C,D  into  S  and  then  passing  to  ST\ 

Complex  form  of  Sp(2n,^?) 

The  ft  matrix  reflects  the  precise  way  in  which  the  real  q's  and  p's  have  been  put  together  in 
(2.1)  to  form  the  In  component  object  ^  with  real  entries.  Sometimes  it  is  convenient,  for 
instance  in  dealing  with  modes  of  the  radiation  field,  to  work  with  complex  combinations 
of  the  <f  s  and  /Ts-mode  annihilation  and  creation  operators  defined  in  this  way 

.-ip.),    ;  =  !,...,«.  (4.3) 

~ 


It  is  useful  to  arrange  these  into  a  new  column  vector  <f(c)  with  non-hermitian  entries, 


(4.4) 
Then  the  basic  commutation  relations  in  (2.2)  can  be  written  in  two  equivalent  ways 


(4.5) 
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Now  when  we  subject  <;  to  trie  real  transiormation  iebp(zrc,^j,  c     expenenc 
equivalent  complex  transformation 


~2\A-D-i(B  +  Q    A  +  D  +  i(B-Q 

Thus  S(c)  is  just  a  convenient  complex  form  of  the  real  transformation  S,  much  lil 
passage  from  Cartesian  to  spherical  components  of  spherical  tensors. 

Some  subgroups  of  Sp(2rc,^) 

We  shall  describe  here  some  useful  subgroups  of  Sp(2«,^).  Their  dimensions  v 
given,  and  where  it  is  useful  their  complex  forms  exhibited. 

(a)  GL(n,  8%):  This  is  the  rr-dimensional  general  real  linear  group;  in  terms  of  the 
matrices  A,  B,  C,  D  it  is  given  thus 


Here  the  <f  s  are  subject  to  a  general  real  linear  nonsingular  transformation  a: 

themselves,  and  then  the  p's  change  in  a  compensating  contragradient  mam 

(b)  O  («,  ^):  This  is  the  orthogonal  subgroup  of  GL(n,  &),  of  dimension  ^n(n  —  1 

that  part  of  GL(n,M)  under  which  the  <fs  and  the  p's  change  in  the  same  wa 


If  we  impose  the  condition  det  A  ~  +  1,  we  get  the  subgroup  SO(n,^)  of  p 
orthogonal  transformations. 

(c)  U(n):  Now  we  come  to  the  H-dimensional  unitary  group,  of  dimension  n2,  a 
mal  compact  subgroup  within  the  noncornpact  Sp(2n,  ^).  We  shall  sometimes 
K(«),  or  simply  K,  for  it.  The  corresponding  symplectic  matrices  S  are  identii 
follows.  If  we  split  any  UeU(w)  into  real  and  imaginary  parts  we  find  the  prop 


U  =  X-iYeU(n),    UfU  =  UUt=  l^XTX  +  YTY  =  XXT+  YYT 
XTY,XYT  symmetric. 
We  can  then  produce  a  solution  to  the  matrix  condition  (4.2).  We  find 


-l 

It  is  an  interesting  and  easy  exercise  to  check  the  following:  If  a  In  x  2n  real  n 
is  both  orthogonal  and  symplectic,  then  it  is  unimodular  as  well  and  has  to  ha 
form  S(X,  Y)  for  some  U  =  X  -  i  7eU(n) 

O(2n,  #)  n  Sp(2n,  ^)  =  SO(2n,  &)  n  Sp(2n,  ^)  =  K(n) 
=  {S(X,Y)\X-iYeU(n)}. 
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The  complex  form  of  these  matrices  is  very  revealing 
S(C)(X,  Y)  =  QS(X, 
=  S(C)(U) 
'U  0 


o  u* "  (4-12) 

So  the  a's  and  the  af's  undergo  separate  unitary  rotations,  not  mixing  with  one 
another 

UeU(n):    d-»Ua,    a^U*^.  (4.13) 

Indeed,  K(n)  is  the  maximal  subgroup  of  Sp(2«,  ^)  such  that  <f  s  and  ^'s  transform 
independently.  We  also  have  the  expected  relation  between  the  subgroups  O(n,  ^?), 
GL(n,3l),  and  U(»)  exhibited  above 

O(«,#)  =  GL(H,#)nU(«).  (4.14) 

Finally  we  turn  to  some  Abelian  subgroups  [6]. 

(d)  Tf:  This  is  a  subgroup  of  dimension  |n(n+l)  and  may  be  called  the  "free 
propagation"  subgroup 

A  =  D  =  1,     B  =  BT,     C  =  0.  (4.15) 

The  name  comes  from  the  actions  on  q  and  on  p 

q'  =  q  +  Bp,    p'=p.  (4.16) 

Group  composition  corresponds  to  adding  the  B  matrices,  which  explains  the 
Abelian  nature. 

(e)  T(/):  This  is  the  result  of  conjugating  elements  of  T(/)  by  j5.  We  call  it  the  "lens" 
subgroup,  on  account  of  the  action  on  <f  s  and  $'s 

A  =  D  =  l,    J3  =  0,     C  =  Cr; 

q'  =  4,     p'=p  +  Cq.  (4.17) 

The  dimension  is  again  ^n(n  +  1),  and  group  composition  amounts  to  adding  the 
C  matrices. 

Some  other  subgroups  of  Sp(2n,^)  will  appear  in  connection  with  global  decom- 
position theorems. 

Global  decomposition  theorems 

Now  we  describe  four  useful  ways  of  expressing  any  SeSp(2n,$)  as  a  product  of 
specially  chosen  factors,  either  two  or  three  in  number. 

(a)  Polar  decomposition  [7]:  This  says  that  any  5eSp(2«,  &)  can  be  written  uniquely  as 
the  product  of  two  factors,  one  belonging  to  the  maximal  compact  subgroup  K(n), 
the  other  to  an  important  subset  n(n)  in  Sp(2n,  9K).  This  subset  is  defined  by 

n(n)  =  {SeSp(2n,^)|Sr  =  S,S  positive  definite}  cSp(2n,^).  (4.18) 
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Of  course  by  conjugating  P  with  S(X,  Y)  one  could  have  written  the  two  faci 
the  opposite  sequence.  The  important  points  here  are  the  global  nature  < 
result,  and  the  uniqueness  of  the  factors.  From  this  decomposition  one  can  se 
of  the  two  possibilities  det  5  =  +  1  allowed  by  (2.5),  the  choice  det  S  —  +  1 
only  one  allowed. 

(b)  Euler  decomposition:  Next  we  turn  to  a  decomposition  which  involves 
factors,  each  drawn  from  a  subgroup  of  Sp(2n,  ^?),  but  which  is  nonunique.  1 
the  factors  are  from  K(n),  the  third  from  those  elements  of  TL(n)  which  are  die 
and  do  form  a  subgroup 


&)    S  =  S(X1,Y1)D(K)S(X2,Y2), 
S(X1,Y1),S(X2,Y2)EK(n), 

>cr>0,     r  =!,...,«. 

If  one  adds  the  number  of  free  parameters  in  the  three  factors,  one  gets  th 
n(2n  +  1)  which  is  just  the  dimension  of  Sp(2rc,  ^?).  Thus  the  nonuniqueness 
decomposition  is  of  a  discrete,  not  a  continuous,  nature.  It  stems  essential!] 
the  freedom  to  order  the  first  n  diagonal  elements  of  D(K)  in  any  way  we  lik 
(c)  Pre-Iwasawa  decomposition:  Here  we  have  a  three-factor  unique  decompc 
which  one  encounters  on  the  way  to  establishing  the  (next)  Iwasawa  decompc 
but  the  factors  do  not  all  belong  to  subgroups  of  Sp(2n,  £%).  The  present  decor 
tion  results  from  attempting  to  reduce  the  off  diagonal  block  B  in  a  g 
to  zero,  by  using  an  element  of  K(rc)  on  the  right.  The  result  re 

r  n  i  ~  \  c  A~I  1/1  o    A~I  l\  —  Y  y 

C-    U  /  \  L^n/tr,         1   /  \     U      fir,       I  \  —   I      A 


X-iY^A'^A-iB), 


Here  the  matrix  A0  is  to  be  chosen  symmetric  positive  definite,  and  all  factc 
unique.  The  symmetry  of  CQAQl  can  be  checked,  so  the  first  factor  lies 
lens  subgroup  Tl  of  (4.17).  The  middle  factor  belongs  to  the  inters 
GL(/i,^)nH(«),  which  is  not  a  subgroup.  And  the  third  factor  is  from  K(n] 
particular  decomposition  is  of  importance  in  obtaining  the  generalised  Huy 
kernel,  which  we  describe  in  §  6. 

(d)  Iwasawa  decomposition  [8]:  The  polar  and  pre-Iwasawa  decompositior 
similar  since  they  involve  unique  factors,  but  each  factor  is  not  taken 
a  subgroup.  The  Euler  decomposition  solves  the  latter  problem,  but  in  the  p 
uniqueness  is  lost.  The  fourth  and  last  Iwasawa  decomposition  retains  both  v: 
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it  is  global,  has  unique  factors,  and  each  is  taken  from  a  characteristic  subgroup  of 
Sp(2n,^).  It  is  a  result  of  fundamental  group  theoretical  significance,  valid  for  all 
simple  non-compact  Lie  groups.  The  three  subgroups  involved  are  the  maximal 
compact  K(«),  a  certain  maximal  Abelian  subgroup  jtf,  and  a  certain  nilpotent 
subgroup  «V.  Therefore  this  decomposition  is  often  called  the  JfWyT  decomposi- 
tion. We  first  display  it  for  Sp(2,<#) 


a  b 

c  d 


ad-bc=l: 


S  = 


0    \/cos<p/2    —  sin<p/2\ 
-ni2  M  e\nmn        cos  <p/2  y 


1  0 

c  iJl  0    « 

=  (ac  +  bd)/(a2  +  b2)e(  —  oo,  oo), 
=  ln(a2  +  b2)e(—  oo,  x), 
=  2arg(«  —  J6)e(  — 27r,27t]. 


(4.22) 


Here  the  first  —  £  —  factor  belongs  to  the  subgroup  J7",  coinciding  for  n  =  I  with  the 
lens  subgroup  Tl;  the  second  -n  —  factor  belongs  to  the  subgroup  jaf;  and  the  third 
-  <p  -  factor  is  from  K(l)  =  SO  (2). 
For  general  Sp(2n,  0t\  the  situation  is  more  involved.  The  subgroups  <sf  and  J'"  are 


o\ 


C  symmetric    c:Sp(2n,^). 


(4.23) 

The  abelian  subgroup  sf  consists  of  just  the  elements  D(K)  that  were  used  in  the 
Euler  decomposition  (4.20).  The  Iwasawa  decomposition  for  Sp(2n,^?)  then  states 
that  any  SeSp(2n,#£)  can  be  uniquely  expressed  as  the  product  of  three  factors, 


0 


D(K)  S(X,  Y). 


(4.24) 


taken  respectively  from  JT>d  and  K(n).  The  dimensionalities  of  these  subgroups, 
respectively  n2,  n  and  n2,  add  up  correctly  to  n(2n  +  1). 

5.  The  Lie  algebra  of  Sp(2n,#) 

We  first  study  the  Lie  algebra  Sp(2«,  ^)  in  the  defining  representation,  and  then 
generalise  to  any  other  representation.  In  keeping  with  quantum  mechanical  conven- 
tion, we  shall  retain  a  factor  of  i  in  the  definition,  even  though,  this  might  seem 
unnecessary  in  dealing  with  a  group  of  real  matrices. 
We  examine  the  form  of  matrices  SeSp(2n,-^)  close  to  the  identity 

S  =  exp(  —  isJ)  ~  1  —  i&J.     \B\  «  1: 


inner  worus  in  me  ueuiiuig  icpicbci.iia.uuii  we  gci  cui  pusMuic  j  5  uy  pic-  ui 

multiplying  all  possible  pure  imaginary  symmetric  2n  x  2n  matrices  by  ft.  Takii 
former  alternative  and  choosing  the  simplest  possible  basis  for  symmetric  2? 
matrices,  we  obtain  the  following  basis  for  Sp(2n,&): 


a,b=l,...,2n; 


These  matrices  can  be  easily  seen  to  obey  the  commutation  relations 


The  structure  ofSp(2n,  J?)is  determined  by  these  relations.  In  a  general  represent 
of  Sp(2n,  <%)  we  have  generators  Xab  =  Xba  obeying 


Finite  dimensional  representations  of  Sp(2n,^?)  are  necessarily  nonunitary,  hei 
them  the  Xab  cannot  all  be  hermitian.  This  is  because  of  the  noncompactni 
Sp(2n,  £%).  On  the  other  hand,  in  a  unitary  representation  which  is  necessarily  ir 
dimensional,  we  have  X\b  —  Xab. 

To  help  identify  the  subsets  of  generators  for  various  subgroups  it  is  useful  to  us 
index  notation.  We  use  r,  s,...  =  l,...,n  to  label  the  various  canonical  pairs 
a,  /?,...  =  1,  2  to  pick  out  the  q  and  the  p  in  each  pair 


Pab  =  Pra.s^  =  "rsfia/J' 
0    1' 


£~'    i  oy 

Then  the  various  components  of  Xab  =  Xra  sp  are  handled  thus: 
X       —  V—V- 

rl,sl  rs  sr' 


There  are  \n(n  +  1)  K's,  a  similar  number  of  Z's,  and  n2  Ws;  in  a  unitary  represent 
each  of  them  is  hermitian.  In  this  split  form  the  commutation  relations  (5.4)  rea 

VW    W  1  =  U5    W  —8    W  \ 

LKKrs'  reuv-i        L\urv'Yus        uusr¥rui~ 
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C^«.  Zvo}  =  i(dn  Wsv  +  dsu  Wn  +  5rv  Wsu  +  6SV  WJ. 

[K,K]=[Z,Z]=0.  (5.7) 

'sfow  one  can  pick  out  the  subsets  of  generators  for  various  subgroups  of  Sp(2n,^);  we 
live  the  results  in  the  form  of  a  table. 

Subgroup  Generators 

&)  W 


^  Wrs  for  r  <  s,  and  all  Vrs. 

We  have  mentioned  that  any  nontrivial  finite  dimensional  representation  of 
5p(2n,  &£)  is  necessarily  nonunitary.  It  turns  out  that,  with  no  loss  of  generality,  we  may 
issume  that  the  "compact"  generators  of  K(n)  are  hermitian,  while  a  balance  of  "non- 
;ompact"  generators  are  antihermitian.  That  is,  in  any  finite  dimensional  representa- 
ion  we  can  assume  the  following 

Generators  of  K(n)  =  compact  generators 

=  Wrs  -  Wsr,  Vrs  +  Zrs  =  hermitian: 
Balance  of  generators  =  noncompact  generators 

=  Wrs  +  w*r,  vrs  ~  ZrS  =  antihermitian.  (5.8) 

Fhe  noncompact  generators  can  be  arranged  into  complex  combinations  with  definite 
:ensor  behaviour  under  U(n).  These  combinations  are 

Trs=Tsr  =  Vrs-Zrs-i(Wrs+Wsr}. 

Trs  =  Tsr  =  Vrs  -  Zrs  +  i(Wrs  +  Wsr).  (5.9) 

rhen  the  complete  set  of  Sp(2n,^)  commutation  relations  (5.4)  appears  in  a  U(n) 
idapted  form  [9] 

\_Ars,  Aav]  =  osuArv  —  orvAus\ 

L^rs'   •*  uv  J        ^su  •*  rv    '    "sv  •*  ru' 

L^ra»  TUV^  =  ~  &,„  Tsv  —  dn.  Tsu; 

[Tn,  TUJ  =  -  4(6nAn  +  5roAsu  +  6suArv  +  5SVAJ; 

[T,  T]  =  [f,f]=0.  (5-10) 

WQ  see  that  Trs  and  frs  are  second  rank  symmetric  tensors  under  U(n),  of  contravariant 


the  composition  law  (2.9)  of  the  ^/'s  as  simple  as  possible.  The  answer  is  that  this  can  be 
done,  and  upon  maximum  simplification  we  can  achieve 

S1,S26Sp(2«,.«)    W(S1)<%(S2)=±W(S1S2).  (6.1) 

This  sign  ambiguity  cannot  be  eliminated.  So  we  say  that  we  have  here  a  two-  valued 
unitary  representation  of  Sp(2n,£#).  A  more  correct  or  useful  statement  is  that  the 
operators  involved  provide  a  faithful  unitary  representation  of  the  metaplectic  group 
Mp(2n),  which  is  a  two-fold  covering  of  Sp(2n,$)  [10].  Strictly  speaking  this  means 
that  the  argument  of  '?/(•)  should  be  an  element  of  Mp(2n),  not  SeSp(2n,<%).  However, 
having  made  this  point,  we  shall  continue  to  write  tf/(S)  as  in  (2.8),  (2.9)  and  (6.1). 

The  generators  of  this  metaplectic  representation  of  Sp(2n,£?)  are  all  hermitian;  in 
terms  of  q's  and  p's  they  are  the  quadratic  expressions  [1  1] 


Vn  =  M,*H  =  P,P,'  (6-2) 

The  characteristic  differences  between  the  compact  and  the  noncompact  combinations 
become  clear  when  expressed  in  terms  of  a's  and  af's 

Compact  generators:  Wrs  —  Wsr  =  i(dldr  —  a\a^, 

Vrs  +  Zrs  =  (d]ds  +  did,  +  «5J,  (6.3a) 

Non  compact  generators:  Wrs  +  Wsr  —  i(al&l  —  drds), 

?n-Zn  =  &M  +  W.-  (6.3b) 

We  see  that  the  compact  generators  of  U(n)  conserve  "total  photon  number",  thus  this 
subgroup  of  Sp(2«,^?)  consists  of  "passive"  transformations.  The  noncompact  gene- 
rators on  the  other  hand  do  not  conserve  "photon  number",  so  we  may  call  them 
"active"  generators.  These  properties  are  expressed  thus 


[Wrs-Wsr    or     Krs  +  Zrs,N]  =  0, 

LWrs+Wsr    or     Krs-Zrs,JV]^0.  (6.4) 

In  fact,  single  exponentials  of  i  times  real  linear  combinations  of  the  compact  gener- 
ators give  us  operators  of  the  form  <%(S(X,  Y));  while  single  exponentials  of  i  times  real 
linear  combinations  of  the  noncompact  generators  give  us  operators  of  the  form  °U(P\ 
-Pell(n).  For  this  reason  the  latter  may  be  called  "squeezing  transformations"  [12], 
[13];  and  the  polar  decomposition  (4.19)  may  be  read  as  stating  that  any  metaplectic 
unitary  transformation  is  uniquely  the  product  of  a  compact  passive  factor  and  a  non- 
compact  active  squeeze  factor.  The  definition  and  production  of  squeezed  states  are 
taken  up  in  more  detail  in  §  10. 
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The  Schrodinger  description  of  the  Hilbert  space  3F  on  which  the  metaplectic 
representation  acts  has  been  given  in  (2.7).  In  this  description,  the  eigenvectors  |q>  of 
the  commuting  position  operators  qr  appear  as  a  basis 


—  ih—  <q|.  (6.5) 

It  is  useful  to  know  that  certain  operators  <%(S)  have  very  simple  actions  on  these 
basis  vectors.  We  list  them  below 

A       0 
0  U"1 


l/2(A-1q\\  (6.6a) 

Z)(K)  =  diagOq,. . . , Kn, K~ *, . . .  ,K~ l),  Kr  >  0: 


r-  1 
n  \  1/2 

«  \~l/2 


. 

9 


(6.6b) 


qTgqqy.  (6.6c) 

With  the  help  of  these  results,  and  the  pre-Iwasawa  decomposition  for  elements  of 
Sp(2n,3?)  described  in  §4,  it  turns  out  to  be  possible  to  calculate  the  generalized 
Huyghen's  kernel  in  n-dimensions  without  too  much  effort.  This  kernel  is  the  confi- 
guration space  matrix  element  <q|^(5)|q'>  of  the  metaplectic  unitary  operator  tft(S). 
We  recall  that  in  the  case  of  one  degree  of  freedom,  this  kernel  has  the  following 
form  [11] 
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(These  results  are,  strictly  speaking,  valid  only  if  S  is  sufficiently  close  to  the  ident 
point  being  that  <?/(•)  is  actually  a  representation  of  Mp(2)  and  so  carries  as  argurr 
element  of  this  group.  Therefore  the  generalised  Huyghens  kernel  is  not  expr 
totally  in  terms  of  SeSp(2,$).)  We  may  regard  the  case  i>^0  as  generic 
generalizes  nicely  to  any  number  of  dimensions  and  we  find 


-  inn/4- 


x  exp 


The  nongeneric  case  when  det  B  =  0  has  to  be  handled  carefully  -  then  the 
collapses  to  a  lower  dimensional  expression  with  a  certain  number  of  delta  fu 
factors.  However,  having  given  an  indication  of  the  structure  involved,  we  will 
into  any  further  details. 

7.  Sp(2n,^)  actions  on  Wigner  and  diagonal  coherent  state  representations 

Let  f  be  any  quantum  mechanical  operator,  specified  in  the  Schrddinger  repr< 
tion  by  its  configuration  space  kernel  <q|f  |q'>.  We  can  see  that  if  F  is  conjug£ 
ty(S)  for  general  SeSp(2n,t%),  the  change  in  the  kernel  involves  an  integral  tran 
ation  in  which  the  generalised  Huygens  kernel  (6.8)  and  its  complex  conjugal 
appear.  We  can  ask  whether  there  is  any  other  way  of  specifying  or  describing 
that  this  change  takes  a  simpler  form,  not  requiring  any  integrations  at  all.  Indee 
is,  and  it  is  given  by  the  use  of  the  techniques  due  to  Weyl,  Wigner  and  Moyal  (\ 
[14].  We  describe  this  aspect,  and  then  go  on  to  another  practically  important 
describing  operators,  namely  via  the  diagonal  coherent  state  representation,  ; 
behaviour  under  Sp(2n,  ^). 

Let  us  hereafter  set  h  —  \.  From  the  configuration  space  kernel  <q|f  |q'>  o 
obtain  its  Wigner  distribution  or  WWM  representative  by  a  partial  Fourier  tran 
ation 


<q|f|q')= 

Here  W(£,)  is  a  function  on  the  classical  phase  space  corresponding  to  the  qu 
system,  with  arguments  which  are  In  classical  c-number  g's  and  p's.  As  seen  abo 


can  recover  the  operator  F  from  its  WWM  representative  W(E,}  unambiguously.  Then 
one  finds  that  under  conjugation  by  the  metaplectic  operators  ^(5),  the  changes  in 
f  are  very  simply  expressed  in  terms  of  W(£)  [15] 


f'  =  <%(S)-  *?<%($)<*  W'(Q  =  W(SQ.  (7.2) 

This  behaviour  of  W(£)  may  in  fact  be  regarded  as  the  key  or  characteristic  property  of 
the  WWM  method  in  quantum  mechanics;  we  may  say  that  this  description  of 
operators  is  covariant  under  the  full  symplectic  group  Sp(2n,$). 

Next  we  turn  to  the  diagonal  coherent  state  description  of  operators  f  [16].  For 
n  degrees  of  freedom  the  coherent  states  are  defined  as  usual  by 


*-r=l  r=l 

dr|0>  =  0, 

f)\7\  =  Z\7^>         7  fC        7  =  (z  z}  (13} 

I*..  I  Ms  /    ~~   £*  r\£j  /  ^  Zi..  C.  \-s  9  Ju   I  £*  i^*   *•;*•'•]/*  \    I  »*J  j 

These  are  normalised  states,  no  two  being  orthogonal,  and  can  be  written  also  as  the 
result  of  phase  space  displacement  operators  acting  on  the  ground  state  |0>: 


.r=l 

<z'|z>  =  exp{  —  |z'V—  |zfz  +  z'fz}.  (7.4) 

The  coherent  states  form  an  (over)  complete  set;  the  resolution  of  the  identity 

/*  M  1   O 

1-fn^izX.i  (7.5) 

'    J  r=i    n 

shows  that  any  vector  |^>  can  certainly  be  expanded  using  them 

^«A(Z*)|2> 
r=l      n 

\j/  (z)  =  <z*  |  \l/  >  =  exp(  —  jztz)  (entire  analytic  function  of  z).  (7.6) 

Moreover  the  overcompleteness  allows  expansion  of  any  operator  f  in  the  form  of  an 
integral  over  projections  to  these  states 

"\\7\S7\  ni\ 

r\*'/\e'\-  {'•'} 

For  given  f ,  this  expansion  and  the  weight  function  </>(z)  are  unique,  however  the  latter 
could  in  general  be  a  distribution  of  quite  a  singular  kind. 

Now  we  look  at  the  behaviour  under  Sp(2n,  £%).  It  turns  out  that  the  states  |z>  have  a 
simple  behaviour  only  under  the  "passive"  maximal  compact  subgroup  K(n)  c: 


=  |Uz>.  (7.8) 

This  can  be  traced  to  the  fact  that  the  generators  of  K(n)  involve  only  terms  of  the  form 
tfa,  as  we  see  in  (6.3a).  On  the  other  hand,  the  effect  of  ^(P),  for  any  PeFI(n),  on  |z> 
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involves  an  integration  using  a  suitable  kernel,  namely  the  generalised  Hi 
kernel  expressed  in  the  coherent  state  language  [17],  which  for  the  Sp(2,  #)  cas 
written  in  a  simple  form  in  terms  of  complex  SU(1, 1)  parameters  A/x 


z2     z'*2 


•—  z 


1- 


.'12 


x  exp|  -^ {(/*  +  ju)z2  +  zz'*  4-  (A*  -  ju*)z'*2} 

2.A. 


12 


Im(ju  —  /l)  =  0,     i.e.,     (p  =  2mt,    n  =—1,0,1, 


(Here  each  individual  square  root  is  defined  to  have  a  positive  real  part.) 
In  contrast  to  the  WWM  result  (7.2),  we  now  have  covariance  under  K(n) 


Active  elements  of  Sp(2n,.#)  change  0  in  a  manner  involving  a  nontrivial 
transformation. 

8.  Quantum  noise  matrices  and  their  Sp(2/i,  ^)  behaviour 

Let  /5  be  the  density  operator  of  any  (pure  or  mixed)  quantum  state.  For  simplici 
let  us  assume  that  the  means  of  £  vanish 


Nonzero  values  for  these  means  can  always  be  reinstated  by  a  suitable  pha: 
displacement.  The  variance  or  noise  or  second  order  moment  matrix  of  the  s 
then  defined  as  follows  [12] 


This  is  a  real  symmetric  2n  x  In  positive  definite  matrix  subject  to  further  matrix 
inequalities  which  express  the  uncertainty  principles  (see  below). 

We  must  note  the  compact  way  in  which  we  are  able  to  express  Vab  as  a  phase  space 
integral  involving  the  WWM  representative  W(£)  of  the  density  operator  p.  This  is 
a  consequence  of  the  general  rule  [14] 


valid  for  any  numerical  £0.  Indeed,  one  can  regard  this  property  of  W(£)  as  being  as 
basic  as  the  symplectic  transformation  rule  (7.2).  A  particular  case  of  (8.3),  when  £0  =  0, 
shows  that  W(£)  is  normalised,  because  Trp  =  1;  its  phase  space  integral  is  unity.  But  we 
must  remember  that  though  for  hermitian  p  the  function  W(£)  is  real,  it  may  not  be 
non-negative  everywhere. 

The  change  in  the  noise  matrix  V  when  p  is  changed  by  a  symplectic  transformation 
is  now  easily  obtained  by  exploiting  (7.2)  along  with  the  above  expression  for  Vab  in 
terms  of  W(£).  We  have  the  extremely  simple  law,  for  any  SeSp(2«,  &) 

p'  =  ^(S)p^(Sr1^>V'  =  SVST.  (8.4) 

(Note  that  in  comparison  to  the  change  f-»T  '  in  (7.2),  S  has  been  replaced  by  S~  1 
here).  We  may  say  that  V  undergoes  a  symmetric  symplectic  transformation,  which 
preserves  its  symmetry  and  positive  definiteness. 

The  information  contained  in  V  can  also  be  given  in  complex  form  using  second 
order  moments  of  d's  and  tf's 


Thus  V(c}  is  a  hermitian  In  x  2n  positive  definite  matrix  subject  to  further  uncertainty 
inequalities  given  below. 
The  relations  connecting  these  two  forms  of  the  noise  matrix  are 
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Here  the  matrix  Q  is  as  given  in  (4.4).  And  in  place  of  (8.4)  we  have  the  eq( 
transformation  law 


9.  Williamson's  theorem  and  uncertainty  principles 

For  general  SeSp(2n,^)  the  transformation  law  (8.4)  for  the  noise  matri) 
a  similarity  transformation,  it  is  a  similarity  only  if  S  =  S(X,  y)eK(n).  Norn- 
expect  that  the  diagonalization  of  V  will  require  a  matrix  belonging  to  thi 
SO(2n,$).  However,  the  fundamental  Theorem  of  Williamson  comes  to  oui 
[18].  This  theorem  is  a  complete  answer  to  the  question:  Given  a  real  syr 
In  x  In  matrix  V,  what  is  the  maximum  simplification  we  can  achieve  in  the 
V'  =  SVSr  by  allowing  S  to  vary  all  over  Sp{2«,.'#)?  For  general  K,  the  no 
canonical  form  of  V  is  not  a  diagonal  form;  however  the  theorem  shows  that  in  < 
positive  (or  negative)  definite  to  begin  with,  then  we  can  certainly  choose  an  S  sc 
is  diagonal.  By  suitable  further  rcscaling  and  ordering  of  elements  we  can  then 
the  following 

V  real  symmetric  positive  definite: 
SVST  = 


suitable  SeSp(2n,$). 


We  shall  call  this  the  Williamson  normal  form  of  V.  In  general,  the  K,  are 
eigenvalues  of  Kat  all.  Also  note  that  when  Vis  in  this  form,  then  we  have  K(' 
In  the  Williamson  normal  form  we  see  that  for  each  canonical  pair  c/r,  pr  we  ha 
uncertainties:  Agr  =  Apr  =  /cr1/2.  Also  all  the  off-diagonal  variances  vanish.  There 
complete  statement  of  the  uncertainty  principles  for  all  degrees  of  freedom  woi 


A  given  In  x  In  real  symmetric  positive  definite  matrix  V  is  quantum  rnech 
realizable  as  the  noise  matrix  of  some  state  f>  if  and  only  if  in  its  Williamson 
form  (9.1)  every  diagonal  entry  is  greater  than  or  equal  to  one-half.  This  is  an  S 
invariant  statement.  It  is  naturally  useful  to  express  these  uncertainty  pi 
directly  in  terms  of  V  without  actually  passing  to  its  Williamson  normal  for 
There  are  several  ways  of  doing  this  and  for  illustration  we  quote  just  one,  ej 
both  in  terms  of  V  and  V(c} 

V  +  -/?  =  hermitian  positive  sernidefmitc. 
V(c)  +  ^Z3  =  hermitian  positive  scmidefmite. 


We  emphasize  that  any  F,  V{c)  obeying  these  conditions  are  quantum  mechanically 
realizable  -  they  are  necessary  and  sufficient. 

There  is  a  subtle  distinction  between  the  matrix  V  being  brought  to  diagonal  form, 
and  the  matrix  V(c)  being  diagonal.  From  the  relations  (8.6)  among  the  two  sets  of 
submatrices  we  can  see  easily 

V  diagonal o  Vl ,  V3  diagonal,  V2  =  0, 
<=><«/,  US  real  diagonal 

=?£  K(c)  diagonal:  (9.4a) 

V(c]  diagonal  <=><c/  real  diagonal,  $  =  0 
=>V,,V3  diagonal,  V2  =  Q 
•oK  diagonal.  (9.4b) 

This  means  that  the  set  of  states  which  have  diagonal  V(c}  is  a  subset  of  the  set  of  states 
having  diagonal  V:  the  former  is  a  more  restrictive  condition,  so  fewer  states  obey  it.  In 
the  particular  situation  when  V  is  in  Williamson  normal  form,  it  is  not  only  diagonal 
but  in  addition  V1  —  K3 ;  thus,  in  spite  of  the  general  statement  (9.4a),  we  do  find  V('-)also 
diagonal  in  that  case.  In  complex  form,  the  relevant  consequence  of  Williamson's 
Theorem  for  V(c}  is  that  with  the  transformation  law  (8.7),  V(c]  can  be  "diagonalized". 

10.  A  U(n)-invariant  multimode  squeezing  criterion 

The  noise  matrix  V  for  any  state  of  a  single  mode  system  is  two  dimensional  and  has  the 
form 


v_,,     Afop) 
VAfop)   (Ap)2 

with  obvious  meanings  for  the  various  matrix  elements.  The  usual  Heisenberg  uncer- 
tainty relation  (with  h  =  1)  which  reads  [19] 


£  (10.2) 

can,  as  is  well  known,  be  strengthened  to  the  statement  [20] 

det  V=  (A<?)2(Ap)2  -  (Afop))2  &  J.  (10.3) 


This  is  in  fact  the  content  of  the  conditions  (9.3)  in  this  case,  and  this  is  Sp(2,  &t}  invariant. 
The  state  p  with  variance  matrix  V  is  usually  said  to  be  a  squeezed  state  if  either  one 
of  the  two  diagonal  elements  of  V  (but  of  course  not  both)  is  less  than  one  half.  However 
this  definition  possesses  no  useful  or  interesting  continuous  invariance  at  all.  A  defini- 
tion of  squeezing  possessing  invariance  under  the  maximal  compact  U(l)  or  SO(2) 
subgroup  of  Sp(2,^2)  is  this:  the  state  p  is  squeezed  if  and  only  if  the  lesser  of  the  two 
eigenvalues  of  V  is  less  than  one  half.  A  state  squeezed  in  the  former  sense  is  squeezed 
also  in  the  latter  sense  but  not  conversely,  so  the  former  is  more  restrictive 

A<j  or  Ap<—  7=  =>  lesser  eigenvalue  of  V  <\.  (10.4) 
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The  U(l)-invariant  squeezing  criterion  has  been  used  in  several  studies.  It  is  of 
clear  that  we  cannot  ask  for  any  more  invariance  in  the  squeezing  criterion,  for  e; 
it  would  be  meaningless  to  think  of  an  Sp(2,^?)  invariant  squeezing  criterion. 
Motivated  by  the  above,  we  now  describe  a  criterion  for  squeezing  for  si 
n  mode  systems  [12].  Suppose  that  for  a  given  state  p  the  noise  matrix  V  alre* 
some  diagonal  element  less  than  one  half.  Then  we  say  that  the  state  is  ma 
squeezed.  However  it  may  happen  that  every  diagonal  element  Vaa  of  7exo 
equals  one  half,  yet  squeezing  is  buried  and  not  manifest.  We  have  divided  elen 
Sp(2n,&)  into  passive  U(«)  elements  and  active  Yl(n)  elements.  We  would  like  • 
a  definition  of  squeezing  invariant  under  passive  U(n)  transformations.  Such  a 
tion  is  the  following 

p  is  a  squeezed  state <*(S(X,  Y)  VS(X,  Y)T)aa  <  £, 
some  X  —  iYe\J(n), 
some  a  =  1,2,...,2«. 

Thus,  p  is  squeezed  if  V  shows  it  to  be  manifestly  so,  or  if  this  happens  after  a  s 
passive  transformation. 

We  know  that  in  general  V  cannot  be  diagonalized  by  similarity  transforr 
within  K(n)  -  we  expect  to  have  to  use  matrices  from  the  much  larger  group  SO 
which  may  be  noncanonical.  Nevertheless  it  is  remarkable  that  the  abov< 
invariant  squeezing  criterion  can  be  expressed  in  terms  of  the  eigenvalue  spectri 

p  is  a  squeezed  state<=> 

l(V]  =  minimum  eigenvalue  of  V<\. 

This  is  precisely  equivalent  to  the  definition  (10.5). 

Obviously  the  squeezed  or  nonsqueezed  nature  of  a  state  p  is  unchanged  by 
elements  of  Sp(2n,  £?)  lying  within  K(n),  since  then  V  undergoes  a  similarity  trai 
ation  which  leaves  l(V)  unaltered.  To  change  l(V\  and  so  the  status  of  a  state,  ii 
direction,  we  must  use  active  noncompact  elements  lying  in  Il(«),  if  we  wish  ti 
within  the  framework  of  Sp(2n,^)  transformations.  Thus  justifies  our  using  tl 
"squeezing  transformation"  for  the  metaplectic  unitary  operator  <%(P)  for  el 
Pell(n). 

11.  Some  interesting  families  of  variance  matrices 

Let  us  denote  by  s  the  set  of  all  allowed  noise  matrices  V,  i.e.,  all  physically  re; 
ones  obeying  the  uncertainty  principles 

s  =  <  V  =  In  x  In  real  symmetric  positive  definite  \V  +  -fi 

positive  semidefinite  >. 

This  is  an  n(2n  +  1)  parameter  family.  We  have  seen  that  a  general  Ves 
diagonalizable  by  elements  within  K(n).  We  ask:  can  we  characterize  the  subse 
consisting  of  all  noise  matrices  which  are  diagonalizable  using  elements  of  K( 


VesK=>S(X,  Y)  VS(X,  Y)T  =  diagonal 

someU  =  X-zTeU(«).  (11.2) 

The  subset  SK  is  an  n(n  +  2)  parameter  family. 

There  are  two  further  subsets  of  SK  which  are  interesting  [12].  We  call  them  the 
"hermitian"  family  SH  and  the  "Gaussian"  family  SG  -  they  are  of  dimensions  n2  and 
n(n  +  1)  respectively.  The  definition  of  SH  is  motivated  by  the  fact  that  the  general 
transformation  rule  (8.7)  for  V(c}  becomes  very  simple  if  we  have  a  K(«)  element 

UeU(n):  K(C)-^5(C)(U)  V(e)S(t)(U)\ 


(11.3) 

So  if  ^  =  0  to  begin  with,  it  remains  zero;  and  stf  being  hermitian  can  be  diagonalized 
by  some  U.  This  would  then  result  in  both  K(c)  and  V  becoming  diagonal.  Thus, 
including  the  uncertainty  conditions  (9.3),  the  definition  of  SH  is 

%  =  {  VesK\&  =  0,  jtf  —  \-  1  positive  semidefinite}  a  SK  c:  s.  (1  1.4) 

One  can  verify  that  no  member  of  this  family  SH  is  squeezed. 
Next  to  the  family  SG.  Here  the  definition  involves  the  "noncompact"  subset  Ft  (n)  of 


SG  =  {VesK\2VeU(n)  c  Sp(2rc,  #)}  c  SK  c  s.  (1  1.5) 

It  is  a  fact  that  such  noise  matrices  are  physically  realizable,  i.e.,  they  obey  the 
uncertainty  conditions,  and  are  diagonalizable  within  K(«).  Further,  except  for  K  =  -|-  1, 
every  other  VesG  is  squeezed.  So  we  see  incidentally  that  this  is  the  only  common 
element  in  SH  and  SG.  All  in  all,  we  have 

%'  5G  cr  SK  c:  5 
sHnsG={fl}.  (11.6) 

12.  Gaussian  pure  states,  Gaussian  Wigner  distributions 

The  generators  of  the  metaplectic  unitary  representation  of  Sp(2n,^£)  are  quadratics  in 
<fs  and  p's.  The  most  general  centred  «-mode  Gaussian  wave  function  involves 
a  quadratic  in  the  g's  in  the  exponent.  It  turns  out  that  the  former  act  in  very  nice  and 
compact  ways  on  the  latter  [21].  We  describe  the  main  features  briefly  in  this  Section. 
A  general  centred  and  normalized  /7-mode  Gaussian  pure  state  can  be  parameterized 
by  two  real  symmetric  n  x  n  matrices  u  and  v,  of  which  the  former  is  positive  definite 


q)l2  =  l.  (12-1) 
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The  calculation  of  the  WWM  representative  for  (/%,„) (q),  and  of  the  noise  mati 
easy  since  only  Gaussian  integrals  and  moments  are  involved.  The  results  are 

gVoi  8)fa-  i9')</%,y)(q  +  i<lTexp(iq"p) 


.  u  +  uu   *u  iw 
G(u,y)=          _,  _i 

v     ;      '      u   Ju       u   ! 


~1/2 


F(«,  0)  =  |  G(U,  V)-  *  =  iS(u,  17)S(tt,  V)T€SG. 

While  the  calculations  leading  to  these  results  are  elementary,  it  is  worth 
attention  to  the  structures  involved.  The  WWM  representative  W(U|P)(^)  is  expei 
be  a  Gaussian,  with  a  positive  definite  matrix  G(u,  v)  in  the  exponent.  \\ 
interesting  is  the  factorization  of  this  matrix  in  terms  of  an  Sp(2n,  ^)-matrix  S(i 
added  feature  is  that  this  S(u,v)  is  an  example  of  the  product  of  the  first  two  fac 
the  general  pre-Iwasawa  decomposition  (4.21)  for  any  5eSp(2n,^)!  That  the 
matrix  V(u,  v)  should  be  essentially  the  inverse  of  G  (u,  v)  is  clear  from  (8.2);  it  is  tl 
product  structure  for  G(w,  v)  that  results  in  V(u,  v)  being  an  element  of  the 
SG  defined  in  the  previous  Section.  Incidentally  we  also  see  that  IT(n)cSp 
defined  in  (4.18)  and  used  in  defining  the  family  SG  in  (11.4)  can  be  describee 
explicitly  using  S(u,  v) 


U(n)  =  {S(u)v) 

u  and  v  real  symmetric  n  x  n  matrices,  u  positive  definite}. 

We  can  argue  further  along  similar  lines,  concerning  the  form  to  be  expected 
action  of  an  operator  <%(S)  on  any  iA(u,u)(q).  From  (7.2)  it  is  clear  that  W(u>v)(£)  m 
mapped  onto  another  Gaussian;  the  fact  that  it  arises  from  a  pure  state  wave  fu 
must  also  be  retained;  and  the  transformation  rule  (8.4)  plus  the  explicit  form  of 
in  (12.3)  means  that  any  iA(M>u)(q)  can  be  mapped  onto  <A(1  ,0)(q)  by  a  suitable  <%($).  , 
is  indeed  true.  We  find  that 

^(«,»)  =  (Phase 


,     =    -. 
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So  Sp(2n,  ^?)  acts  transitively  on  the  set  of  Gaussian  pure  states.  The  subgroup  of 
Sp(2rc,^)  leaving  the  particular  state  i//(1>0)  invariant  (apart  from  phases)  is  just 
K(n)  -  ar  annihilates  <A{li0),  and  the  generators  of  K(n)  in  the  metaplectic  representation 
are  of  the  form  a^as.  More  precisely,  the  situation  may  be  described  as  follows.  One  can 
easily  establish  that  the  behavior  of  (A(1(0)  under  K(n)  is  given  by 

U  =  X-iYe\J(n): 

V(S(X,  Y))iA(1.0)  =  v/deOJ.Vo).  (12.6) 

The  sign  ambiguity  here  explains  the  appearance  of  the  metaplectic  group  Mp(2n). 
The  stability  group  of  \j/(1  i0)-the  subgroup  of  Sp(2n,^)  leaving  this  vector  strictly 
invariant  is  thus  the  n2  —  1  parameter  subgroup  SU(n)e£/(n).  Correspondingly  the 
orbit  of  i/f(1)0)  under  Sp(2n,^),  made  up  of  the  vectors  ^(5)^(li0)  for  all  SeSp(2n,^), 
is  an  (n(n  +  1)  +  1)  parameter  family.  It  consists  of  the  vectors  eia\j/(u  v)  for  0  ^  a  <  2n  and 
all  allowed  (w,u).  This  orbit  is  essentially  the  coset  space  Sp(2n,  ^?)/SU(n);  there 
is  a  one-to-one  correspondence  between  vectors  e'ai//(u  i0)  and  points  in  this  space. 
Suppressing  the  phase  we  can  next  say  that  the  set  of  density  matrices  ^  (UtV}^lUtV}, 
or  equally  well  the  set  of  representative  vectors  \l/(uo),  is  essentially  the  coset  space 


Turning  next  to  the  effect  of  <%(S)  on  <A(UiU),  we  see  that  apart  from  a  phase  factor 
it  has  to  result  in  ij/(u-<v>)  for  suitable  u'  and  v'.  The  formula  for  this  change  is  a 
beautiful  one 

«(S)^(H>n)  =  (phase  factor)^,.,.,; 

A  =(iM-u)"1->  A'-^V-i;')"1 

=  (AA  +B)(CA  +Z))'1.  (12.7) 

The  Gaussian  WWM  function  in  (12.3)  arose  from  a  pure  state.  Suppose  now  we 
consider  a  general  centred  normalized  Gaussian  phase  space  distribution  with  a 
general  parameter  matrix  G 

WG{§  =  7r-"(detG)1/2exp(-  cfG£), 

G  =  real  symmetric  2n  x  In  positive  definite  matrix.  (12.8) 

The  question  is:  When  is  this  a  WWM  function  corresponding  to  some  pure  or  mixed 
quantum  state?  Thanks  to  Williamson's  theorem,  the  answer  is  elementary  [15].  The 
noise  matrix  would  clearly  be 

K=^G~1.  (12.9) 

So  we  start  from  the  given  Gaussian  WG(£\  pass  to  its  V,  then  go  to  the  Williamson 
normal  form  (9.1)  of  V 

F-*diag(;q,...,  *„,*!,...,*„).  (12.10) 

and  then  demand  that  each  Kr  be  greater  than  or  equal  to  one  half.  This  is  a  complete 
necessary  and  sufficient  condition  for  WG(£)  to  be  a  bonafide  WWM  phase  space 
distribution;  as  we  have  seen,  however,  this  condition  can  be  stated  directly  without 
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going  to  the  normal  form 

W      is  a  WWM  distribution^ G  "  *  4-  zjff  =  hermitian  positive  sem 


13.  Concluding  remarks 

In  this  review  we  have  tried  to  convey  the  main  features  of  the  family  of  real  sy 
groups  Sp(2n,  @t\  and  have  outlined  some  problems  in  optics  and  quantum  m 
where  they  are  useful.  Our  account  has  been  descriptive  and  suggestive,  . 
detailed  proofs  of  various  statements  made.  We  believe  that  any  interestei 
wishing  to  apply  symplectic  techniques  to  any  concrete  problem  would 
equipped  for  the  purpose,  and  able  to  supply  necessary  details. 

Some  general  remarks  -  partly  to  counter  apparently  common  misconce 
may  be  useful  at  this  stage.  The  group  Sp(2n,#2)  comes  in  when  we  defir 
canonical  transformations  on  given  canonical  variables.  It  is  the  fact  that  the  co 
tion  relations  and  hermiticity  are  maintained  that  is  responsible  for  the  exis 
unitary  operators  $t(S)  implementing  these  transformations.  In  particular, 
unitary  whether  SeK(n),  when  S  itself  is  unitary,  or  S£K(n),  for  example  Se 
which  case  S  is  hermitian  rather  than  unitary.  Correspondingly,  the  "Hamil 
generating  these  unitary  operators  tft(S)  are  hermitian  quadratics  in  the  $'s  and 
the  case  of  the  most  general  unitary  evolution  via  a  general  hermitian  Hamiltoi 
action  on  <f  s  and  p's  (equivalently  on  ffs  and  aps)  definitely  preserves  the  c< 
commutation  relations  and  hermiticity  properties,  but  the  transformed  operat 
not  be  linear  combinations  of  the  original  ones. 

The  maximal  compact  subgroup  U(n)  of  Sp(2«,^?)  has  naturally  played  ar 
tant  role  in  our  considerations.  The  n-mode  squeezing  criterion  described  in 
a  built-in  U(n)  invariance.  As  a  result,  for  a  state  with  a  given  variance  m 
squeezing  if  present  may  be  manifest  (one  of  the  diagonal  elements  of  V  is  less  t 
or  may  be  hidden  (this  happens  only  after  a  suitable  U(n)  transformation).  This 
clear  that  our  squeezing  criterion  is  weaker  than  the  usual  one  stated  directly  in 
the  diagonal  elements  of  V,  since  these  would  rule  out  the  hidden  case.  Correspc 
there  are  more  states  which  are  squeezed  by  our  criterion  than  by  the  usual  on 
in  any  case  has  much  less  invariance  built  in. 

The  metaplectic  group  is  essential  to  describe  properly  the  structure  of  the 
operators  implementing  linear  canonical  transformations  on  the  canonical  v 
(in  this  respect  the  notation  $t(S)  with  SeSp(2n,^)  is  inadequate  -  the  argu 
<%(S)  should  be  an  element  of  Mp(2n)).  The  importance  of  this  group  is  see 
example,  the  calculation  of  geometric  phases  for  cyclic  evolution  of  squeezed  st 
interpretation  of  the  Guoy  phase,  etc  [22].  It  has  also  been  shown  elsewhere 
metaplectic  group  is  relevant  in  setting  up  operator  Mobius  transformations 
degree  of  freedom  [17]. 

Work  on  extending  U(w)  invariant  notions  beyond  quadrature  squeezi: 
progress  and  will  be  reported  elsewhere.  Thus,  for  two-mode  systems,  one  can 
squeezing  transformations  into  U(2)  invariant  equivalence  classes,  study  buncl 
antibunching  with  such  invariance,  etc.  [13].  These  notions  can  be  genera 
n  mode  systems  as  well. 
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Abstract.  A  rigorous  derivation  of  the  optical  theorem  (OT)  from  the  conservation  of  probabil- 
ity flux  (CPF)  is  presented  for  scattering  on  an  arbitrary  spherically  symmetric  potential  in 
JV-spatial  dimensions  (ND).  The  constructed  expression  for  the  OT  is  found  to  yield  the 
corresponding  well-known  results  for  two-  and  three-dimensional  cases  in  a  rather  natural  way. 
The  Aharonov-Bohm  (AB)  effect  is  considered  as  a  scattering  event  of  an  electron  by  a  magnetic 
field  confined  in  an  infinitely  long  shielded  solenoid  and  a  similar  derivation  is  attempted  for  an 
appropriate  optical  theorem.  Our  current  understanding  of  the  scattering  theory  is  found  to  be 
inadequate  for  the  purpose.  The  reason  for  this  is  discussed  in  some  detail. 

Keywords.    Optical  theorem;  AB  scattering. 
PACS  No.    03-30 


1.  Introduction 

In  the  three  dimensional  (3D)  quantum  mechanical  scattering  the  optical  theorem 
(OT)  expresses  the  total  cross  section  in  terms  of  imaginary  part  of  the  forward 
scattering  amplitude,  f(0  =  0).  In  two  spatial  dimensions  the  theorem  does  not 
look  exactly  like  that  found  in  the  3D  case.  However,  it  can  be  made  to  appear  so 
by  not  absorbing  a  phase  factor  [1]  into  the  definition  of  scattering  amplitude.  Being 
related  to  the  unitarity  of  S  matrix  or  t  matrix  on  the  energy  shell,  the  OT  provides 
a  mathematical  statement  for  the  conservation  of  probability  flux  [2]  (CPF).  Therefore, 
extraction  of  the  OT  from  CPF  should  not  involve  assumptions  or  approximations 
that  vary  with  the  dimensionality  of  space  for  the  scattering  event.  But 
studies  in  existing  literature  [3, 4]  indicate  that  this  is  not  the  case  with  2D  scattering. 
The  derivation  of  2DOT  does  really  involve  mathematical  approximations  which  are 
not  demanded  in  a  similar  logical  progression  for  the  3DOT.  More  specifically,  the 
2DOT  is  constructed  by  assuming  /(0)~/(0)  or  by  making  explicit  use  of  the 
stationary  phase  approximation  [5].  The  situation  thus  calls  for  a  more  detailed 
investigation  into  the  causal  connection  between  the  OT  and  CPF.  To  that  end  we 
derive  the  optical  theorem  in  N-spatial  dimensions  by  using  an  equivalent  mathema- 
tical procedure  as  is  followed  to  deal  with  the  3DOT.  Further,  we  examine  the  OT 
in  the  context  of  Aharonov-Bohm  (AB)  scattering  [6,7].  This  is  expected  to  be  quite 
instructive  because  the  AB  effect  is  characterized  by  a  non-trivial  geometry  [8]  that 
does  not  fit  into  the  framework  of  formal  scattering  theory.  Also,  regarded  as 
a  scattering  event  the  process  is  characterized  by  kinetic  momentum  (mr  =  p  —  (e/c)  A) 
rather  than  the  canonical  momentum  (p). 
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In  §2,  we  address  the  proposed  generalization  of  OT  for  the  time-indep< 
Schrodinger  equation  in  an  arbitrary  number  (N)  of  spatial  dimension.  As  a 
emphasized,  our  analysis  will  be  based  on  the  CPF  rather  than  mere  identificatior 
NDOT  from  partial  wave  expressions  [9].  We  shall  pay  special  attention  to  the  sul 
that  arise  in  the  course  of  our  derivation.  The  optical  theorem  for  the  AB  e 
considered  in  §3.  We  have  chosen  to  work  with  two  types  of  scattering  states  of 
system  modelled  by  simply  and  non-simply  connected  spaces  and  found  that  nei 
the  representations  permits  one  to  derive  the  theorem  rigorously.  However,  our  ai 
of  the  difficulties  encountered  followed  by  appropriate  attempts  to  deal  with  sc 
them  will  be  quite  illuminating  for  future  researchers  in  this  area  of  investigatio 
example,  in  view  of  the  apprehension  that  the  AB  scattering  cross  section  is  st: 
beamsize  dependent,  it  may  be  worthwhile  to  see  if  an  accurate  statement  of  OT  co 
made  by  limiting  the  spatial  extent  of  the  incident  beam.  Finally,  in  §  4,  we  sumi 
our  outlook  on  the  present  work  and  make  some  concluding  remarks. 

2.  OT  in  many  dimensions 

The  N  dimensional  Schrodinger  equation  for  a  hypercentral  potential  V(r]  at  an  t 
£  =  /c2>Oisgivenby  [10] 


with  the  hyperradius  r  written  as 

.V/2 
xixl}     >    Xj  =  rsin01sin02---sin0,-_1cos0i. 

i=l  / 

The  angular  variables  0's  satisfy  the  relation 

(K0|<7t,    i  =  1,2,...,  AT  -2;    0<0W_,<  27t  and  0^  =  0. 
The  Laplacian  V2^  is 


Here  L2(Q)  stands  for  the  grand  orbital  operator  at  the  hyperangle  Q(  = 
0JV-J)  on  the  hypersphere  SN~1.  From  (1)  and  (4)  we  have 

N-ld      L2Q 


Separation  of  the  angular  and  radial  part  of  (5)  yields  the  radial  equation 
d2      N-ld      L(L  +  N-2) 


The  conservation  of  probability  flux  for  the  ND  scattering  is  expressed  as 

limrN-1 
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(8) 
On  the  unit  sphere  SN~*  the  surface  element 

N-l 

dfi  =   FT   sin^"-7"  1 B  dB  (Q\ 

uj.6          j  |    aui  UjUUj.  ^yj 

The  physical  or  outgoing  wave  solution  of  (5)  behaves  asymptotically  as  [11] 

r 

Clearly,  9  is  the  angle  between  the  wave  vector  k  and  r.  In  terms  of  the  L-th  partial  wave 
scattering  phase  shift  <5L  and  the  Gegenbauer  polynomial  C|f  }(x),  the  amplitude/(Q)  is 
given  by 


2\(W-5)/2 


(2L+N-2)(e2i'5'--l)C»'v/2)-1)(cos0).  (11) 


Substituting  (10)  in  (8),  we  get 


hkY  1  1 

y,(r,  ft)  -  - 


ml  r  r 

_feN-l           1 


m     2     r((A' 

If  we  take  the  incident  wave  along  the  j^-axis,  (7),  (11)  and  (12)  supplemented  by  the 
definition 


(13) 

J 

gve 

(N-S)t2 

-- 
(i 


x    >    (2L  +  N-2)(e^t-l)</J"'-i-^;— — /^  V  (14a) 

^-^  '    ^  '        \         *-•  •*!  I.—         JL(  |  |  V  / 

L  =  0 

with 


o       o          o 

N-l 

x   f]  smN-J-l6jdOj.  (14b) 

j  =  i 
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The  integral  I(j*]  is  obtained  from  I("]  by  replacing  the  quantity  in  the  square 
by  unity.  The  expression  for  Ip  is  found  as 


rr 

IL   =  L_N-2)/2)l  L  +  ((N-2)I2}( 


Here  the  prime  denotes  differentiation  with  respect  to  the  argument.  The  d 
of  this  result  is  given  in  appendix  A.  Using  the  asymptotic  form  of  the  . 
its  derivative  in  (15)  the  resulting  expression  can  be  used  in  (14a)  to  reca 
the  form 


In  the  asymptotic  limit  the  contribution  of  7^N)  to  the  scattering  cross  section  di 
because  of  the  factor  (kr)'1  in  front  of  it.  From  (11)  and  (16),  we  get  the  OT, 

v(N-D/2 


for  the  ND  scattering.  Clearly/(Qk)  is  the  scattering  amplitude  in  the  directior 
plane  wave. 

In  three  dimensional  problems  with  rotational-reflectional  symmetry,  /(Qk 
a  function  of  di  only  and  by  our  choice  Ql  =  0.  Therefore,  for  N  =  3  we  recover 
known  optical  theorem  [2] 


Similarly  for  2D  scattering  we  have  [3] 


We  have  seen  that  for  all  spatial  dimensions  the  optical  theorem  is  a  di  rect  cons< 
of  CPF  and  our  derivation  for  the  NDOT  does  not  involve  any  approxi 
However,  from  (15)  it  is  clear  that  use  of  the  asymptotic  values  for  Jv  and  J' 
through  /?  )  requires  SL(k)  to  vanish  for  large  L.  This  observation  will  play  a  cru< 
in  the  discussion  of  OT  for  the  AB  effect.  In  wave  language,  the  OT  follows  from 
that  the  total  cross  section  represents  the  removal  of  flux  from  the  incident  be 
such  a  removal  can  only  occur  as  a  result  of  destructive  interference  betw 
incident  wave  and  the  elastically  scattered  wave  in  the  forward  direction.  Th 
point  is  followed  to  visualize  the  3D  optical  theorem  in  terms  of  shadow  scatters 
Specific  examples  are  often  used  to  make  the  situation  more  transparent.  Cons 
the  hard-sphere  scattering  [13]  we  have 


47T 

yr          T-r 


where 

i    kR 

(21) 


1  =  0 


with  R,  the  radius  of  the  sphere.  The  equivalence  of  (18)  and  (20)  for  the  present  instance 
can  be  established  by  using  the  fact  that  Im[/;enect.on(0)]  averages  to  zero  due  to 
oscillating  phase.  Interestingly,  by  exploiting  the  orthogonality  properties  of  Gegen- 
bauer  polynomials  we  can  also  arrive  at  a  similar  conclusion  for  the  JV-dimensional 
scattering. 

3.  OT  in  AB  scattering 

The  AB  effect  is  a  typical  quantum  mechanical  phenomenon  which  results  from  the 
scattering  of  electrons  by  a  magnetic  field  confined  in  an  infinite  solenoid  [6],  It  thus 
constitutes  the  observable  consequence  of  a  hidden  field  B.  Here  the  vector  potential 
A  plays  a  crucial  role.  For  example,  despite  B  =  0,  the  space  region  outside  the  solenoid 
is  characterized  by  a  non-vanishing  gauge  potential  together  with  concentrated 
singularity  along  the  negative  x-axis.  The  essential  singularity  in  A  along  the  line  is 
called  Dirac  string  which,  from  geometer's  point  of  view,  manifest  itself  in  giving  rise  to 
a  multi-connected  space  [8].  The  AB  effect  exists  for  those  representations  in  which  the 
electron  wave  function  is  either  (i)  discontinuous  in  a  simply  connected  space  3$2  or  (ii) 
continuous  over  a  non-simply  connected  space  (^/{X^^Q,  X2=0}).  Also  the 
long-range  behaviour  of  the  vector  potential  does  not  permit  clear  separation  of  the 
complete  wave  function  into  the  incoming  and  scattered  part.  In  the  course  of  our  study 
we  shall  see  that  these  features  of  the  AB  effect  pose  problems  to  write  an  optical 
theorem  for  the  process. 
Consider  the  Schrodinger  equation 

(r)  =  0  (22) 

for  scattering  of  a  charged  particle  by  an  impenetrable  infinitely  long  thin  cylindrical 
solenoid  of  radius  R.  The  vector  potential  A  =  (0,  <$/2nr)  outside  the  solenoid  contain- 
ing the  magnetic  flux  $  lies  on  a  plane  perpendicular  to  the  axis  of  the  cylinder.  As 
already  noted,  there  can  be  two  types  of  scattering  solutions  for  (22).  In  the  following  we 
use  them,  one  by  one  to  look  for  an  appropriate  optical  theorem  for  the  AB  scattering. 

3.1  Wave  function  in  the  simply-connected  space 

This  wave  function  has  been  obtained  by  making  use  of  the  two-potential  formula  of 
Gellman  and  Goldbarger  [14].  Asymptotically,  i/^(r)  behaves  as 

'A-'/'AB  +  'Ac-  <23) 

Here 


AB 


i*(<t>  -  «)  +  orcoi*  + 


represents  the  asymptotic  wave  function  for  the  non-shielded  solenoid  [15].  The  effect 
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of  shielding  is  taken  care  of  by  the  second  term  in  (23)  and  we  have 


with 


The  marked  difference  in  the  structure  of  (24)  from  the  standard  scattering 
form  shown  in  ( 10)  may  be  attributed  to  the  inextricable  mixing  between  the  r; 
angular  variables  in  the  scattered  part.  However,  if  we  restrict  to  kr  sin2  0/2  » 
recast  it  in  the  form  of  (10)  to  get  the  scattering  amplitude 


as  obtained  by  Aharonov  and  Bohm  [16].  It  is  crucial  to  observe  that  the 
mation  sought  here  precludes  the  value  0  =  0.  This  forbids  to  write  an  optical 
for  the  AB  effect  by  using  the  wave  function  in  a  simply  connected  space. 

3.2  Wave  function  in  the  non-simply  connected  space 

The  wave  function  in  this  case  is  constructed  by  considering  (22)  in  the  cylindri 
coordinates  (r,  </>)  and  assuming  a  general  solution  in  the  form 


This  leads  us  to  the  differential  equations 
d2       1  d  (m  +  a)2 


and 


r  dr 


—  +m" 


for  the  radial  and  angular  parts  of  (28).  Using  the  Dirichlet  and  periodic  b 
conditions,  solutions  of  (29)  and  (30)  can  be  obtained  to  read 


and 


Ylm+xl(kR)-J[m+a[(kR)  Ylm+K{ 


d>m(0)  =  e''m*,    me/. 
In  analogy  with  (10),  we  choose  the  asymptotic  form  of  ^(r,  0)  as 


From  (28),  (31)  and  (33)  we  get 
1 

~' 
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(35) 
he  quantities  «5m(a)  and  Am  given  by 

<*«(«)  =  (M-|m  +  a|)|  '  (36) 

'  ,-« 

(37) 

for  the  phases  induced  by  the  vector  potential  A  and  the  impenetrable  cylinder, 

:tively. 

lations  (7),  (12),  (13)  and  (34)  can  be  combined  to  write 


<r£?  =  -  lim 


(38) 


ssult  for  I™  can  be  obtained  from  (14b)  and  we  have 

(39) 


>ears  that  (34),  (35)  and  (39)  together  with  the  asymptotic  forms 


,40b) 

o  the  optical  theorem  for  AB  scattering.  Unfortunately,  such  a  derivation  involves 

ous  logical  flaw.  This  can  be  seen  as  follows. 

e  asymptotic  forms  in  (40)  are  valid  only  for  m  «  kr.  This  inequality  is  violated  for 

tely  many  of  the  terms  in  a  Bessel  function  expansion.  Since  Am  +  <5m(a)  takes 

stant  value  as  |m|  -»•  oo,  use  of  (40)  in  (38)  gives  rise  to  a  divergent  er£f  .  This  does 

appen  for  the  ordinary  scattering  on  short  range  interactions  as  discussed  in  most 

3  text  books.  With  Henneberger  [17]  one  may  attempt  to  resolve  the  difficulty 

iated  with  the  AB  effect  by  controlling  of  the  incident  beam  t^inc  and  study  its 

cation  for  the  optical  theorem. 

e  partial  wave  expansion  for  i/>inc  =  elkrcos<t)  is  taken  in  the  form 

^inc  =  X'lm|^|m|^N(^)^,  (41) 

m 

e  the  small  positive  quantity  e  satisfies  e  »  l/kd  and  d  is  the  distance  from  the 
srer  to  the  detector.  In  this  case  one  obtains  the  scattering  amplitude  and  cross 
Dn  as 

y**  (42) 
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1        r  °°    p~\m\ 

/   2i( 

2.TIK, 


The  damping  exponential  e~|w|£  tends  to  remove  the  awkward  divergence 
resulting  from  the  use  of  (40)  and  thus,  in  principle,  one  could  write  an  optical 
for  the  process.  But  we  can  see  that  this  claim  will  also  be  based  on  in 
mathematical  treatment  of  the  problem. 
For  pure  AB  scattering  (Am  =  0),  (37)  and  (42)  can  be  combined  to  write  [ 


Interestingly,  (44)  can  be  reduced  to  the  standard  AB  scattering  ampli 
provided  one  agrees  to  drop  the  third  term  which  vanishes  for  angles  deviating 
forward  direction  by  an  amount  »  g.  This  is  in  agreement  with  the  obser 
Corinaldesi  and  Rafeli  [18].  However,  suppression  of  this  term  amoun 
violation  of  unitarity  for  the  5  matrix.  Thus,  it  is  not  possible  to  make  a  statem 
OT  by  using  (42)  and  yet  keep  the  well  known  AB  scattering  amplitude  unch; 
this  context  another  significant  point  to  note  is  the  following. 

The  two  wave  functions,  one  described  in  the  simply-connected  space  and 
the  non-simply  connected  space,  are  locally  gauge  equivalent.  Since  the  gau 
formation  does  not  change  the  physics  of  the  problem,  the  claimed  validity  of  ( 
will  contradict  the  conclusion  drawn  by  working  with  the  local  gauge  e< 
multiple  valued  representation.  A  more  instructive  step  to  dispense  with  the 
constructing  OT  in  the  present  instance  is  now  in  order.  For  example,  use  of  (^ 
and  (19)  leads  to 

<C=I    Z    *~2N£sin2(<5>)  +  AJ 

Km=  -co 

and 

<C  =  r    Z    *-H£sin2(<$>)  +  AJ. 

"•  m=  -co 

Since  <5m(a)  does  not  go  to  zero  as  |m|  -» oo,  the  results  in  (45)  and  (46)  do  not  a^ 
disagreement  prohibits  use  of  (13)  in  the  derivation  of  OT  from  the  CPF.  Thu: 
possible  to  provide  a  definitive  statement  for  the  OT  even  by  working  wi 
valued  wave  function  in  our  unified  approach. 

4.  Concluding  remarks 

The  optical  theorem  is  embedded  in  the  partial-wave  expressions  for  the  s 
quantities.  But  it  is  widely  believed  that  to  appreciate  the  physical  significar 
theorem,  the  OT  should  be  derived  directly  from  CPF.  We  have  followed 
point  to  construct  an  expression  for  the  OT  for  N-dimensional  potential  s 
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without  using  any  approximation  like  that  of  Boya  and  Murray  [21].  Being  mathemat- 
ically rigorous,  the  present  derivation  exhibits  the  causal  connection  between  the  OT 
and  CPF  in  a  more  general  context.  As  special  instances  of  our  general  expression,  we 
have  obtained  the  well-known  results  for  three-  and  two-dimensional  optical  theorem. 
The  3D  optical  theorem  is  often  physically  visualized  in  terms  of  shadow  scattering. 
The  same  is  found  true  for  the  ND  scattering. 

We  have  attempted  to  provide  a  statement  of  the  optical  theorem  for  the  Aharonov- 
Bohm  scattering.  Being  related  to  the  observable  effects  of  a  hidden  field,  it  is  very 
difficult  to  treat  this  process  by  the  methods  of  formal  scattering  theory.  Despite  that, 
we  worked  with  two  types  of  wave  functions  in  looking  for  an  appropriate  OT  for  the 
AB  process.  The  scattering  amplitude  for  the  wave  function  in  the  simply-connected 
space  is  not  defined  in  the  forward  direction  and  is  thus  unsuitable  for  deriving  the 
optical  theorem.  This  is  reminiscent  of  the  so-called  Coulomb  scattering.  For  the 
Coulomb  problem,  divergence  in  cross  section  and/or  scattering  amplitude  arises  from 
the  long-range  nature  of  the  interaction.  The  speciality  of  the  AB  process  is  that  there  is 
no  long-range  force  to  cause  divergence.  However,  the  long-range  vector  potential  can 
lead  to  divergent  scattering  cross  section  for  an  incident  beam  of  infinite  extent. 

Our  next  choice  of  the  wave  function  was  that  in  the  non-simply  connected  space. 
Here  the  vector  potential  is  gauged  everywhere  except  on  a  ray,  viz.,  the  negative  real 
axis  such  that  the  present  wave  function  is  related  to  that  in  the  simply  connected  space 
by  unitary  transformation.  Here  we  have  also  chosen  to  work  with  an  incident  beam  of 
finite  extent.  Even  with  all  these  modifications  ,a  rigorous  derivation  of  the  optical 
theorem  was  not  possible,  presumably  because  the  reflection  symmetry  of  the  scatter- 
ing interaction  is  broken  by  the  presence  of  the  noted  branch-cut.  We,  therefore, 
conclude  by  noting  that  one  requires  a  thorough  modification  of  the  scattering  theory 
to  deal  with  the  AB  process,  but  this  seems  to  be  far  from  clear. 
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Appendix  A 

The  integral  I^  in  (14b)  can  be  written  as 

/w  =  ^(/f>  +  /£W),  (Al) 


where 


o 


(A2) 


and 


P" 
£<">=       e-'^c°sfl'Cf/2)-1)(cos0Jsm"-201cos01d01  (A3) 

J  o 

=\     d0  I    sinN-302d02  j    sinN-403d03---       sin0w_2d0N_2.      (A4) 
Jojo  Jo  Jo 
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Using  the  formulae  [19] 


o  (-zTT(2v)r(n 

and 


in  (A2)  one  obtains 

^       -  2 
r 


Differentiation  of  (A2)  under  the  sign  of  integration  with  respect  to  kr 


d(fcr)  ' 

The  results  in  (A5)  and  (A6)  give 


Furthermore  [20], 


2 
2  ~2 


=  0. 
Thus,  (A7)  becomes 


508 


To  evaluate  the  integrals  in  (A4),  we  use  the  formula 

,P+l\    fq+1 

n/2  A    | )i 

sinl'0cos«0d0  = 
o 


and  find 

(A9) 


From  (Al),  (A5),  (A8)  and  (A9),  one  obtains  the  desired  expression  for  7<-V)  given  in  (15). 

References 

[1]  S  K  Adhikari,  Am.  J.  Phys.  54,  362  (1986) 

[2]  R  G  Newton,  Scattering  theory  of  waves  and  panicles  (Springer  Verlag,  New  York  1982) 

[3]  P  A  Maurone  and  T  K  Lim,  Am.  J.  Phys.  51,  856  (1983) 

[4]  Z  Y  Gu  and  S  W  Qian,  Phys.  Lett.  A136,  6  (1989) 

[5]  H  Jeffreys  and  B  S  Jeffreys,  Methods  of  mathematical  physics  (Cambridge  University  Press 
Cambridge,  U.K.  1956) 

[6]  M  Peshkin  and  A  Tonomura,  The  Aharonov-Bohm  Effect  (Springer-Verlag,  Berlin,  1989) 

[7]  Y  Aharonov,  C  K  Au,  E  C  Lerner  and  J  Q  Liang,  Phys.  Rev.  D29,  2396  (1984) 

[8]  B  Felsager,  Geometry,  particles  and  fields  (Odense  University  Press,  Copenhagen,  1987) 

[9]  I  Adawi,  J.  Math.  Phys.  12,  358  (1971) 
[10]  M  Fabre  de  la  Ripelle,  Ann.  Phys.  147,  281  (1983) 
[1 1]  M  Fabre  de  la  Ripelle,  Few-Body  systems  14, 1  (1993) 

[12]  LSRodbergand  R  MThalar,  Introduction  to  quantum  theory  of  scattering  (Academic,  New- 
York,  1967) 
[13]  J  J  Sakurai,  Modern  quantum  mechanics  (Addison  Wesley  Publishing  Co.  Inc.,  California, 

1985) 

[14]  M  Gell-Mann  and  M  L  Goldberger,  Phys.  Rev.  91,  358  (1953) 
[15]  G  N  Afnasiev  and  V  M  Shilov,  J.  Phys.  A22,  5195  (1989);  23,  5185  (1990) 
[16]  Y  Aharonov  and  D  Bohm,  Phys.  Rev.  115,  485  (1959) 
[17]  W<C  Henneberger,  Phys.  Rev.  A22, 1383  (1980);  J.  Math.  Phys.  22, 1 16  (1981) 
[18]  E  Corinaldesi  and  F  Rafeli,  Am.  J.  Phys.  46,  1 187  (1978) 
[19]  G  N  Watson,  A  treatise  on  the  theory  of  Bessel  functions  (Cambridge  University  Press, 

Cambridge,  1958) 
[20]  M  Abromowitz  and  I A  Stegun,  Handbook  of  Mathematical  Functions  (Dover  Publications 

Inc.,  New  York,  1972) 
[21]  L  J  Boya  and  R  Murray,  Phys.  Rev.  A50, 4397  (1994) 


Pramana  -  J.  Phys.,  Vol.  45,  No.  6,  December  1995  509 


.journal  ui  •  —   •-">  *••"•  «, 

physics  December  1 995 

pp.  511-517 


BD-FRW  models  in  the  framework  of  Israel-Stewart-Hiscock  theory 

KALYANI  DESIKAN 

Department  of  Mathematics,  Indian  Institute  of  Technology,  Madras  600036,  India 
MS  received  3  July  1995 

Abstract.  BD-FRW  universe  filled  with  imperfect  fluid  having  bulk  viscosity  is  investigated 
ander  the  framework  of  Israel-Stewart-Hiscock  causal  theory.  The  field  equations  haw  teen 
solved  by  using  the  relation  0  =  KR*  where  K  and  a  are  constants,  between  the  Brans-Dicke 
;calar  field  cp  and  the  scale  factor  R.  This  relation,  in  fact,  leads  to  a  constant  deceleration 
parameter  q.  It  is  shown  that  the  constancy  of  the  deceleration  parameter  permits  only  two 
possibilities  i.e.  either  H  =  constant  with  m  =  1  or  m  =  (1+  b  -  a)/(2(l  +b)~  a),  irrespective  of 
:he  value  of  e.  y 

Keywords.  Cosmology;  Brans-Dicke  theory;  Israel-Stewart-Hiscock  causal  ther- 
modynamics. 

PACSNo.    98-80 
I.  Introduction 

rhe  evolution  of  the  universe  includes  a  number  of  dissipative  processes.  The  role  of 
iissipative  effects  on  the  evolution  of  the  universe  has  been  investigated  by  many 
luthors  [1].  These  investigations  are  based  on  the  conventional  therm odynamic 
;heory  of  irreversible  processes  due  to  Eckart  [2].  Hiscock  and  Lindblom  [3]  showed 
:hat  Eckart's  theory  is  non-causal  (since  it  admits  dissipative  signals  with  super-luminal 
/elocities)  and  all  its  equilibrium  states  are  unstable.  These  problems  arise  due  to  the 
irst  order  nature  of  the  theory  i.e.  it  considers  only  first-order  deviations  from 
equilibrium.  The  neglected  second-order  terms  are  in  fact  necessary  to  prevent 
ion-causal  and  unstable  behaviour.  Alternatively,  second-order  theories  [4-6]  have 
?een  proposed  to  remedy  this  drawback.  Particularly  successful  is  the  so-called 
extended  irreversible  thermodynamics'  (EIT)  theory  [4,6].  However,  Hiscock  and 
;o-workers  [7, 8]  have  shown  that  most  versions  of  the  causal  second-order  theories, 
ncluding  EIT,  omit  certain  divergence  terms.  Hence,  the  resulting  "truncated"  evol- 
Jtion  equations  give  rise  to  very  different  behaviour  than  the  full  equations.  Therefore, 
:he  best  currently  available  theory  for  analysing  dissipative  processes  in  the  universe  is 
:he  full  Israel-Stewart-Hiscock  causal  thermodynamics. 

In  this  paper  we  wish  to  investigate  the  Brans-Dicke  [BD]-FRW  universe  filled  with 
mperfect  fluid  having  bulk  viscosity  under  the  frame-work  of  the  full  Israel-Stewart- 
ffiscock  causal  theory.  Most  of  the  well  known  BD-FRW  models  of  the  universe  with 
curvature  parameter  k  =  0  are  models  with  constant  deceleration  parameter.  Hence,  in 
:his  paper  the  field  equations  have  been  solved  by  considering  the  relation  4>  =  KR* 
)etween  the  scalar  field  <£  and  the  scale  factor  R.  This  relation  leads  to  the  constancy  of 
:he  deceleration  parameter. 
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2.  Field  equations 

The  field  equations  of  BD  theory  (with  c  =  1)  are 


Gnh  — r 


OJ 


'ab 


-  -r  *  aft  - 

1 

(f)    '0:b 


1 


and 


8?r 


r 


where  $  is  the  scalar  field  and  co  the  BD  coupling  parameter;  Tfl6,  the 
momentum  tensor  for  a  bulk  viscous  fluid,  is  given  by 

Tab  —  [p  +  p  +  n]!^!^  4-  [p  +  n]gai 

where  p  is  the  energy  density,  p  the  thermodynamic  pressure,  n  the  bulk 
pressure  and  ua  the  four  velocity  satisfying  the  condition  uaua  =  l. 
Considering  the  FRW  metric,  with  k  =  0 

ds2  =  dt2  -  R2(r)[dr2  +  r2(d02  +  s 


and  an  equation  of  state 

p  ~  yp,    Q^y^l 
the  field  eqs  (1)  and  (2)  reduce  to 


O) 


^ 


•R  + 


R 


_L    <*\    T  .       ^^ 

^R      2" 


(j)          87typ 


R. 

+  3—  (b  = 
V 


87: 


•[(l-3y)p-3n] 


while  the  continuity  equation  Tafc6  =  0  gives 


where  H  =  .R/.R  is  the  Rubble's  constant. 
The  causal  evolution  equation  for  the  bulk  viscous  pressure  is  given  by 


\          T     C 

where  T  is  the  temperature,  £  the  bulk  viscosity  co-efficient  and  T  the  relaxat: 
For  T  =  0,  (10)  reduces  to  the  non-causal  equation.  In  (10),  a  =  0  gives  the  ti 
theory  (extended  irreversible  thermodynamics  [BIT]),  while  £  =  1  gives  the  fu 
Stewart-Hiscock  theory. 


ew  of  the  complex  non-linear  character  of  the  BD  field  equations,  to  solve  these 
tions  we  assume  a  functional  relation  between  0  and  R  to  be  of  the  form 

</)  =  KRa  I11) 

•e  K  and  a  are  constants.  With  the  choice  (11),  (6)  to  (8)  reduce  to 

(12) 


0 

_^_^  =  (2  +  a)//+r3  +  ^  +  2a  +  a2V  (13) 

00  L       2  J 

—  [(1  -  3y)p  -  30]  =  (3  +  2a>)[a(a  +  3)tf  2  +  aJf).  (14) 

0 

inating  p  and  n  from  the  above  equations,  we  have 

H=-(l  +b)H2  (15) 


;     oxx  -  3  7*  0. 


2(coa-3) 

ition  (15)  shows  that  these  models  have  a  constant  deceleration  parameter  b. 
lentally,  most  of  the  well-known  perfect  fluid  models  in  general  relativity  and  BD 
ry  belong  to  this  category. 
Cations  (12-14),  (9)  and  (10)  imply  the  dynamical  equation  for  the  Hubble 

meter 

—  [(2  +  a)#0  +  [6(1  +  y)  +  coa2(l  - y]  +  2a(2  +  3y)  +  2a2]HH0]        (18) 
STI 


n  integration,  (16)  gives  the  exact  solution 
R(t)  = 

•e  C,  D  and  #  are  constants  of  integration. 

•0):  b*-l 

is  case  it  can  be  seen  that  the  Hubble  parameter  can  be  written  in  terms  of  the  scale 

>r  as 

~_D-d+b)  (20) 


TT  __ 


(l  +  b) 

Pramana  -  J.  Phys.,  Vol.  45,  No.  6,  December  1995  5  1  3 


H: 


Now  using  (21)  and  (22)  in  (18)  yields 


3(1  +  y)  +  (2  +  3y)a  4-  (1-y)—  +  «• 

2 


(2 


^  +  a2  | 


H2 


Further,  if  we  assume 


=  _     /     m  £  ,     m-i 

p  '  PP 

where  a'(  ^  0),  m(  >  0),  0(  ^  0)  and  r(  ^  0)  are  constants,  then  we  have 


Using  (25)  in  (23)  leads  to 

00 

Sn 


+  (2 


+ 
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Now  using  (24)  together  with  (12)  and  (15)  in  (26)  we  have 


H(2(l 


-I- 


I          n 

oa2^)1"'-1 


(27) 


From  (27)  it  can  be  seen  that  m  can  only  take  the  value 
_^    1+fe-g 

Now  (28)  can  be  rewritten  as 


(28) 


(29) 


For  models  of  the  expanding  universe  with  0  increasing  we  require  a  >  0  and  1  +  0  >  O 
this  constrains  m  not  to  lie  in  the  range  ^  m  <  1  . 
For  singular  models,  since  R(Q)  =  0,  (19)  leads  to 

D  _    D     ,l/(l+b) 

A-A0r         . 
From  (30)  and  (11)  we  have 


Using  (30)  and  (31)  in  (12)  yields 


P^TT^ 


f(a/l+b)~2 


2   J"  '  (32) 

From  (32)  it  can  be  seen  that  p  would  increase  or  decrease  with  time  according  as 


rr  __  ~u  I  3/1  _i_  ., 

87r(_         r  )~<~\-'r ->?)"•  '          "2 
From  (24)  and  (32)  with  m  =  (l  +  b-  a)/(2(l  +  6)  -  a)  we  get 


)  \r~'  .....  . 

(33) 


From  (34)  it  can  be  seen  that  £  would  increase  or  decrease  according  as 


or    a 


From  (35)  it  can  be  seen  that  the  behaviour  of  the  relaxation  time  T  is  independent  of  the 
scalar-field  <£  (i.e.,  the  power-index  a)  and  it  increases  with  time. 

Cose  (if):  b  =  —  1 
In  this  case  H  is  a  constant  (as  can  be  seen  from  (15))  given  by 

H-x  =  constant. 
From  (1 1)  and  (19)  we  have 


Using  (36)  and  (37)  in  (12)  leads  to 

o)y. 
+  3»-T 

Now  using  (36),  (37)  and  (38)  in  (13)  yields 


Equation  (18)  now  reduces  to 


Using  (38)  in  (24)  we  have 
f  =  A'exp(am%t) 
i  =  Fexp(a(m  - 


where 


,      .  3  +  3a-(coa2/2) 


2yn~lm 
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From  (24)  we  have 

T        £         t  6 

-~^--=-(\+ry-.  (43) 

T        C,        T  p 

Using  (38)  in  (43)  leads  to 

i     t      T 

—  -  —  —  =  —  (1  +  r)a/.  (44) 

T     C      T 

Using  (44)  in  (40)  yields 

3£#  +  ^Tnjt(3— (1 +r)a)  + FI  =0.  (45) 

From  (39),  (41),  (42)  and  (45)  we  see  that  m  can  only  take  the  value  1. 

4.  Conclusion 

Constancy  of  the  deceleration  parameter  permits  only  two  possibilities  i.e.,  either 
H  =  constant  with  m  =  1  or  m  =  (1  +  b  —  a)/(2(l  +b)  —  a),  irrespective  of  the  value  of  c. 
Incidentally,  most  of  the  well  known  perfect  fluid  models  in  general  relativity  and  BD 
theory  are  models  with  constant  deceleration  parameter.  It  is  interesting  to  note  that  in  the 
presence  of  bulk  viscous  pressure  there  could  be  exponential  expansion  in  BD  theory. 
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Abstract.  Using  Lexan  polycarbonate  plastic  as  the  fission  fragment  track  detectors,  the 
fragment  angular  distributions  have  been  measured  in  the  cases  of  fission  of  232Th  and  238U 
induced  by  alpha  particles  of  various  energies  ranging  from  40  to  70  MeV  obtained  from  the  88" 
variable  energy  cyclotron  at  Calcutta.  The  center-of-mass  angular  distributions  have  been 
calculated  and  fitted  by  a  series  of  Legendre  polynomials.  The  W(]&')/W(9Q°)  ratios  (defined  as 
tmisotropy)  were  measured  at  several  energies  for  both  the  targets.  These  data  are  utilized  in 
calculation  of  the  energy  dependence  of  K2,,  the  standard  deviation  of  the  distribution  in  the  angular 
momentum  projection  on  the  nuclear  symmetry  axis  at  the  saddle  point.  Values  of  I"}/!",,,  i.e. 
the  ratio  of  the  fission  width  to  neutron  emission  width  have  been  determined  for  232Th  and  238U 
nuclei.  The  integral  cross-section  for  alpha  induced  fission  in  each  target  was  determined  by 
numerical  integration  of  the  respective  center-of-mass  angular  differential  cross-sections.  The 
results  were  compared  with  similar  data  available  in  the  literature  which  served  to  resolve  some 
of  the  discrepancies  observed  in  earlier  measurements.  The  results  were  also  compared  with 
theoretical  cross-sections. 

Keywords,  Angular  distribution;  angular  momentum;  fission  cross-section;  alpha  particle; 
fission  width;  neutron  emission  width;  nuclear  track  detectors. 

PACS  No.     25-85 


1.  Introduction 

The  present  investigations  on  fission  fragment  angular  distributions  are  undertaken 
to  study  the  saddle  point  configuration  and  the  competition  between  fission  and 
neutron  evaporation.  Several  measurements  on  the  fission  cross-section  and  fragment 
angular  distribution  in  the  fission  of  a  number  of  nuclei  induced  by  a  variety  of 
projectiles  have  been  reported,  a  few  of  them  using  SSNTDs  (Solid  State  Nuclear 
Track  Detectors)  [1-7].  The  interpretation  of  the  observed  angular  distributions 
and  the  important  inferences  drawn  from  them,  are  all  primarily  based  on  the  model 
proposed  by  Bohr  [8].  The  underlying  idea  of  the  model  is  that  the  stretched  fission 
nucleus,  in  passing  over  the  saddle  point,  exhibits  quantum  states  similar  to  those 
observed  in  the  permanently  deformed  nuclei,  except  that  the  states  of  the  saddle 
point  nucleus  are  expected  to  be  quasi-stationary  since  the  nucleus  only  stays  a  while 
at  the  saddle  point.  Then  the  angular  distribution  of  the  fragments  depends  on  the 
available  quantum  states  characterized  by  /,  M  and  K,  where  /  is  the  total  angular 
momentum  of  the  nucleus,  M  is  the  projection  of  /  on  a  space  fixed  axis  (taken  as 
the  incident  beam  direction)  and  K  is  the  projection  of  /  on  the  symmetry  axis  of 
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M  is  usually  zero  for  particle  induced  nssion  [9J. 

Another  interesting  information,  the  angular  distribution  study  can  yield,  is 
about  the  dissipation  of  the  large  angular  momentum  brought  in  by  the  incident 
particle  into  the  fissioning  system.  The  simple  classical  model  of  Coffin  and  Halpern 
[3],  is  useful  for  letting  one  see  qualitatively  what  sort  of  insight  can  be  obtained. 
In  three  dimensions  one  would  observe  the  number  of  fragments  per  unit  solid 
angle  to  be  dN/sin9dO,  which  would  consequently  be  proportional  to  (sinfl)"1. 
The  angular  distribution  peaks  very  sharply  in  the  forward  and  backward  directions 
(figure  3).  Thus  if  a  major  fraction  of  the  'input'  angular  momentum  were  to  feed 
into  the  orbital  momentum  between  the  fragments,  then  one  would  observe  a  very 
large  anisotropy.  However  the  experimental  observations  point  out  to  a  much  less 
anisotropy.  It  is  expected  that  there  would  be  a  general  increase  of  the  anisotropy 
in  the  angular  distribution  of  the  fission  fragments  with  increasing  bombarding 
energy  and  hence  with  the  excitation  energy  for  a  particular  fissioning  species  [10]. 
The  simplest  way  to  extract  the  relative  orbital  angular  momentum  of  the  fission 
fragments  is  to  fit  the  angular  distribution  pattern  in  terms  of  a  series  of  Legendre 
polynomials 


A  weighted  least  squares  fitting  is  usually  done  taking  into  account  the  experimental 
uncertainties.  The  1 -value  corresponding  to  the  highest  angular  momentum  compo- 
nent with  a  non-zero  coefficient  gives  an  idea  of  the  number  of  units  of  angular 
momentum  carried  by  the  fission  fragment. 

From  the  angular  distribution  measurements,  the  value  of  KQ  is  evaluated  for  each 
compound  nucleus  as  a  function  of  excitation  energy  in  excess  of  the  fission  barrier.  The 
general  features  of  the  variation  of  anisotropy  with  projectile  bombarding  energy  can 
be  worked  out  by  the  theory  advanced  by  Bohr  [8]  and  amplified  by  Strutinsky  [11], 
Halpern  and  Strutinsky  [12],  Griffin  [13]  and  others.  According  to  this  theory,  the 
angular  distribution  W(B)  of  fission  fragments  emitted  per  unit  solid  angle  for  a  single 
fissioning  species  is  described  by  the  relation 

-  1/2 

(1) 


W(8)~  \dl  \dKf(KJ)    o_  „ 

where  0  is  the  direction  of  the  fission  fragment  with  respect  to  the  beam  direction,  /  is 
the  angular  momentum  of  the  compound  state  and  K  is  the  component  of  angular 
momentum  along  the  symmetry  axis. 

With  the  assumptions  (1)  that  /(K,  /)  is  a  product  of  functions  fK(K)  and/7(/),  (2)  that 
fK(K)^e\p(-K2/2K^)  and  (3)  that  ft(l)~I  up  to  some  limiting  value  7max,  the 
integration  of  (1)  results  in  an  expression  dependent  upon  KQ  and  /max.  Both  W(6)  and 
the  ratio  W(6*)/W(9Q°)  can  be  characterized  by  a  parameter  p  =  (ImJ2KQ)2  [1]  in 
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W(9Q°)  fp  .    x1/2exp(-x)J0(fx) 

CI.X  " 


erf(V2x) 

where  J0  is  the  zero-order  Bessel  function.  Calculations  based  on  this  expression  are  shown 
in  figure  3.  From  statistical  considerations,  coupled  with  Fermi  gas  model,  KQ  is  given  by 

is 2  r        T/fc2  d\ 

A.Q  =  J eff  I  /n  (j) 

where  T  is  the  nuclear  temperature  at  the  saddle  point  and  Jeff  is  the  effective  moment 
of  inertia,  defined  as 


•>.* 


where  J±  is  the  moment  of  inertia  perpendicular  to  symmetry  axis  and  J,,  is  the  moment 
of  inertia  parallel  to  the  symmetry  axis.  Further,  T  on  the  basis  of  Fermi  gas  model,  ijs 
given  by 

r/r*~|i/2 

(4) 

where  Esx  and  af  are  the  excitation  energy  and  nuclear  level  density  parameter,  both 
corresponding  to  the  saddle  point  configuration  of  the  fissioning  nucleus. 

Thus  Kl  turns  out  to  be  proportional  to  the  square  root  of  the  excitation  energy  Esx  at 
the  saddle  point.  /2nax  taken  to  be  equal  to  2  <72  >av  exhibits  a  nearly  linear  dependence 
upon  the  bombarding  energy  of  the  projectile.  Consequently,  because  the  parameter 
p  =  I^M/^KQ  controls  the  angular  anisotropy  W(9°)/W(9Q°),  it  is  expected  that  there 
would  be  a  general  increase  of  the  anisotropy  in  the  angular  distribution  of  the  fission 
fragments  with  increasing  bombarding  energy,  and  hence  with  the  excitation  energy  for 
a  particular  fissioning  species. 

2.  Experimental  details 

2.1  General 

The  energetic  alpha  particles  were  obtained  from  the  variable  energy  cyclotron  (VEC) 
at  Calcutta,  India.  The  collimating  system  restricts  the  diameter  of  the  beam  at  the 
target  to  <  2  mm.  The  beam  current  on  the  target  was  of  the  order  of  50  n  A.  The  total 
number  of  alpha  particles  striking  the  target  was  measured  by  a  Faraday  cup  (FC) 
equipped  with  a  secondary  electron  suppression  device.  The  connections  of  FC  were 
brought  out  and  fed  to  an  integrator.  The  details  of  experimental  procedure  and 
detector  assembly  can  be  found  in  our  earlier  paper  [4]. 

2.2  Target  preparation 

The  thorium  target  was  prepared  by  electrodeposition  having  thickness  =  200^g/cm2 
of  the  fissionable  material  (ThO2)  onto  a  3  mg/cm2  nickel  backing  foil.  The  uranium 
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target  was  prepared  by  vacuum  evaporation  of  238UO2  onto  a  2mg/cm2  Ni-backing 
foil.  These  targets  were  obtained  from  the  VEC  Target  Lab.,  Calcutta. 

2.3  Experimental  observation 

The  fission  fragment  track  densities  were  measured  for  thorium  and  uranium  targets 
for  angles  between  10C  and  80°  in  the  laboratory  system.  Data  collected  for  fissioning 
nucleus  were  converted  to  center-of-mass  coordinates  assuming  (1)  full  momentum 
transfer  of  the  incident  charged  particle  to  the  compound  nucleus,  (2)  equal  kinetic 
energy  for  all  fission  fragments  and  (3)  symmetric  fragment  mass  distribution.  These 
three  conditions  are  more  or  less  generally  satisfied  in  charged  particle  induced 
fission  of  actinide  elements  at  medium  energies  as  those  employed  in  the  present 
investigation.  The  kinetic  energy  release  in  the  center-of-mass  system  was  estimated 
from  the  relation  [14] 

Z2 


where  EK  represents  the  average  total  kinetic  energy  of  the  fission  fragments  before 
neutron  emission  and  Z  and  A  are  the  atomic  and  mass  numbers,  respectively,  of  the 
compound  nucleus. 

3.  Results  and  discussion 

The  analysis  of  the  data  essentially  consists  of  the  following  parts: 

(1)  least  square  fitting  of  the  center-of-mass  angular  distributions  by  a  series  of 
Legendre  polynomials,  to  draw  inferences  about  the  relative  orbital  angular 
momentum  of  the  fission  fragments. 

(2)  a  comparison  of  the  asymmetry  of  the  angular  distribution  for  different  targets  as 
a  function  Z2/,4,  to  assess  the  competition  between  fission  and  neutron  evapor- 
ation. 

(3)  determination  of  KQ  values  from  (5)  and  (6)  as  given  below  [1,  15,4a]: 

-' 


max 


i,       max         1     i    "  i  f\ 

+  ^pl  =  !  +  o  (6) 

5A.0  2 

(4)  measurement  of  the  total  fission  cross-section  by  integrating  the  measured  differen- 
tial cross-sections  and  comparing  the  results  with  theory  and  previous  data  to 
resolve  discrepancies  and 

(5)  calculation  of  Fy/F,,  from  CT//<TR  where  aR  is  the  total  reaction  cross-section  taken 
from  the  calculations  of  Igo  and  Huizenga  [16, 17]. 

The  difficulty  in  obtaining  angular  distributions  to  the  desired  accuracy  is  connec- 
tion with  a  background  of  small  tracks  in  the  plastic  track  detector,  which  was  observed 
in  the  forward  hemisphere.  Although  the  cause  of  such  a  background  is  not  yet  well 
understood,  normally  this  is  consistent  with  the  expectation  that  in  addition  to  fission 
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Fission  fragment  angular  distribution 
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Figure  1.  Photomicrographs  of  the  fission  fragment  tracks  in  Lexan  from  60  Me  V 
He-ion  induced  fission  of  232Th  (a)  without  covering  the  plastic  with  scotch  tape,  (b) 
after  covering  the  plastic  with  scotch-tape. 

fragments  these  would  appear  in  the  forward  hemisphere  due  to  some  radioactive 
recoils  from  nuclear  reactions  [3].  However,  these  background  tracks  caused  no 
difficulty  in  counting  the  fission  tracks.  In  one  particular  instance  where  this  back- 
ground did  become  serious,  the  simple  device  of  covering  the  plastic  detector  with 
scotch-tape  dramatically  reduced  the  background  without  appreciably  affecting  the 
fission  tracks.  Presumably  the  soft  gum  on  the  tape  fills  in  the  shallow  "pits"  on  the 
surface.  The  fission  tracks,  being  formed  much  deeper  into  the  surface,  were  not  very 
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Figure  2.    Angular  distributions  of  fission  fragments  from  the  helium-ion-induced 
fission  of  232Th  and  238U  at  different  incident  energies  of  helium  ion. 


plastic  detector  with  scotch-tape  and  (b)  corresponds  to  after  covering  the  plastic 
detector  with  scotch  tape. 

The  measured  track  densities  in  the  laboratory  frame  WL(B]  are  converted  into 
laboratory  differential  cross-section  (d<r/dfi)L  using  the  formula 


WL(ff) 


(7) 


where  QL  is  the  laboratory  solid  angle  subtended  by  the  unit  area  of  the  detector  over 
which  the  track  density  W^(B)  is  measured,  (f>  is  the  incident  alpha  particle  flux  and  N  is 
the  number  of  target  nuclei  per  unit  area.  The  laboratory  differential  cross-sections  are 
then  converted  into  center-of-mass  differential  cross-sections  d<r/dQ,  using  the  relevant 
transformation  equations  described  in  our  earlier  paper  [4].  The  relative  differential 
fission  cross-section  (d<r(00)/dQ)/(d<r(90°)/dQ)  or  angular  anisotropies  W(9°)/W(9Q°)  as 
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a  function  of  angle  6  were  deduced  for  thorium  and  uranium  targets  and  shown  in 
figure  2  for  different  energies. 

The  solid  and  dashed  lines  in  figure  2  for  232Th  and  238U  respectively  are  the  best  fit 
to  the  experimental  data  obtained  using'  Legendre  polynomials  with  terms  up  to 
P6(cos0)  with  coefficients  as  tabulated  in  table  1.  Coefficients  higher  than  A6  were 
found  to  be  statistically  not  significant  and  hence  not  included  in  the  table. 

The  dashed  line  in  figure  3  indicates  (sin  0)~ 1  variation  of  the  anisotropy  as  expected 
from  a  classical  model  in  which  all  the  angular  momentum  brought  in  by  the  incident 
particle  is  delivered  to  the  fission  fragments  and  appears  as  their  relative  orbital  angular 
momentum.  Mathematically  in  terms  of  Legendre  polynomials  the  variation  of  the 
anisotropy  is  expressed  as  [3] 


1 


H/(90C)     sine 


1-25P,  +  1-27  P.  +  1-27  Pf 


(8) 


Comparing  these  coefficients  with  the  experimentally  observed  coefficients  listed  in 
table  1,  one  can  see  that  higher  angular  momentum  components  in  the  experimentally 
observed  distributions  drop  off  rapidly  as  compared  to  those  in  (sin  0)~  *.  This  is  an 
indication  that  the  observed  relative  orbital  angular  momentum  of  the  fission  frag- 
ments is  much  smaller  than  the  actual  angular  momentum  brought  in  by  the  incid.ent 
particle. 

In  an  approximate  way,  one  may  expect  the  average  orbital  angular  momentum  of 
the  fission  fragments  to  be  given  by  the  1  value  of  the  highest  angular  momentum  term 
(with  statistically  significant  coefficient)  in  the  Legendre  polynomial  expansion.  On  this 
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Figure  3.  Calculated  angular  distributions  for  fission  fragments  obtained  by 
summing  over  a  classical  sharp  cut-off  distribution  of  I  (from  0  to  /  )  (Vaz  and 
Alexander  [1]).  max 
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basis  and  looking  at  table  1,  one  can  conclude  that  in  the  presently  studied  alpha 
induced  fission  of  232Th  and  238U,  the  orbital  momentum  of  the  fission  fragments 
generally  does  not  exceed  3h  while  the  average  incident  angular  momentum  is  about 
20/i.  The  difference  between  the  two  values  is  dissipated  into  the  formation  of  high  spin 
states  of  the  fission  fragments  as  well  as  into  collective  rotational  degrees  of  freedom 
such  as  rolling  friction  in  some  cases. 

The  experimental  anisotropy  may  be  conveniently  defined  as  W(\Q°)/W(9Q°)  and  is 
listed  in  table  1  as  a  function  of  the  parameter  Z2/A  for  the  two  fissioning  nuclei  at 
different  energies.  It  can  be  seen  from  table  1  that  the  anisotropy  W(IQ°)/W(9Q°) 
increases  with  increase  of  incident  particle  energy;  on  the  other  hand  W(W°)/W(9Q°} 
decreases  with  increasing  value  of  Z2/A  of  the  target  nucleus.  This  trend  is  generally 
observed  experimentally,  but  the  correlation  between  anisotropy  and  Z2//4  cannot  be 
described  from  the  theoretical  stand  point  as  fundamental  to  the  fission  process.  On  the 
other  hand,  it  is  mostly  likely  to  arise  from  fortuitous  effects  connected  with  neutron 
evaporation  before  fission. 

To  check  the  systematics  of  the  present  experiment,  we  deduced  the  value  of 
KQ  which  is  the  standard  deviation  of  the  angular  momentum  projection  on  the  nuclear 
symmetry  axis  in  the  saddle  point  configuration,  which  sensitively  controls  the  fission 
fragment  angular  distribution.  For  this  purpose  we  used  (5)  and  (6).  The  value  of 
K2,  from  (5)  matches  well  with  the  value  of  K2,  derived  from  (6).  For  calculation  the  value 
of  KQ  and  parameter  p  has  been  taken  from  the  figure  3  corresponding  to  the 
experimental  anisotropy,  which  has  been  measured  in  the  present  experimental  work. 
The  value  of  p  has  been  interpolated  from  figure  3  corresponding  to  our  experimental 
anisotropy  W(IQ°)/W(9Q°).  For  example,  in  case  of  alpha  induced  fission  of  232Th  at 
40  MeV,  we  have  taken  the  value  of  p  =±  1-1 1  (from  figure  3)  corresponding  to  the  value 
of  experimental  anisotropy  ^  1-54.  Values  of  /max  are  estimated  from  the  fusion 
cross-section  code  FRANPIE  Vaz  [18]  which  utilises  empirical  fusion  barriers  from 
Vaz  et  al  [19]  and  (for  high  energies)  the  critical  radius  approach  of  Galin  et  al  [20]. 
Values  of  7max,  p  and  KQ  are  shown  in  table  2  corresponding  to  incident  particle 


Table  2.  Values  of  /max  angular  momentum,  p-parameter  corresponding  to  aniso- 
tropy (W(10°)/W(90°))  and  K20  obtained  by  the  eqs  (5)  and  (6)  for  the  various 
energies  of  alpha  particles. 


Value  of  p-parameter 
Value       taken  from  figure  3 

Value  of  Kl 

Energy 

OJ  •'max 

(corresponds  to         W 

(0°)                  'max  , 

'max 

Target 

(MeV) 

(ft) 

anisotropy)           W( 

90°)         '  8KJ5 

l(6))    P     4K*(eq-(5)) 

232Th 

40 

19 

1-11 

82 

81 

50 

23 

1-26 

107 

105 

60 

26 

1-65 

103 

102 

70 

28 

1-80 

106 

109 

238U 

40 

19 

0-92 

98 

98 

50 

23 

1-00 

125 

132 

60 

26 

1-20 

141 

141 

70 

28 

1-30 

151 

151 
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energies  From  table  2,  it  can  be  noted  that  the  value  of  K 20  increases  with  incree 
particle  energy  and  the  value  of  K20  in  the  fission  of  (a  +  238U)  system  is  greater 
that  of  (a +  232Th)  system. 

The  integral  cross-section  for  fission  at  a  given  energy  of  the  projectile  af  ca 
determined  by  integration,  over  the  solid  angle,  as  follows  [5], 


where  <£  is  the  flux  of  bombarding  particles,  N  is  the  number  of  target  nuclei  per 
W(9Q~)  is  the  number  of  tracks  detected  at  90°  in  the  center-of-mass  coordir 
system,  Q  is  the  solid  angle  subtended  by  the  detector,  W(0)/W(9Q°)  is  the  cente 
mass  angular  distribution  for  the  particular  energy  of  the  charged  particle  involved 
the  integration  is  made  over  2n  steradians. 

The  experimentally  measured  cross-sections  are  listed  in  table  3  for  232Th  and  ' 
targets.  The  errors  associated  with  the  (a,/)  cross-sections  are  estimated  to  be  n< 
5%  for  thorium  as  well  as  uranium  target.  The  measurements  of  some  of  these  fis 
cross-sections  have  been  made  by  different  methods  by  different  authors.  The  resu 
those  measurements  are  shown  in  figure  4  for  comparison  with  the  present  data.  I 
be  seen  that  there  are  wide  discrepancies  in  the  fission  cross-section  measure 
different  authors  with  different  detectors.  In  case  of  (a  +  232Th)  system,  Jungen 
Hicks  et  al  [2  1  ,  22]  and  Ford  and  Leachman  [23]  measured  the  fission  cross-sectic 
to  only  42  MeV.  Ralarosy  et  al  [24]  measured  the  fission  cross-sections  at  diffc 
energies.  If  we  interpolate  the  data  of  Ralarosy  et  al  [24]  then  our  present  results  j 
to  confirm  the  experimental  results  of  Ralarosy  et  al  [24]  for  238U(a,/)  as  well  a 
232Th(a,/).  However,  in  the  case  of  238U(a,/),  the  measured  fission  cross-section 
found  to  be  less  than  that  measured  by  Kapoor  et  al  [2].  But  at  40  MeV,  our  r 
matches  well  with  the  results  of  Kapoor  et  al  [2],  Jungerman  [21],  Colby  et  al 
Viola  and  Sikkeland  [26]  and  Wing  et  al  [27].  In  both  cases,  232Th  and  238U 
measured  fission  cross-sections  are  found  to  be  very  close  to  the  total  optical  rr. 
reaction  cross-section  due  to  Blatt  and  Weisskopf  [28].  This  shows  the  predomin 
of  the  fission  process  in  actinide  elements  even  at  high  bombarding  energies. 

The  fission  cross-sections  presented  in  figure  4  and  listed  in  table  3  can  be  us( 
deduce  quantitative  information  on  the  ratio  of  the  partial  widths  for  fission 
neutron  emission.  The  steep  excitation  functions  (see  figure  4)  suggest  that  the  n 
ured  fission  cross-sections  are  mostly  due  to  first  chance  fission.  Therefore  the  fii 
with  rf  is  very  nearly  equal  to  the  fission  cross-section  af.  Charged  particle  emi; 
can  be  ignored  at  these  moderate  energies  and  therefore  the  neutron  emission  wid 
can  be  approximated  with  the  total  reaction  cross-section  aR.  Thus 


The  experimental  values  ofrf/Tn  as  a  function  of  incident  particle  energy  for  232T1 
238U  are  listed  in  table  3.  The  total  reaction  cross  sections  were  calculated  as  disci 
earlier.  From  table  3  one  can  notice  that  all  the  values  of  Tf/rn  are  nearly  equal  to 
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Figure  4.  Measured  fission  cross-section  at  different  alpha  bombarding  energies 
for  the  case  of  alpha  induced  fission  of  232Th  and  238U  and  other  author's  values  for 
the  same  energies. 
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Abstract.  The  first  relativistic  correction  of  order  a2  to  the  dipole  polarizability  of  a  hydrogenic 
ion  has  been  investigated  by  using  mean  excitation  energy  of  the  ion  within  the  second-order 
perturbation  theory.  The  density-dependent  mean  excitation  energy  is  estimated  via  Bethe 
theory  for  the  stopping  cross  section  for  a  moving  point  charge  interacting  with  the  hydrogenic 
ion.  In  this  approach  only  the  unperturbed  Dirac  wavefunctions  are  required  to  evaluate  the 
appropriate  matrix  elements.  The  first  relativistic  correction  turns  out  to  be  —(13/12)  (a  Z)2.  This 
has  the  correct  sign  and  is  within  5%  of  the  exact  result  which  is  —  (28/27)  (a  Z)2. 

Keywords.  Dipole  polarizability;  mean  excitation  energy;  completeness  relation;  stopping  cross 
section;  unperturbed  Dirac  wave  function. 
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1.  Introduction 

The  electric  dipole  polarizability  ad  of  an  atom  represents  the  simplest  linear  response 
of  the  electronic  charge  cloud  to  an  electric  field.  It  determines  the  most  important  term 
in  the  asymptotic  behaviour  of  electron-atom  interaction  in  the  low  energy  collisions 
[1].  In  electrostatics,  it  is  closely  related  to  dielectric  constant  [2]  and  it  is  also 
a  measure  of  the  Stark  shift  of  energy  levels.  In  optics,  it  is  useful  in  the  analysis  of 
refraction  and  scattering  of  light,  and  of  the  electrical  birefringence  [3].  It  is  of  interest 
to  know  the  relativistic  correction  due  to  Dirac  nature  of  the  electron  cloud.  This  can 
improve  the  accuracy  of  the  coefficient  in  the  intermolecular  potentials  [4, 5]. 
The  dipole  polarizability  is  given  by  [6] 

/',  £  rf^cosOMo]  (1) 

1  =  1  / 

where  <//0  is  the  unperturbed  ground  state  wave  function,  [j/(*}  is  the  first-order 
perturbed  wave  function,  M  is  the  number  of  electrons  in  the  system.  For  one-electron 
systems,  i/^1'  can  be  calculated  exactly  by  the  method  of  Dalgarno  and  Lewis  [7]  in 
which  the  perturbed  wave  function  satisfies  an  inhomogeneous  differential  equation. 
For  H  atom,  the  non-relativistic  value  of  ad  =  4-5.  When  the  perturbation  theory  is 
applied  for  the  relativistic  calculation  of  ad,  the  calculation  is  quite  complicated  even 
for  H  atom.  For  atoms  other  than  H,  or  for  a  Coulomb  potential  perturbed  by  other 
central  potentials,  the  calculation  becomes  severely  complicated. 

There  are  several  calculations  of  relativistic  corrections  to  ad  for  H  atom.  A  general 
theory  [8]  of  electromagnetic  polarizabilities  of  composite  system  described  by 
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infinite-component  wave  equation  has  been  developed  and  applied  to  the  relativistic 
H  atom  which  is  described  completely  by  an  O(4,  2)  infinite-component  wave  equation. 
The  method  of  Dalgarno  and  Lewis  [7]  has  been  used  [9]  to  obtain  the  relativistic 
ad  for  an  H  atom.  Several  calculations  using  the  method  of  Coulomb's  Green  function 
[10,  1  1]  have  been  performed  for  the  relativistic  calculation  of  ad  for  H  atom.  In  one 
calculation  [12],  only  the  Darwin  Hamiltonian  was  taken  into  account  which  pro- 
duced wrong  results.  Kaneko  [13]  showed  that  the  explicit  solution  of  the  perturbed 
Dirac  equation  is  not  necessary  to  calculate  the  exact  values  of  the  corrections. 

* 
2.  Theory 

By  expressing  ij/[1}  over  a  complete  set  of  unperturbed  eigenstates,  we  get 


The  summation  is  over  all  intermediate  states  including  the  continuum.  Equation  (2) 
requires  a  complete  knowledge  of  the  spectrum  of  the  atomic  system.  A  useful  approach 
is  to  replace  En  —  EQ  by  a  mean  excitation  energy  A,  and  making  use  of  the  complete- 
ness relation 


we  now  get 


a,,  =  — 


r?P?(cos 


If  A  is  taken  to  be  equal  to  the  ionization  potential  then  ad  comes  out  to  be  correct 
within  an  order  of  magnitude  [6]. 

In  the  present  work,  we  show  that  A  can  be  connected  to  the  electron  density  p  (/•)- 
The  stopping  cross  section  S  for  a  high-speed  point  charge  penetrating  through 
a  homogeneous  electron  gas  is  given  by  [14,  15] 

S  =  K—  In  A  (5) 

E 

where  K  is  a  constant,  E  is  the  energy  of  the  penetrating  charge  and 

f 
lnA=     d3rp(r)  In  ojp(r)  (6) 

*/ 

with  o)p(r)  being  the  plasma  frequency  of  the  medium  which  is  given  by 

cop(r)  =  (4np(r}2}112  (7) 

where  z  —  Z  —  N  +  1,  is  the  residual  charge  of  the  electron  gas  which  has  nuclear  charge 
Z  and  N  electrons. 

We  now  apply  (4)  with  the  definition  of  A  given  by  (6)  to  calculate  ad  for  a  hydrogenic 
ion  in  the  ground  state. 
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Keiatwistic  correction  oj  a  nyarogemc  ion 
We  can  now  write 

ao  =  3A<r2>-  (8) 

The  relativistic  charge  density  of  a  hydrogenic  ion  is  [16] 


47i(l  +£1 
where 


and 


a  is  the  fine-structure  constant. 

With  p(r)  given  by  (9),  we  obtained  all  the  relevant  integrals  analytically  [17]  and  got 
the  following  expressions  for  <r2>  and  In  A 

d 


2  ,„  A  . 


H 


(12) 


with  C  =  0-577215  •  •  •  as  the  Euler's  constant. 

Now  we  expand  y1,£l  and  their  functions  up  to  terms  of  order  (aZ)2,  after  some 
algebra,  we  obtain 


(14) 

^"L       u         J 
and 

A^%  *-r7  /  ,  2  <-y2    \  /•<  £*\ 

=  2Z2exp    --  +  -a2Z2    .  (15) 


We  finally  obtain  ad  correct  to  order  (aZ)2. 

e3/2T  ^3  -j 

JLJ        1  1  ^  I 

3.  Results 

The  first  term  in  (16)  yields  the  non-relativistic  value  with  the  correct  Z  dependence. 
Our  non-relativistic  value  of  H  atom  is  4-4817  against  the  exact  value  of  4-50.  The 
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second  term  gives  the  first-order  relativistic  correction.  Our  coefficient  13/12  is  within 
five  per  cent  of  the  exact  value  28/27  given  by  Kaneko  [13],  who  did  a  perturbation 
calculation  by  use  of  the  Dirac  equation.  The  negative  sign  indicates  relativistic 
contraction  of  the  mean  radius  of  the  electron  orbit.  Feiock  and  Johnson  [12]  obtained 
a  positive  sign  for  their  first-order  relativistic  correction  because  they  employed 
incorrect  first-order  perturbed  wave  function. 

Though  our  calculations  presented  here  are  much  simpler  than  other  calculations 
present  in  the  literature,  yet  they  yield  quite  accurate  results.  The  same  formalism, 
which  requires  only  the  unperturbed  density,  can  be  applied  to  Coulomb  potentials 
perturbed  by  more  complex  potentials. 
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Abstract.  Line  strength  S  and  radial  matrix  element  a  for  the  dipole  allowed  transitions  within 
n  —  2  complex  of  ions  in  the  Be  isoelectronic  sequence  have  been  fitted  in  the  forms 
Z2S  =  A  +  B/(Z  -  C)  and  Zff  =  A'  +  B'/(Z  -  C).  The  constants  A,  5,  C  and  A',  B',  C  have  been 
calculated  by  employing  a  non-linear  least  square  method.  The  relevant  data  for  S  and  o  have 
been  taken  from  calculations  which  includes  correlation  effects.  It  is  shown  that  the  fitted  yalues 
of  A  and  A1  are  in  excellent  accord  with  their  hydrogenic  values  (Z  ->  a)  provided  that  we  express 
the  zeroth-orderwavefunction  of  the  ground  state  Is2  2s2 1S  as  a  quantum-mechanical  admixture 
of  the  Hartree-Fock  (HF)  state  ls22s21S  and  the  near-degenerate  state  ls22p21S. 

Keywords.  Radial  matrix  elements;  line  strength;  oscillator  strength;  Hartree-Fock;  near- 
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1.  Introduction 

The  optical  oscillator  strength  between  the  two  bound  states  of  an  atomic  system  is  an 
important  atomic  quantity.  This  is  related  to  the  transition  probability,  refractive 
index,  the  dipole  polarizability  and  the  diamagnetic  susceptibility.  The  optical  oscil- 
lator strength  can  be  used  to  test  the  accuracy  of  the  generalized  oscillator  strength  [1]. 
Its  knowledge  is  useful  in  fields  like  astrophysics  [2]  to  compute  the  opacity  of  the 
stellar  envelopes  [3].  It  also  finds  application  in  fusion  research  [4]. 

Experimentally,  the  oscillator  strength  can  be  extracted  from  the  measurements  of 
transition  probabilities  by  beam-foil  [5]  and  beam-laser  [6,7]  techniques.  Various 
theoretical  methods  [8-11]  which  take  into  account  the  correlation  effects  can  be 
employed  for  its  calculations.  The  amount  of  correlation  depends  upon  the  size  and 
nature  of  the  basis  function  included  in  the  calculation.  If  the  values  of  oscillator 
strengths  for  few  ions  in  an  isoelectronic  sequence  of  an  atomic  system  are  known,  then 
an  interpolation  curve  based  on  perturbation  theory  may  be  used  to  extract  the 
information  about  other  ions  in  the  sequence.  The  perturbation  theory  is  based  upon 
1/Z  expansion  of  oscillator  strength,  where  Z  is  the  nuclear  charge  of  an  ion.  This 
method  is  valid  only  when  there  are  no  plunging  configurations  involved  for  an  ion  in 
an  isoelectronic  sequence.  An  interpolation  curve  based  on  the  expansion  of  Z2S  is 
more  suitable  as  it  does  not  involve  the  transition  energy.  In  the  limit  of  Z-»  oo,  the 
value  of  Z2S  approaches  a  finite  hydrogenic  value  which  can  be  trivially  calculated. 
However,  this  limit  is  appropriate  only  when  the  near-degenerate  effects  [12],  if 
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correlation  effect.  This  effect  is  present  in  the  ground  state  configuration  of  Be,  B,  C, 
N  and  O  isoelectronic  sequences  within  the  n  =  2  complex.  The  Z25  expansion  has 
been  studied  for  carbon  [13]  and  oxygen  [14]  isoelectronic  sequences  which  included 
the  effects  of  near-degeneracy.  The  effect  of  2s-2p  near-degeneracy  was  studied  for 
these  sequences  and  it  was  then  shown  that  the  correct  value  of  Z2  S  was  obtained  in  the 
limit  of  infinite  nuclear  charge.  In  the  present  work,  we  have  extended  the  Z2S 
expansion  for  Be  isoelectronic  sequence.  We  have  shown  that  the  correct  value  of  Z2S 
is  obtained  in  the  limit  Z  ->  oo  if  we  augment  the  HF  ground  state  Is2 2s2  *S  with  its 
near-degenerate  state  ls22p21S.  We  have  also  expanded  the  dipole  radial  matrix 
elements  based  upon  the  perturbation  expansion  method  for  the  dipole  allowed 
transitions  in  the  Be  sequence. 

2.  Theory 

The  length  form  of  line  strength  S  between  an  initial  bound  state  0  and  a  final  bound 
state  (f)'  is  given  by  [15] 

N 

where  summation  is  taken  over  all  the  electrons  of  an  atom,  rt  is  the  dipole  operator  for 
the  ith  electron,  LSn(L 'S' 'it'}  refer  to  the  total  angular  momentum,  spin  angular 
momentum  and  parity  of  the  initial  (final)  states.  Because  of  the  odd  parity  of  the  dipole 
operator,  only  states  of  opposite  parity  are  connected  by  the  dipole  operator. 
According  to  the  perturbation  theory  the  expansion  for  S  is 

In  terms  of  the  radial  matrix  elements,  the  perturbation  expansion  becomes 

7fr  —   V    J?  J7l  C%\ 

Z,<T  —   2_j  K-i/6  (J) 

where 

S  =  Acr2  (4) 

with  A  being  an  angular  factor  and  a  is  the  dipole  radial  matrix  element  between  an 
initial  and  the  final  bound  states.  The  infinite  series  expansion  of  (2)  and  (3)  can  be 
written  in  a  more  compact  fashion  as 


It  is  trivial  to  note  that  S0  =  A,S1  =  B,  Sj  =  B(CJ'~  *)  withy  ^  2.  In  other  words,  we  have 
replaced  the  summation  in  (2)  with  a  two-term  expansion.  The  constant  C  may  loosely 
be  interpreted  as  a  screening  constant  and  B  accounts  for  the  residual  correlation  left  over 
and  above  the  HF  correlation  and  the  internal  correlation.  The  same  interpretation  can 
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be  given  to  the  constants  B'  and  C.  Equation  (5)  was  originally  suggested  by  Edlen 
[16].  The  constants  A,  B,  C  or  A\  B',  C  were  obtained  by  fitting  the  assumed  forms 
given  by  (5)  or  (6)  to  the  known  values  of  Z2S  or  Zo  by  employing  a  non-linear  least 
square  method.  The  details  of  this  procedure  have  already  been  given  in  [14].  In  this 
reference  there  is  a  slight  misprint  in  (8).  There  should  be  a  minus  sign  in  the 
denominator  in  front  of  the  square  brackets.  Here  we  have  studied  the  transitions 
ls22s2lSe-*\s22s2p1P°,  ls22s2p3P0-*ls22P23Pandls22p21S-^ls22s2p1P°inthe 
Be  isoelectronic  sequence  for  ions  in  the  range  Z  =  6  to  10,  for  which  data  for  atomic 
opacity  calculations  was  available  [17].  These  calculations  involve  correlation  effects 
within  and  outside  the  n  =  2  complex.  We  have  also  used  the  data  of  Cheng  et  al  [18] 
for  the  resonant  transition  Is2  2s2  1S->  Is22s2p  1P°  for  ions  up  to  Z  =  26  which  includes 
only  the  internal  correlations.  These  calculations  were  done  in  j-j  coupling  scheme  by 
employing  multi-configuration  Dirac-Fock  (MCDF)  technique  [18].  The  MCDF 
wavefunctions  are  combinations  of  Slater  determinants  constructed  from 
ls1/2,2s1/2,2p1/2,2p3/2  orbitals  to  form  eigenfunctions  of  total  angular  momentum 
J  and  parity  n.  We  truncated  the  data  at  Z  =  26  because  for  heavier  ions  relativistic 
effects  begin  to  become  appreciable. 

The  values  of  constants  for  the.  transitions  studied  here  are  presented  in  tables  1-3. 
We  have  also  calculated  the  ab-initio  values  A0  and  A'Q,  which  correspond  to  the 
values  of  Z2S  and  Zcr  respectively  in  the  limit  Z->  oo.  In  this  limit  the  nuclear  term 
dominates  over  electron-electron  repulsion  term  and  the  HF  orbitals  assume  their 
hydrogenic  character.  For  the  ls22s21S  ground  state,  near  degenerate  effects  are 
present  due  to  the  promotion  of  two  '2s'  electrons  into  the  two  '2p'  electrons  giving 
rise  to  the  near-degenerate  state  ls22p21S  which  strongly  interacts  with  the  HF 
state  ls22s21S.  In  the  limit  Z-+oowe  can  write  the  ground  state  wavefunction  as 
a  quantum  mechanical  admixture  of  the  HF  state  ^>1  and  the  near-degenerate  state 
<>  i.e. 


S)  +  c2<Mls22p21S).  (7) 

The  expansion  coefficients  c1?  c2  can  be  calculated  trivially  by  diagonalizing  the 


Table  la.    The  values  of  parameters  of  eq.  (5).  A0  is  the  ab-initio  value 
ofZ2SwithZ->oo. 


Transition  AQ  A  B  C  f 

ls22s21S^  38-495     38-714     145-418      2-353      l-OE-03 

ls22s2p1P° 


Table  Ib.    The  values  of  parameters  of  eq.  (6).  A'0  is  the  ab-initio  value 
of  Za  with  Z->oo. 


Transition 

4, 

A 

B' 

C 

f 

ls22s21S-+ 
ls22s2p1P° 

-4-242 

-4-271 

-7-561 

1-788 

2-2E-05 

Pramana  -  J.  Phys.,  Vol.  45,  No.  6,  December  1995  539 


Table  2a.    The  values  of  parameters  of  eq.  (5).  AQ  is  the  ab-initio  value 
ofZ2SwithZ-->oo. 

Transition  A0  A  B  C  ~t 

ls22s21S->  38-495      39-250      134-461       3-296      5-4E-15 

ls22s2p1P° 

ls22p21S-+  33-504     34-194      102-529       3-318      1-7E-15 

ls22s2p1P° 


Table  2b.    The  values  of  parameters  of  eq.  (6).  A'0  is  the  ab-initio  value 
of  Zff  with  Z->ac. 

Transition  A'Q  A'  B'  C  %2 

ls22s21S-»  -4-242      -4-308      -6-929     2-765     l-OE-16 

ls22s2plP° 

ls22p21S-*  -4-242      -4-304      -6-127     2-836     4-7E-17 

ls22s2p1P° 


Table  3a.    The  values  of  parameters  of  eq.  (5).  AQ  is  the  ab-initio  value 
ofZ2S  withZ-»oc. 

Transition  A0  A  B  C  "/? 

ls22s2p3P-^  162-0      156-657     691-283       2-407      6-7E-15 

ls22P23P 


Table  3b.    The  values  of  parameters  of  eq.  (6).  A'Q  is  the  ab-initio  value  of  Zo  with 

Z->-oo. 

Transition  A'Q  A'  B'  C  %2 

ls22s2p3P°-*  -5-196        -5-143        -10-597        1-733        2-53E-16 

ls22P23P 


Hamiltonian  in  this  two-basis  set.  We  solve  the  secular  equation 

Ec£(HJ,-£y  =  0,    j=l,2  (8) 

!=  1 

where  H£j  is  a  typical  Hamiltonian  matrix  element;  and 


with  the  normalization  condition 

c2  +  q  =  l.  (10) 
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Using  Slater's  approach  [17],  we  can  write  down 

Hii-#22  =  2J(2s,2s)  +  4F0(ls,2s)-2G0(ls,2s)  +  F°(2s,2s) 

-  2/(2p,  2p)  -  4F°(ls,  2p)  -  F°(2p, 2p)  -  0-4F2(2p,  2p) 
+  0-66666667G1(ls,2p) 


The  J(nl,nl),  F*(nl,nT)  and  G*(nl,nT)  are  the  one-  and  two-electron  Slater  integrals 
[13].  Using  the  hydrogenic  values  of  these  integrals  we  get 

0(ls22s2  XS)  =  -  0974321(£1(is22s2 1S)  +  0-225 16502(ls22p2 1S)  (1 1) 

and 

</)(ls22p21S)  =  0-974321(/)1(ls22p21S)  +  0-22516502(ls22s21S).  (12) 

It  is  now  possible  to  write  the  line-strength  S  as 

S0(LS->  L'S')  =  c21S1  +  c22S2  +  2c1c2x/S1S2  (13) 

where 

51  =  |<01(ls22s21S)|r|^3(ls22s2p1P°)>|2  (14) 

52  =  |<(/>2(ls22P215)|r|03(ls22s2P1P0)>|2.  (15) 

The  angular  factor  A  for  each  transition  is  calculated  using  the  standard  Racah  algebra 
[19].  The  angular  factors  for  various  transitions  are 

A(ls22s21S->ls22s2p1P°)  =  2 
A(ls22p215-^ls22s2p1P°)  =  2/3 
A(ls22s2p3P°->ls22p23P)  =  6. 
For  the  mixed  state 

A(Cl{ls22s21S}  +  c2{ls22p21S}-+  Is2  2s2p1P°)  =  2-138636414 
A(c1{ls22p21S}4-c2{ls22s21S}-^ls22s2p1P°)=  1-861363589. 
The  radial  matrix  elements  are  given  by 


Z<r(2s  -» 2p)  =        P2S(r)P2P(r)r  dr  =  -  ^27.  (16) 

Jo 

The  zeroth  order  multiple!  strengths  for  various  transitions  in  the  hydrogenic  limit  are 

Z2S1(ls22s21S-»ls22s2p1P°)  =  54 

Z2S2(ls22p2  lS->  ls22s2p1P°)  =  18 

Z2S0(ls22s2P3P°-^  Is2 2p23P)  =  162-0. 
We  finally  get 

Z21S0(ls22s21S->ls22s2p1P°)  =  38-485 

Z2S0(ls22p2  ^-^  ls22s2p1P°)  =  33-504. 
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3.  Results  and  discussion 

We  obtained  excellent  fits  for  Z2S  and  Zcr  data  of  Cheng  et  al  [18]  from  their  assumed 
forms  given  by  (5)  and  (6)  for  the  values  of  Z  up  to  26.  From  tables  1  a  and  1  b,  we  notice 
that  for  the  transition  Is2 2s2  1S-+  ls22s2p1F°  the  fitted  value  A(=  38-7 146)  is  in 
excellent  accord  with  the  corresponding  ab-initio  value  A0(  =  38495).  However,  if  the 
near-degeneracy  effect  is  omitted,  then  the  HF  value  ( =  54-0)  for  this  transition 
becomes  higher  than  the  ab-initio  value  by  about  40%.  The  value  of  A'  which  refers  to 
the  value  of  Za  in  the  limit  of  Z-»oo  for  this  transition  is  —4-271  which  is  again  in 
excellent  accord  with  the  ab-initio  value  —4-242.  Its  corresponding  HF  value  is 
-  5-196  which  is  higher  than  the  ab-initio  value  by  about  22%. 

In  table  2a,  we  have  displayed  the  values  of  fitted  and  ab-initio  constants  for  the 
resonant  transition  and  the  transition  1  s2  2p2 !  S  -*  1  s2  2s2p l  P°  where  we  have  used  the 
opacity  data  [17]  which  is  given  only  up  to  Z=  10.  For  the  resonant  transition  our 
fitted  value  is  once  again  in  good  agreement  with  the  ab-initio  value.  For  the  transition 
ls22p21S-»ls22s2p1?0,  our  fitted  value  of  34-194  differs  from  the  ab-initio  value  of 
33-504  by  about  2%.  These  values  (ab-initio  and  fitted)  are  almost  twice  the  HF  value 
which  is  18.  From  table  2b,  we  notice  that  the  agreement  between  A'0  and  A'  with  their 
respective  ab-initio  values  persist  for  both  the  transitions.  These  ab-initio  values  differ 
from  their  corresponding  HF  values  of  -  5-196  by  22%. 

We  have  also  carried  out  calculations  for  the  transition  ls22s2p3P°->  ls22p23P 
using  the  opacity  data  [1 7].  In  this  transition  there  is  no  near-degeneracy  effect  present 
and  so  it  is  expected  that  the  fitted  value  of  A  and  A'  would  be  in  good  agreement  with 
the  single-configuration  value.  From  table  3a  we  notice  that  the  fitted  value  of  A  is 
156-657  which  is  in  good  agreement  with  the  HF  value  of  162-0.  Similarly,  in  table  3b 
the  value  of  A'  is  -  5-143  which  differs  from  its  ab-initio  value  of  -  5-196  by  about  1  %. 

We  conclude  from  our  present  study  that  both  the  fitted  forms  of  Z2S  and  Z<r  give 
a  good  description  of  trends  in  S  and  a  along  an  isoelectronic  sequence  if  the 
near-degenerate  effects  are  included  in  the  interpretation  of  A  or  A'  values. 
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Abstract.  Dynamics  of  a  wavepacket  on  a  model  fractal  surface  has  been  studied  by  solving  the 
pertinent  time  dependent  Schrodinger  equation.  Spatial  and  temporal  behaviour  of  charge  and 
current  densities  and  a  local  chemical  potential  for  two  different  fractal  lattices  have  been 
considered.  Important  insight  into  the  dynamics  has  been  obtained  through  time  dependence  of 
various  quantities  like  macroscopic  kinetic  energy,  global  current,  Shanon  entropy,  density 
correlation  and  global  chemical  potential.  This  study  would  be  helpful  in  simulating  adsorption 
and  catalysis. 

Keywords.    Fractal  surface;  spatio  temporal  variations. 
PACS  Nos    05-45;  47-53;  34-10;  82-65 

1.  Introduction 

One  of  the  commonly  used  methods  of  studying  dynamical  problems  is  to  follow  the 
time  evolution  of  a  wavepacket  through  numerical  integration  of  the  pertinent  time 
dependent  Schrodinger  equation  (TDSE).  Different  variants  of  this  method  success- 
fully accounted  for  various  time  dependent  processes  like  chemical  reactions  [1,2], 
collision  induced  dissociation  [3,4],  gas-surface  scattering  [5],  photodissociation  [6], 
photoabsorption  and  emission  [7],  Raman  scattering  [8],  atom  diffraction  by  surfaces 
[9],  etc.  An  excellent  review  [10]  on  this  subject  has  also  been  published.  In  recent  years 
wavepacket  dynamics  has  been  extensively  used  [11-16]  in  understanding  quantum 
domain  behaviour  of  a  classically  chaotic  system.  Since  any  chaotic  system  is  geometri- 
cally characterized  [17]  by  its  scale  in  variance  property  a  detailed  knowledge  of  the 
corresponding  fractal  [18]  behaviour  is  important.  This  study  is  supposed  to  provide 
[19-22]  a  connection  between  structural  and  dynamical  information.  As  has  been 
pointed  out  [23]  the  relevance  of  fractal  behaviour  is  likely  to  be  manifested  in 
heterogeneous  reaction  dynamics  which  has  been  explicitly  demonstrated  [24]  in 
a  study  of  catalytic  properties  of  solid  surfaces.  Another  way  is  to  simulate  a  catalyst  or 
any  disordered  surface  in  general  as  a  fractal  surface,  e.g.  the  diffusion-limited  aggregate 
[25-27],  devil's  staircase  [28]  or  a  Cantor  set  [28]  like  structure.  In  this  article  we  have 
solved  the  pertinent  TDSE  to  study  the  dynamics  of  a  wavepacket  on  a  model  fractal 
lattice.  While  methodology  is  discussed  in  §  2,  a  brief  outline  of  the  numerical  solution  is 
given  in  §3.  In  §4  the  results  and  discussion  are  presented.  Finally,  §5  contains  some 
concluding  remarks. 
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2.  Methodology 

The  time  dependent  Schrodinger  equation  for  a  single  particle  moving  under  some 
external  potential  can  be  written  as  (in  a.u.) 

(1) 


.  . 

Dynamics  of  a  single  particle,  simulated  by  a  wavepacket,  on  a  model  fractal  lattice  can 
be  studied  by  solving  (1).  The  external  potential  v(r,  t)  arising  from  a  fractal  lattice  of 
embedding  Euclidean  dimension  two  has  been  formulated  as, 


where  (a^b^  denote  the  Cartesian  coordinates  of  a  point  on  the  fractal  lattice 
which  acts  as  an  attractive  Coulomb  source  of  unit  charge.  The  points  {(aitbj)}  are 
obtained  as  follows.  Start  with  a  conventional  Cantor  set-like  fractal  lattice  of 
dimension  Iog2/log3.  Take  the  interval  (0, 1),  divide  it  into  three  equal  segments  and 
delete  the  central  one  (1/3,  2/3).  The  process  is  continued  until  the  third  generation 
lattice  with  16  lattice  points  is  obtained.  A  two-dimensional  extension  to  this  lattice 
has  been  generated  by  replacing  each  point  of  this  horizontal  Cantor  lattice  by  a  similar 
vertical  Cantor  lattice  with  an  overall  fractal  dimension  [18]  21og2/log3.  Finally 
these  points  are  normalized  to  have  the  lattice  points.  We  term  this  lattice  as  lattice  I. 
For  comparison,  another  lattice  of  dimension  21og3/log5  has  been  constructed  in  its 
second  generation  with  18  lattice  points.  This  lattice  is  termed  as  lattice  II.  It  may  be 
noted  that  external  potential  given  in  (5)  does  not  contain  any  time.  For  a  real  system 
explicit  time  dependence  may  come  from  the  relative  motion  between  the  target  and 
the  projectile  and/or  the  oscillation  of  the  lattice.  Thus,  the  dynamics  in  the  present 
study  has  only  implicit  time  dependence  arising  from  the  temporal  evolution  of  the 
wavepacket. 

3.  Numerical  solution 

The  numerical  solution  is  launched  with  the  following  initial  and  boundary  conditions 

i     r_i_  2 

=  0    V;M 
MX,  ±oo,0  =  0    Vx,t.  (3) 

The  TDSE  (1)  is  solved  using  an  alternating  direction  implicit  (ADI)  finite  difference 
algorithm  [29]  with  the  above  initial  and  boundary  conditions.  Spatial  and  temporal 
mesh  sizes  adopted  here  are  Ax  =  Ay  =  0-097  and  At  =  0-01.  Spatial  grid  has  been 
chosen  in  such  a  way  that  it  avoids  singularity  associated  with  the  Coulomb  potential 
in  (2).  Each  ADI  cycle  comprises  two  steps,  i)  substitute  all  x  derivatives  by  Cranck- 
Nicolson  analogues  and  y  derivatives  by  explicit  scheme.  Therefore  the  finite  difference 


T_     £.     T\ I 


analogue  (FDA)  of  (1)  at  a  mesh  point  (xt,yj  becomes 

where,  '""  ^  ' 


and 


4  dx2      2  dy2  ' 

Explicit  FDA  of  derivatives  in  d,  of  (4)  are  used;  (ii)  in  the  next  step  Cranck-Nicolson 
explicit  sequence  is  reversed  and  the  corresponding  FDA  of  (1)  becomes 


where 


4(Aj/) 


and 

d  =iA" 


In  order  to  follow  the  dynamics  we  have  also  calculated  the  following  quantities: 

a)  charge  density  given  by 

b)  current  density  given  by 

c)  global  current  defined  as 

f 
J=  \\J(r,t)\p(r,t)dr, 

J 

d)  Shanon  entropy  as 

r 

S=-k    p(r,t)lnp(r,r)dr, 

J 

k  being  the  Boltzmann  constant, 
5)    density  correlation  as 

p(r,0)p(r,*)dr, 
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g)    local  chemical  potential  defined  as  [30] 

VpV      2VV 


/<(r.  0  = 

up 

and 

h)    chemical  potential  as, 


H(t)=  \n(r,t)p(r,t)dr. 


+^ii+, 


4.  Results  and  discussion 

Figures  la  and  Ib. depict  the  spatial  behaviour  of  external  potentials  associated  with 
two  different  fractal  lattices  (I  and  II).  Local  extrema  in  these  plots  are  different  both  in 
magnitude  and  in  position. 

Time  evolution  of  charge  density  profile  is  presented  in  figures  2a  to  2e.  Initial 
density  profile  (figure  2a)  gets  decomposed  and  dispersed  in  course  of  time  over  lattice 
I  (figures  2b  and  2c,  at  f  =  0-16  and  1  respectively)  and  lattice  II  (figures  2d  and  2e,  at 
i  =  0- 1 6  and  1  respectively).  Although  the  initial  decomposition  patterns  (figures  2b  and 
2d)  are  different  in  two  lattices,  the  long  time  behaviour  (figures  2c  and  2e)  is  similar  but 
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Figure  la. 
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(b) 


Figure  1.  Spatial  profile  of  external  potential  for  propagation  of  a  wavepacket  on 
model  fractal  lattices.  The  basal  square  mesh  designates  -  3  ^  x  ^  3  and 
-  3  ^  y  <  3  (a)  Lattice  I;  (b)  Lattice  II. 


(a) 


Figure  2a. 
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(c) 


Figure  2b-c. 


to  move  the  wavepacket 

dvnamiVc  r,f  th0  "  i  "^u  *"  l^Q  ^ractal  lattice  points. 

dynamics  of  the  wavepacket  ,s  governed  by  an  interplay  of  these  kinetic  and 
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Dynamics  on  fractal  surface 


(e) 


Figure  2.    Spatial  profile  of  density  (a)  t  =  0;  (b),  (c)  lattice  I,  t  =  0-16,  1  and  (d),  (e) 
lattice  II,  t  =  0-16,  1.  See  caption  of  figure  1  for  details. 
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potential  effects.  Potential  effect  is  dominant  at  the  beginning  and  the  wavepacket  is 
mostly  localized  around  coulomb  attractors.  As  time  progresses  the  kinetic  effect  takes 
over  and  the  probability  of  getting  the  particle  maximizes  (albeit  unequal)  in  two  zones 
near  the  edges  of  the  mesh  and  away  from  coulomb  centres. 
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(d) 


lattice  II  t-0-lrf  f  s       ^^ensity  (a),  (b)  lattice  I,  ^0-16.  I  and  (c),(d) 
lattice  11,  t-  o-l 6,  l.  See  caption  of  figure  1  for  details. 
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difference  is  reduced  as  time  progresses.  Out  of  the  two  accessible  zones  the  probability 
of  getting  the  particle  is  more  where  the  current  density  is  higher. 

The  most  intricate  dynamical  behaviour  is  revealed  through  the  plots  of  local 
chemical  potential  (figures  4a  to  4e).  It  starts  from  a  cot-like  structure  at  t  =  0.  Since  in 


(b) 
Figure  4a-b. 
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the  present  problem  v  does  not  vary  with  time  local  chemical  potential  is  plotted 
relative  to  v  in  figure  3.  Local  chemical  potential  gets  concentrated  at  a  specific  site  and 
its  base  broadens  in  course  of  time.  This  site  is  around  one  of  the  two  zones  discussed 
above.  High  positive  values  of  fj.  here  reveal  that  energy  of  the  system  increases  on 
charge  accumulation  and  hence  the  lower  current/probability  zone  does  not  like 
further  charge  transfer  towards  it. 

Figures  5-9  depict  respectively  the  time  variations  of  global  current,  Shanon 
entropy,  density  correlation,  macroscopic  kinetic  energy  and  chemical  potential.  In 
each  figure  two  different  curves  refer  to  lattices  I  and  II  respectively.  As  expected, 
although  the  numerical  values  differ  the  overall  behaviour  of  a  given  quantity  remains 
same  in  two  lattices  in  all  figures.  Global  current  increases  to  a  maximum  value  around 
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Figure  4c-d. 
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Figure  4.    Spatial  profile  of  local  chemical  potential  (a)  t  =  0;  (b),  (c)  lattice  I, 
t  =  0-16,  1  and  (d),  (e)  lattice  II,  t  =  0-16,  1.  See  caption  of  figure  1  for  details. 
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Figure  6.    Temporal  evolution  of  Shanon  entropy  (a)  ( )  lattice  I,  (b) 

lattice  II.  See  caption  of  figure  1  for  details. 
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Figure  8.    Temporal  evolution  of  macroscopic  kinetic  energy  (a)  (  -  )  lattice  I, 
(b)  (  ---  )  lattice  II.  See  caption  of  figure  1  for  details. 
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Figure  9.    Temporal  evolution  of  chemical  potential  (a)  ( )  lattice  I,  (b)  (- 

lattice  II.  See  caption  of  figure  1  for  details. 
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=  0-2  where  kinetic  energy  also  becomes  maximum.  Density  correlation  starts 
Creasing  from  an  initial  value  unity.  At  around  t  =  0-2  it  goes  to  zero  and  system  loses 
;  correlation  with  its  initial  state.  At  t  =  0,  Shanon  entropy  has  the  value  /c(l  +  In  TT), 
being  the  Boltzmann  constant.  It  starts  oscillating  with  time  and  a  local  maximum  in 
;tropy  (a  measure  of  disorderliness)  slightly  before  t  —  0-2  goes  along  with  maxima  in 
obal  current  and  KE  and  zero  value  of  correlation.  However,  the  minimum  in 
tropy  around  t  —  0-2  is  apparently  not  in  conformity  with  extremal  behaviour  of 
her  quantities  at  that  time  step.  Entropy  then  starts  increasing  and  a  broad  maximum 
it,  zero  value  of  correlation  and  very  small  values  for  current  and  KE  are  internally 
insistent.  Global  chemical  potential  starts  from  a  negative  value  (  —  360-28  and 
466-43  for  lattices  I  and  II  respectively  at  t  =  0  which  are  not  conspicuous  in  the 
;ures)  and  increases  rapidly  to  a  maximum  positive  value  around  t  =  0-2.  Then  it 
irts  decreasing  and  passes  through  a  broad  minimum.  Since  its  extremal  characteris- 
;  as  a  function  of  time  resembles  that  of  current  density  it  can  be  treated  as  a  useful 
dex  for  studying  dynamical  problems.  This  quantity  contains  information  of  the 
rrect  time  of  charge  transfer  such  that  the  change  in  the  energy  of  the  system  stabilizes 
the  most. 

Concluding  remarks 

iportant  insight  into  the  dynamics  of  a  wavepacket  on  a  model  fractal  lattice  has  been 
>tained  through  spatial  and  temporal  variations  of  several  quantities  which  depend 
i  time  dependent  charge  and  current  densities.  Interplay  of  kinetic  and  potential 
ects  governs  the  overall  dynamics.  Initially  the  wavepacket  remains  confined  around 
ulomb  centres  of  the  fractal  lattice.  In  course  of  time  there  result  two  accessible  zones 
ound  the  edges,  away  from  the  coulomb  sources.  Local  chemical  potential  exhibits 
triguing  spatio-temporal  behaviour.  Different  time  dependent  quantities  extremize  in 
ne  in  a  consistent  manner.  After  certain  time  most  of  these  quantities  do  not  change 
ipreciably.  Chemical  potential  seems  to  contain  valuable  dynamical  information  and 
serves  closer  scrutiny.  As  expected  the  qualitative  behaviour  of  various  dynamical 
lantities  in  two  lattices  is  similar  although  they  differ  quantitatively.  At  this  stage  the 
act  connection  between  structure  of  the  surface  and  the  resulting  dynamical  informa- 
>n  is  not  clear.  However,  the  dependence  of  the  dynamical  observables  on  the  fractal 
haviour  of  the  lattice  is  conspicuous. 
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Kinetic  description  of  ion  acoustic  waves  in  a  dusty  plasma 
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Abstract.  A  self-consistent  model  for  the  description  of  the  ion  sound  wave  in  a  dusty  plasma  is 
given.  We  show  that  proper  consideration  of  the  ion  attachment  to  the  grains  gives  rise  to  the 
dissipation  of  the  ion  acoustic  wave  in  a  dusty  plasma  medium.  Dissipation  rate  is  proportional 
to  the  sum  of  the  electron  and  twice  the  ion  attachment  frequencies. 

Keywords.    Ion  acoustic  wave;  dusty  plasma;  charge  fluctuations. 
PACSNos    52-25;  52-35 

1.  Introduction 

Investigations  of  the  dusty  plasma  in  recent  times  have  acquired  considerable  momen- 
tum. Major  reason  for  the  spurt  in  the  interest  is  due  to  its  numerous  applications  to  the 
industrial  (e.g.  semiconductor  industry,  plasma  etching,  etc.)  as  well  as  cosmic  plasmas 
(e.g.  planetary  rings,  cometary  tails,  etc.).  Periodicity  of  the  subject  being  reviewed  at 
present  is  1-5  years  [1-4].  Recent  interest  in  the  investigation  of  the  dielectric  response 
of  the  dusty  plasma  medium  focusses  around  charge  fluctuating  grains.  A  consistent 
formulation  of  the  dynamics  of  the  dusty  plasma  (with  constant  mass  and  radius  of  the 
grain)  is  given  by  Bhatt  and  Pandey  [5]  and  Vladmirov  [6].  Mass  of  the  grain  may  also 
vary  e.g.  due  to  coagulation  of  the  grains  [7]  and  thus,  grain  radius  and  mass  in  general, 
may  also  become  dynamical  quantities.  Set  of  equations  considering  mass  as  a  dynami- 
cal variable  have  been  given  elsewhere  [8].  For  the  present  however,  we  shall  assume 
that  the  grain  is  spherical  and  of  fixed  radius  and  mass  and  that  the  grain  charging  is 
solely  due  to  the  attachment  of  electrons  and  ions.  Grains  will  be  assumed  massive 
compared  to  the  electrons  and  ions  and  therefore,  immobile.  Dynamical  response  time 
of  the  grain  is  slowest  in  comparison  with  the  response  time  of  the  plasma  particles  so 
that  the  assumption  about  their  immobileness  is  justified. 

We  shall  study  the  ion  acoustic  waves  in  a  dusty  plasma.  Ion  acoustic  mode  has  been 
studied,  in  the  past  by  several  authors  e.g.  Tsytovich  and  Havnes  [9],  D'Angelo  [10], 
Bhatt  and  Pandey  [5]  and  Ma  and  Yu  [1 1  ].  Tsytovich  and  Havnes  [9]  describe  the  ion 
acoustic  mode  by  assuming  that  the  plasma  fluctuations  maintain  their  quasineut- 
rality.  They  demonstrate  that  the  mode  is  damped  by  the  charge  fluctuation  and  that 
the  charge  fluctuation  induced  damping  could  be  larger  than  the  collisionless  Landau 
damping.  In  the  kinetic  description  of  Ma  and  Yu  [1 1],  ion  acoustic  mode  is  described 
by  assuming  that  the  perturbations  in  the  plasma  may  cause  space  charge  field  and  that 
the  mode  may  damp  or  grow  depending  upon  how  strong  the  deviation  of  plasma  is 
from  quasineutrality.  D'Angelo  [10]  gives  a  fluid  description  of  the  ion  acoustic  mode 
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ind  shows  that  the  mode  damps  at  the  frequency  at  which  ions  are  injected  in  the 
3lasma.  He  calls  it  'creation  damping'  as  fresh  ions  injected  in  the  plasma  counters  the 
oss  caused  by  the  sticking  of  the  ion  to  the  grain  surface.  His  description  assumes  that 
:he  plasma  always  maintains  quasineutrality.  Bhatt  and  Pandey  [5]  investigate  the  ion- 
icoustic  streaming  instability  and  show  that  the  charge  fluctuation  narrows  the 
parameter  space  of  instability.  Scope  of  the  present  work  includes  the  consistent 
:ormulation  of  the  kinetic  problem  for  the  ion  acoustic  mode  in  the  spirit  of  Vladmirov 
_9]  and  explore,  whether  conclusions  arrived  by  previous  investigation  in  general  and 
Ma  and  Yu  [11]  in  particular,  are  valid.  To  that  end,  we  shall  investigate  the  ion 
acoustic  wave  in  the  dusty  plasma. 

Since  we  are  interested  in  the  low  frequency  ion  acoustic  mode,  electrons  can  be 
issumed  to  follow  Boltzmann  distribution  i.e.  they  are  in  thermodynamic  equilibrium. 
Furthermore,  charging  time  scale  is  comparable  with  the  attachment  time  scale  of  the 
ions  which  in  turn  is  comparable  with  the  acoustic  time  scale  leading  to  appreciable 
dissipation  of  the  acoustic  mode. 

This  paper  is  organized  in  the  following  fashion.  Section  2  describes  the  basic 
equations.  In  §  3  we  discuss  the  perturbed  equations  and  derive  the  dispersion  relation. 
In  §4  discussion  of  the  result  and  comparison  with  the  previous  work  are  given. 

2.  Basic  equation 

It  is  assumed  that  the  grains  are  spherical  conductors  and  charging  of  the  grains  are  due 
to  the  potential  difference  between  plasma  (electrons  and  ions)  and  grain  surface. 
Equilibrium  currents  are  given  by  [12] 


where  subscript  o  denotes  the  equilibrium  quantities;  ma,  Tx  and  nao  are  the  mass, 
temperature  and  equilibrium  density  of  the  ath  particle;  a  and  (j>g  are  the  grain  radius 
and  potential  respectively;  $  is  the  plasma  potential  and  e  is  the  electronic  charge. 
Charge  on  the  grain  is  a  dynamical  quantity  and  its  dynamics  is  described  by  the 
following  equation 


where  Q  is  the  average  grain  charge  i.e. 


and  charge  q  is  an  additional  independent  variable  and  fd  is  the  grain  distribution 
function.  In  equilibrium  leo  +  Iio  =  0  and  Q0  charge  is  carried  by  the  grains.  Equilib- 
rium grain  distribution  is  given  by 

/<to  =  MOz  +  Qo)-  (5) 
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Ion  acoustic  waves  in  a  dusty  plasma 
/namics  of  the  ion  is  described  by  kinetic  Boltzmann  equation 
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:he  effective  collision  cross  section  of  the  ions  with  the  grains.  Moment  of  eq  (5)  is 
scribed  by  Bhatt  and  Pandey  [5].  We  shall  assume  that  in  the  equilibrium  state,  loss 
ion  is  compensated  by  the  external  source  and  the  ion  distribution  function  fio  is 
ixwellian.  We  should  add  that  assumption  of  ion  following  Maxwellian  distribution 
in  general  not  valid  in  the  vicinity  of  the  grains.  Ion  will  respond  to  the  electric  field  of 
;  highly  charged  grains  and  therefore  in  general  should  be  described  by  the  Maxwell 
Itzmann  distribution. 
Electron  follows  Boltzmann  distribution 

(8) 


L      <? . 

Localized  charge  build-up  is  described  by  PoissoiV s  equation 


/,dv.  (9) 

uations(l)-(9)  will  adequately  describe  the  ion  dynamics  in  a  dusty  plasma. 

Dispersion  relation 

xt,  we  introduce  small  perturbation  in  the  distribution  function 

/„=/„„+/</;  /,=/*, +/f,   with  \fd\«flltM'i\«L. 

arge  fluctuation  equation  can  be  written  as 
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ere  fle  is  the  perturbed  electron  density.  Last  term  in  the  above  equation  is  from  the 
•turbed  ion  currents.  The  charge  dissipation  is  described  by  the  charging  frequency 
which  for  Maxwellian  particle  distribution  is  given  by 
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*        dq 
:h  T  =  T,/TP,  z  =  eOn/aTf .  Fluctuation  in  the  grain  distribution  is  described  bv 


For  the  ion  distribution,  one  has 


where  attachment  frequencies  v°td(v]  and  v>M(u)  are  given  by 


(13) 
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Assuming    that    the    fluctuations    can    be    represented    as    the    normal    mode 
~  expz'(cot  -  kx),  ion  distribution  function  is  given  by 
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and  the  charge  fluctuation  Q  is  given  by 
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Last  term  in  the  above  expression  is  0(^dv°d/io)  and  thus  can  be  ignored.  Then  taking 
note  of  the  fact  that  for  co  ~  copi »  m&\(kvTi,  v°,  v£),  where  copi  =  ^/4nnioe2/mi,  we  can 
approximate  the  following  integral  as 


(18) 


Then  plugging  back  (16)  and  (17)  in  the  Poisson's  equation  (9)  and  taking  note  of  (18), 
one  gets  the  following  dispersion  relation  (DR) 


l^^o 
u  ndo 


=  0 


(19) 


where  /.e  =  Te/4nneoe2,  x  =  \leo\ndjeneo.  We  note  that  a  is  the  electron  attachment 
frequency,  as  have  been  shown  by  Bhatt  and  Pandey  [5].  One  may  calculate  v°ed  for 
a  Maxwellian  electron 


Similarly  ion  attachment  frequency  is  given  by 


(20) 


(21) 
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Ion  acoustic  waves  in  a  dusty  plasma 

Thus,  we  see  that  using  the  constraint  Ieo  +  Iio  =  0,  one  can  write  v°d  =  v°ed(l  +  (n  >n  ^ 
This  result  is  in  agreement  with  the  result  derived  on  the  intuitive  ground  by  Bhatt  and 
Pandey  [5] 


fv^y  (22) 

-dv.     '  m\ 


Without  loss  of  generality,  assuming  k  ||  x  and  going  to  spherical  polar  coordinate  to 
evaluate  (19)  with  co  ~  ojpi  »  (kvTi,  v°d,-v°id},  one  gets 


(24) 

3  e  nio  "•    ' 

where  A  =  1  +  (1  —  (eQ0/CT;))~  1.  Ion  susceptibility  X;  becomes 

The  above  expression  has  been  derived  by  assuming  v°d(v)  v  vid  which  tantamounts  to 
assuming  that  Zde2/amiv2  «  1  i.e.  ratio  of  the  dust  potential  to  the  kinetic  energy  is 
much  less  than  1  .  In  the  other  limit,  one  cannot  take  v°id  out  of  integral  and  the  result  will 
renormalize  the  ion  acoustic  speed. 

While  evaluating  (23)  we  have  not  considered  the  resonant  contribution  to  the  ion 
susceptibility.  For  w  «  Pv°d,  where  P  =  (nd/ne)(aTJe2)  is  the  ratio  of  dust  space  change 
to  the  electron  space  change  (Havnes  et  al  [13])  resonant  denominator  is  evaluated  by 
the  usual  Landau  rule.  It  has  been  demonstrated  by  Tsytovich  and  Havnes  [9],  that  the 
change  fluctuation  induced  damping  rate  may  exceed  the  Landau  damping. 

Thus,  DR  can  be  written  as 

(w2  -  k2  Cs2)(co  -  iv°')  -  i(e  +  2v°d)k2  C2S  =  0  (26) 

where  v°d  =  v°d  +  a/(l  +  k2  Ae2),  Cs  =  ^ewpj(^\  +k2/2),  e  =  a(l  -  (1neoA/3nio)).  In  writ- 
ing DR  (26)  we  have  ignored  terms  0(v°a,  v°d^  v°idv°d).  We  see  that  DR  (26)  reduces  to  DR 
of  [  1  1]  in  v°id  ->  0  limit.  As  £  a  v°id,  we  see  that  the  proper  consideration  of  ion  attachment 
causes  the  mode  to  damp  faster  than  have  been  reported  [11]. 

4.  Discussion 

DR  (26)  can  be  solved  analytically.  However,  since  to  »  (v°id  ~  e),  it  is  instructive  to  solve 
the  above  equation  perturbatively.  Then  the  roots  are 


(27) 

£ 
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Figure  1.    This  licure  shows  variation  of  y  =  (2«,-  )/a  against  .v  =  /c/t,  for  differ 
\alues  of  J-Uir/n  J  and  ft-g,,  C7;.)=  -2-5  when  vV,  =  0.  It  is  evident  from 
figure  thai  for  certain  values  of  .v.  mode  becomes  unstable. 


1  +\»  X  H    -H  8/7  i*  (1 


4.0 


„  ---    This  figure  shows  the  same  when  v"id  ^  0.  It  is  clear  from  figure  2  that 
wavelengths,  charge  fluctuation  will  cause  the  damping  of  ion  acoustic  moc 


»e  see  that  the  ion  acoustic  mode  acquires  a  damping  rate  proportional 
-  fc/2  •*»  ^  %  i£.  The  abme  result  modifies  the  damping  rate  predicted  by  Ma  and 
[11],  between  ion  acoustic  wave  and  dust  charge  fluctuation  lead; 

tilts  Our  result  is  broadly  in  agreement  with  the  conclusions  drawn 

and  D'Angelo  [10].  It  must  be  mentioned  however,  that 
by  D'Angeio  [10]  is  caused  primarily  by  the  injection  of  the  i 
aad  fan  h  to  the  injection  frequency  of  the  ion  in  the  plasma  wherea 

ike  is  due  to  the  attachment  of  ions  to  the  grains. 

It  is  to  note  thai  without  the  ion  attachment  frequency  v°id  ion  acou 

wmy  [H],  if  fi<Q,  i.e.  A;Jr  >  3/2(nio/neo)A.  However,  : 

charge  fluctuation  shows  that  even  when  R  <  0,  there  i; 
condition  (  v-,  -  «/2)  <  0  is  satisfied.  As  is  evident  from 
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id  (21)  and  the  discussion  therein  v'M/e  %  a  =  neo/nio.  The  number  density  ratio  d  may 
i ve  different  values,  e.g.  for  diffuse  cloud  in  the  interstellar  media  6  %  0  [10]  whereas  in 
e  planetary  rings  and  cometary  tails  d^  1  [1-4].  Thus  we  see  that  the  charge 
ictuation  will  invariably  cause  the  damping  of  ion  acoustic  response  of  the  plasma, 
le  instability  suggested  in  ref.  [1 1]  is,  thus  highly  unlikely  in  a  dusty  plasma.  Figure  1 
spicts  the  plot  of  y  -  (2w,./a)  vs.  x  =  k/ie  for  d  =  0, 0-5, 1  and  (eQQjC  T,)=  —  2-5  when 
n  attachment  frequency  has  been  ignored.  We  see  that  for  d  =  05  and  1,  mode 
icomes  unstable  for  some  critical  wavelength.  Figure  2  depicts  the  same  for  non-zero 
n  attachment  frequency.  We  see  from  figure  2  that  there  is  no  instability,  and  ion 
loustic  mode  can  only  damp  in  a  dusty  plasma. 

We  conclude  that  the  self-consistent  treatment  of  charge  dynamics  leads  to  the 
imping  of  the  acoustic  wave.  Further,  Landau  damping  will  also  cause  the  mode  to 
ssipate  [9]  and  thus,  there  is  no  window  whatsoever  contrary  to  the  claim  mode  in  ref. 
1]  where  the  mode  may  become  unstable.  The  parameter  regime  in  which  the 
oustic  mode  becomes  unstable  is  highly  unlikely.  Our  analysis  demonstrates  that  the 
le  of  ion  attachment  in  the  charge  dynamics  of  a  dusty  plasma  is  crucial.  It  is  through 
is  attachment  that  the  ion  loses  its  energy  to  the  grains  and  causes  the  damping. 
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Abstract.  This  paper  addresses  itself  to,  a  practical  problem  encountered  in  using  iterative 
learning  rules  for  associative  memory  models.  The  performance  of  a  learning  rule  based  on  linear 
programming  which  overcomes  this  problem  is  compared  with  that  of  a  representative  iterative 
rule  by  numerical  simulation.  Results  indicate  superior  performance  by  the  linear  programming 
rule.  An  algorithm  for  computing  radii  of  maximal  hyperspheres  around  patterns  in  the  state 
space  of  a  model  is  presented.  Fractional  volumes  of  basins  of  attractions  are  computed  for  the 
representative  iterative  rule  as  well  as  the  linear  programming  rule.  With  the  radii  of  maximal 
hyperspheres  as  weight  factors  for  corresponding  patterns  to  be  stored,  the  linear  programming 
rule  gives  rise  to  the  maximal  utilisation  of  the  state  space. 

Keywords.    Associative  memory;  learning  rule;  linear  programming. 
PACS  Nos    87-10;  89-10;  89-80 

1.  Introduction  * 

There  has  b'een  much  interest  recently  in  the  McCulloch-Pitts  [1]  neural  networks  as 
models  of  associative  memories.  In  the  Hopfield  [2]  version  of  such  a  network,  the 
main  task  of  a  designer  is  to  compute  the  neural  interaction  strengths  as  a  function  of 
the  patterns  to  be  stored  and  recalled  associatively.  Algorithms  to  compute  interaction 
strengths  are  generally  known  as  learning  rules.  The  simplest  of  them  is  based  on  an 
hypothesis  by  Hebb  [3].  Though  biologically  meaningful,  the  Hebb  rule  cannot  be  used 
to  store  more  than  014JV  random  uncorrelated  patterns  where  N  is  the  number  of 
neurons  in  the  network  [4].  The  pseudo-inverse  rule  [5]  can  store  N  linearly  indepen- 
dent patterns.  Earlier,  Willshaw  et  al  [6]  have  proposed  a  rule  for  highly  correlated 
patterns  which  stores  patterns  of  the  order  of  N2/(\ogN)2.  In  these  and  other  rules  [7] 
the  sizes  of  basins  of  attraction  around  the  patterns  in  the  state  space  are  not  controlled 
explicitly.  Using  a  number  of  rules  proposed  subsequently  [8-14],  it  is  not  only 
possible  to  ensure  that  patterns  are  stored  in  the  network  but  also  to  provide  around 
each  one  a  basin  of  attraction  whose  measure  is  specified  by  the  designer.  Conditions 
for  stability  in  terms  of  measures  of  the  sizes  of  basins  of  attraction,  the  patterns  and  the 
interaction  strengths  are  formulated  first.  Beginning  with  arbitrary  values  for  the 
interaction  strengths,  these  rules  help  the  designer  to  move  towards  the  correct  values 
iteratively  till  these  conditions  are  satisfied.  Convergence  to  the  correct  values  are 
ensured,  provided  a  solution  satisfying  the  initial  conditions  exists  in  the  space  of 
interaction  strengths.  However,  in  practical  situations  it  will  be  difficult  to  know  if 
a  solution  exists  a  priori.  Therefore,  one  would  have  to  assume  conservative  values  for 


the  sizes  ot  basins  ot  attraction,  consequently  tne  oasms  ot  attraction  may  not  pack  the 
state  space  of  the  network  as  tightly  as  possible. 

A  learning  rule  based  on  linear  programming  [1 1]  overcomes  the  above  mentioned 
drawback  of  iterative  rules.  The  linearity  of  conditions  of  stability  combined  with  linear 
normalization  of  interaction  strengths  enables  one  to  employ  the  linear  programming 
approach.  Unlike  the  case  of  iterative  rules  where  measures  of  the  sizes  of  basins 
attraction  are  expected  as  external  inputs,  in  the  linear  programming  approach, 
maximal  values  for  these  measures  are  computed  automatically.  The  interaction 
strengths  are  computed  as  a  byproduct.  Though  this  rule  was  first  formulated  by 
Krauth  and  Mezard,  its  distinct  advantages  are  not  argued  or  brought  out  clearly  in 
their  paper.  Moreover,  they  do  not  report  extensive  numerical  studies  of  this  rule's 
performance  in  recalling  stored  patterns.  This  paper  proposes  to  complement  their 
work  and  fill  out  the  above  mentioned  gaps. 

While  employing  the  linear  programming  approach,  it  is  possible  to  specify  different 
relative  weights  for  different  patterns.  If  these  weights  are  selected  in  accordance  with 
the  inter-pattern  distances,  then  maximal  utilization  of  the  state  space  of  the  model  can- 
be  achieved  as  a  result.  An  algorithm  to  compute  the  radii  of  maximal  geometric 
hyperspheres  around  each  pattern  is  presented.  Use  of  these  radii  as  relative  weights  for 
patterns  results  in  maximal  fractional  volume  of  basins  of  attraction  as  shown  by 
Monte-Carlo  simulations. 

The  paper  is  organized  as  follows.  The  terms  and  notations  are  introduced  in  §  2  by 
way  of  briefly  reviewing  the  iterative  learning  rules.  It  is  followed  by  §  3,  which  describes 
the  rule  based  on  linear  programming.  Section  4  discusses  numerical  simulations  for 
comparing  the  performances  of  a  representative  iterative  and  linear  programming 
rules.  The  next  section  presents  an  algorithm  to  compute  the  radii  of  maximal 
geometric  hyperspheres  around  each  pattern.  It  also  describes  a  Monte-Carlo  method 
to  compute  the  fractional  volume  of  basins  of  attraction  and  presents  the  results. 
Discussions  and  conclusions  make  up  the  last  section. 

2.  Iterative  learning  rules 

The  standard  Hopfield  network  consists  of  N  Ising  spins  cr-  =  +  1  for  i  =  1, . . . ,  N.  The 
interaction  strength  from  spin;  to  spin  i  is  denoted  by  Jtj.  All  self-interactions,  JH,  are 
assumed  to  be  zero  throughout  this  paper  and  the  {J.j}  need  not  be  symmetric.  The 
local  field,  h{(t),  at  site  i  and  at  time  t  is  computed  by 

h,(t)=  Y  Ji;a;(t)  m 

'v '      LJ    ij  j^f  \i) 

where  aft)  is  the  Ising  spin  at  site;  and  at  time  t.  The  spins,  or  equivalently  neurons,  are 
updated  according  to  the  rule 

o",-(t  +  <5t)  =  sign(/2.(t)).  (2) 

A  configuration,  or  equivalently  a  pattern,  {£f:i  =  1,..., N}  is  stored,  i.e.,  becomes 
a  fixed  point  of  dynamics  (2),  if 


(3) 
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is  positive  for  all  i.  In  fact,  ci  is  a  measure  of  the  size  of  basin  of  attraction  for  the  pattern 
{£J.  Therefore,  for  effective  associative  recall,  c-t  must  be  greater  than  a  positive 
constant,  say,  k.  For  storing  p  patterns,  biased  or  random,  {£f  :  i  =  1,  .  .  .  ,  N;  fi  =  1,  .  .  .  ,  p}, 
the  fixed  point  conditions 

%hi({tf})>k  (4) 

must  be  satisfied  at  each  site  i  for  each  pattern  \JL.  The  positive  constant  could  be 
pattern  dependent,  if  required.  Without  some  sort  of  normalization  or  bounds  for  the 
J0-'s,  the  inequalities  (4)  would  be  devoid  of  meaning.  Two  types  of  normalizations 
are  used  in  the  iterative  rules  proposed  in  the  literature.  In  the  first  type  the  J^'s  are 
expected  to  obey 


for  each  i  (in  some  cases  N  is  replaced  by  1).  In  the  second,  the  bound 

l^l<Jmax  (6) 

is  imposed  on  all  J^.'s. 

In  Gardner's  version  of  the  iterative  approach,  the  interaction  strengths  {Ju}  are 
computed  as  follows.  The  {J.j}  are  assigned  any  set  of  values  to  begin  with.  A  mask  gf  is 
defined  at  each  site  i  and  for  each  pattern  \L  as  follows. 


6-  = 


N  \l/2  N 

£'5)  -£ 


where  6(x)  is  the  Heaviside  step  function.  The  iterative  corrections  for  {J(7}  are 
computed  as 

AJy^Wj  (8) 

to  update  the  {J^}  till  ef  become  zero  for  all  i  and  fj..  The  normalization  condition  (5)  is 
made  use  of  implicitly  in  computing  ef  using  (7).  This  local,  Hebb  rule  like,  algorithm  is 
guaranteed  to  converge  provided  a  solution  of  { J;j-}  satisfying  the  conditions  (4)  exists 
in  the  space  of  interactions  [15].  Existence  of  a  solution  is,  of  course,  dependent  on  the 
number  and  nature  of  patterns  and  also  on  the  constant  k.  Larger  the  value  of  p  and  k, 
the  less  probable  it  is  for  a  solution  to  exist.  Gardner's  deep  analytical  results  are  useful 
in  knowing  about  the  existence  of  such  solutions  when  one  deals  with  an  abstract 
category  of  patterns  drawn  from  a  known  probability  distribution.  However,  in 
practical  situations  where  one  needs  to  store  a  specific  set  of  p  patterns  there  is  no  way 
of  knowing  if  a  solution  exists  or  not.  There  are  no  guidelines  either  for  choosing  an 
optimal  value  for  k.  Both  these  problems  are  easily  overcome  by  adopting  the  linear 
programming  approach  to  compute  the  interaction  strengths. 

Two  other  iterative  rules  [10, 11]  make  use  of  the  fact  that  the  inequality  conditions 
(4)  can  be  decoupled  into  JV  independent  sets.  The  latter  rule,  called  minimum -overlap 
rule,  is  a  minor  variant  of  the  former.  Starting  with  some  a  priori  value  of  k,  the  rules 
iteratively  try  to  satisfy  these  N  sets  of  conditions  independently.  Once  again,  conver- 
gence is  ensured  only  if  a  solution  for  { J^.}  exists.  Therefore  both  these  rules  are  subjec 
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to  the  critical  observations  made  against  Gardner's  iterative  rule.  In  fact,  unless  the 
computation  of ;./,,  \  is  posed  as  a  problem  of  optimizing  k,  the  problem  of  finding  out 
a  priori  if  a  solution  ol  \Jti\  exists  or  not  cannot  be  avoided  whatever  be  the  iterative 
formulation.  Griniasly  and  (  iutfreund  j  16]  do  point  out  that  learning  can  be  viewed  as 
an  optimi/ation  problem.  However  they  do  not  discuss  the  application  oflinear 
programming  tor  lea  mint!  patterns. 

Krauth  and  Me/aid  [  1  1  |  present  the  linear  programming  approach  in  the  same 
paper  where  they  discuss  the  minimum-overlap  rule  and  compare  the  two.  They 
compute  measures  of  stabilities  or  equivalent  si/es  of  basins  attraction  in  ease  of  both 
the  rules.  When  the  noi  mali/ation  condition  (6)  is  used,  these  measures  turn  out  to  be 
larger  for  the  linear  proiiramming  rule,  as  one  would  expect.  If  one  uses  (5)  for 
normali/ation,  then  the  problem  of  computing  |./(.;.j  can  no  longer  be  posed  as  a  linear 
programming  problem.  1  lowever,  in  their  paper  a  comparison  between  the  two  rules 
with  normali/ation  based  on  (5)  is  presented  which  shows  the  linear  programming 
approach  in  poor  light.  It  is  nol  clear  how  they  used  the  simplex  algorithm  to  maximise 
k  subject  to  conditions  (4)  and  l>)  as  the  latter  are  quadratic.  The  point  argued  here  is 
that  when  all  the  conditions  invoked  in  the  problem  are  linear  the  best  approach  is  one 
bused  on  linear  programming.  Besides  rinding  out  if  a  suitable  solution  of  {yr)  exists  or 
nol,  this  approach  provides  unique  and  maximal  possible  stabilities  or  sizes  of  basins 
attraction  for  the  patterns  to  be  stored. 

3.  Linear  programming  rule 

Linear  programming  is  concerned  with  finding  optimum  values  of  linear  objective 
functions  subject  to  a  set  of  equality  or  inequality  constraints.  The  constraints  have  to 
be  linear  and  the  standard  method  for  solving  a  linear  programming  problem  (LPP)  is 
known  as  the  simplex  algorithm.  Starting  with  a  feasible  solution  which  satisfies  the 
given  constraints,  the  algorithm  improves  the  solution  from  one  step  to  another  till  the 
optimum  is  reached.  In  case  no  feasible  solution  exists,  the  algorithm  is  capable  of 
detecting  that  as  well  as  the  possibility  of  unboundedness  of  the  optimal  value.  In  the 
canonical  formulation  of  an  LPP  one  has  a  linear  objective  function  which  is  to  be 
maximised  and  a  number  of  linear  constraints.  The  variables  are  normally  assumed  to 
take  only  positive  values. 

The  problem  of  computing  j./f/|  can  be  formulated  as  an  LPP  in  the  following 
manner.  Let  the  objective  function  <•  be  specified  by 

(•  =  *.  (9) 

One  set  of  constraints  are 

£?  I  •/,•/£} -•*><>.  (10) 

./    i 

The  other  set  of  constraints  are  the  normalization  bounds  specified  by  (6).  The  problem 
is  to  maximise  c  subject  to  (10)  and  (6)  which  are  a  set  of  pN  +  N(N-\)  linear 
constraints. 

The  {Jy}  can  be  positive  or  negative,  whereas  in  the  canonical  formulation  of 
LPP,  negative  values  for  the  variables  are  not  permissible.  Therefore  a  new  variable 
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J'ij  is  defined  as 


max"  (11) 

Substituting  for  J.j  in  (10)  and  (6)  from  (11)  one  obtains 

N 

\j 

" 


and 

•Jy>0  (13) 

^•<2^max  (14) 

for  all  ij  and  p..  The  diagonal  J'n  are  presumed  to  be  0.  The  standard  requirement  that 
the  variables  must  have  positive  values  is  met  by  (13)  while  (14)  provides  positive 
bounds  on  {J'u}.  These  bounds  eliminate  the  possibility  of  an  unbounded  optimum. 
The  final  formulation  is  then,  maximise  c  =  k  subject  to  (12)  and  (14).  This  formulation 
is  only  marginally  different  from  that  of  Krauth  and  Mezard  [11]. 

4.  Numerical  simulation 

The  principal  aim  of  the  numerical  simulation  reported  here  is  to  compare  the 
performances  of  Gardner's  version  of  iterative  rule  and  the  linear  programming  rule. 
Throughout  the  simulations  N  is  assumed  to  be  100.  The  performance  of  a  learning  rule 
is  evaluated  as  follows.  Firstly  a  value  of  p  is  chosen  between  1  and  N,  and  that  many 
unbiased  patterns  are  generated.  The  interaction  strengths  are  computed  next  using  the 
learning  rule  to  be  evaluated.  One  of  the  stored  patterns,  say  {£?: i  =  1,...,N},  is 
selected  as  a  test  pattern  and  is  persisted  with  throughout  the  evaluation.  The  test 
pattern  is  corrupted  by  selecting  IN  (denotes  Input  Noise)  sites  randomly  and  inverting 
the  spin  values  there.  Using  the  result  as  the  input  key,  the  network  is  used  to  recall  the 
original  test  pattern.  For  a  given  value  of  IN,  a  total  often  recall  experiments  are  carried 
out.  In  each  case  the  overlap,  m,  of  the  recalled  state,  {<T;},  with  the  original  test  pattern 
is  computed  using 


and  the  average  of  these  ten  overlaps  is  evaluated.  By  varying  a  =  p/N  from  5/N  to 
1  and  IN  from  0  to  50,  a  surface  of  overlaps  over  the  phase  plane  a  x  IN  is  obtained.  The 
shape  of  the  surface  by  itself  is  an  indicator  of  the  efficacy  of  the  learning  rule  being 
evaluated.  Let  an  overlap  value  greater  than  or  equal  to  0-99  in  a  recall  experiment  be 
termed  a  success.  By  sectioning  the  overlap  surface  with  a  m  =  0-99  plane  .parallel  to  the 
phase  plane,  a  contour  line  of  IN  as  a  function  of  a  is  obtained.  It  is  easy  to  see  that  the 
value  of  IN  on  this  line  at  a  given  a  is  an  experimental  measure  of  the  radius  of  the  basin 
of  attraction,  r,  around  the  test  pattern.  This  line  separates  the  phase  plane  into  two 
regions  in  one  of  which  the  network  functions  as  useful  associative  memory.  Thus,  the 
shape  and  location  of  this  line  on  the  phase  plane  is  a  good  index  of  performance  of  the 
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Figure  1.  The  surface  of  overlaps  (m)  between  recalled  patterns  and  the  original 
test  pattern  is  shown  in  (a)  for  the  iterative  learning  rule.  The  number  of  bits  selected 
randomly  and  reversed  in  the  test  pattern  to  generate  the  input  key  for  recall 
experiments  is  represented  by  IN.  The  variable  a  denotes  the  storage  capacity  p/N. 
The  contour  line  on  the  surface  of  overlaps  at  a  value  of  m  =  099  is  shown  in  (b).  This 
contour  line  represents  the  radius  of  the  basin  of  attraction  of  the  test  pattern  as 
a  function  of  a.  The  value  of  the  parameter  k  is  10. 


The  evaluation  of  Gardner's  version  of  iterative  rule  has  been  done  taking  the  value 
of  k  to  be  10.  Also,  to  make  the  comparison  with  the  linear  programming  rule 
meaningful,  instead  of  using  (5),  eq.  (6)  is  used  for  the  normalization  of  {J^.}.  That  is,  the 
mask  £?  is  defined  as 


N 

•I 

j=  1 


(15) 


While  updating  {Jtj},  condition  (6)  is  enforced  by  clipping  those  values  of  fjf.|  which 
exceed  Jmax.  A  value  10  is  assumed  for  Jmax.  With  constant  magnitude  of  the  correction 
term  AJ0-  in  (8),  convergence  has  been  found  to  be  slow  and  sometimes  impossible. 
Attenuating  the  correction  term  progressively  as  the  iterations  proceed  has  helped  to 
speed  up  the  convergence.  Assuming  a  maximum  permissible  number  of  iterations,  the 
correction  term  was  linearly  attenuated  from  a  maximum  value  of  1  to  a  minimum 
value  of  0.  The  resulting  overlap  surface  and  the  m  —  0-99  contour  line  for  the  iterative 
rule  are  shown  in  figure  1. 

Evaluation  of  the  linear  programming  rule,  however,  runs  into  implementational 
problems  as  a  direct  employment  of  the  simplex  algorithm  would  require  a  huge 
amount  of  computer  memory.  With  N  =  100,  the  size  of  the  co-efficient  matrix 
representing  constraints  (12)  and  (14)  would  be  of  the  order  of  108.  Such  a  size  cannot  be 
accommodated  within  the  memories  of  standard  workstations  which  usually  have 
memories  of  the  order  of  106  to  107.  Therefore  a  subdivision  of  the  LPP  becomes 
necessary.  In  this  context  it  may  be  noted  that  the  constraints  (12)  decouple  into 
N  independent  sets  of  constraints  one  set  for  each  i.  Optimization  of  k  for  each  set  using 
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(a) 
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Figure  2.  The  surface  of  overlaps  (m)  and  the  m  =  0-99  contour  line  in  the  case  of 
the  linear  programming  rule.  The  meaning  of  the  variables  and  parameters  is  the 
same  as  given  for  figure  1.  The  value  of  J  is  10. 


the  simplex  algorithm  will  result  in  the  computations  of  {J'ij:j=l,...,N}  for  the 
corresponding  i.  The  entire  set  of  {7'y}  can  be  computed  by  employing  the  simplex 
method  for  i  =  1, . . . ,  N,  individually.  This  scheme  requires  memory  of  the  order  of  104 
which  is  manageable.  Keeping  the  co-efficient  matrix  small  also  helps  in  limiting  the 
numerical  round-off  and  truncation  errors.  However,  a  side  effect  of  this  scheme  of 
computing  {J;j}  is  the  dependence  of  optimal  k  on  i.  In  the  actual  simulation,  k  as 
a  function  of/  has  been  found  to  vary  only  slightly.  In  fact,  the  standard  deviation  of /c's 
for  various  fs  turns  out  to  be  less  than  5%.  For  Jmax  a  value  of  10  is  assumed,  as  in  the 
case  of  the  iterative  rule.  The  simplex  implementation  due  to  Press  et  al  [17]  has  been 
used  in  the  computations.  The  overlap  surface  and  the  corresponding  m  =  0-99  contour- 
line  for  the  linear  programming  rule  are  shown  in  figure  2. 

Qualitatively  the  linear  programming  rule  appears  to  perform  better  than  the 
iterative  one.  The  overlap  surface  bounded  between  the  contour  line  and  the  origin  is 
flat  with  fewer  undulations  compared  to  that  of  the  iterative  rule  implying  error-free 
recall  in  that  region  of  the  phase  plane.  For  a  close  visual  comparison  the  m  =  0-99 
contour  lines  of  both  the  rules  are  shown  together  in  figure  3.  In  view  of  the  fact  that 
only  ten  recall  experiments  are  carried  out  at  each  point  to  determine  the  overlap,  the 
error  margins  on  these  contour  lines  are  likely  to  be  high.  Nevertheless  from  the  general 
trend  of  these  lines,  linear  programming  rule  appears  to  perform  better  than  the 
iterative  one,  though  towards  the  oc  =  1  -0  limit  the  contour  lines  tend  to  coincide.  This  is 
expected,  because  the  optimal  values  of  k  around  a=  1-0  as  found  by  the  linear 
programming  rule  turn  out  to  be  close  to  10.  The  optimal  values  of  k  as  a  function  of 
a  decrease  with  increasing  a  which  is  again  expected.  Fewer  patterns  mean  larger  basins 
of  attraction  for  each  one  of  them  and  vice  versa.  For  a  visual  comparison  of  the 
performances  of  linear  programming  and  iterative  rules  with  that  of  Hebb  rule,  the 
interested  reader  may  refer  to  figure  4. 
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Figure  3.  For  a  close  comparison,  m  =  099  contour  lines  in  figures  1  and  2  are 
shown  together  here.  The  line  with  circles  corresponds  to  the  iterative  rule  while  that 
with  triangles  corresponds  to  the  linear  programming  rule.  For  a  substantial  part  of 
the  a  axis,  the  linear  programming  rule  has  a  larger  radius  of  basin  of  attraction 
compared  to  the  iterative  rule. 
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Figure  4.  The  surface  of  overlaps  (m)  and  the  m  =  0-99  contour  line  are  shown  here 
for  the  Hebb  learning  rule.  The  meaning  of  the  variables  and  parameters  is  the  same 
as  given  for  figure  1. 

As  mentioned  earlier,  the  near  perfect  flatness  on  the  overlap  surface  corresponding 
to  the  linear  programming  rule  suggests  perfect  recall  in  that  region  of  the  phase  plane. 
To  confirm  that,  recall  experiments  are  carried  out  on  either  side  of  the  m  —  O99 
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Table  1.  A  comparison  of  the  transition  from  perfect 
recall  performance  to  no  proper  recall  for  two  learning 
rules  is  presented  in  this  table.  In  the  first  column 
a  denotes  the  storage  capacity  p/N.  While  nu  denotes 
the  maximal  number  of  bit  changes  which  ensures 
perfect  recall,  nt  denotes  the  minimal  number  of  bit 
changes  which  results  in  no  proper  recall.  In  each  case 
a  hundred  recall  experiments  have  been  carried  out  for 
determining  these  numbers. 

Iterative  rule     Linear  programming  rule 
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contour  line  along  the  IN  axis.  A  total  of  100  experiments  are  carried  out  at  some 
discrete  intervals  of  a  and  at  various  values  of  IN.  For  a  given  a,  a  threshold  value  of  IN, 
«,,  above  the  contour  line  is  found  beyond  which  none  of  the  100  experiments  results  in 
a  perfect  recall.  Similarly  a  threshold  value  of  IN,  nu,  below  the  contour  line  is  found  so 
that  everyone  of  the  experiments  results  in  a  perfect  recall.  The  values  of  nu  and  nt  are 
presented  in  table  1  for  five  different  values  of  a,  for  both  the  iterative  and  the  linear 
programming  rules.  These  data  confirm  the  perfect  performance  of  the  linear  program- 
ming rule  over  a  much  larger  region  of  the  phase  plane  compared  to  the  iterative  rule. 
Also  the  transition  from  perfect  performance  to  total  failure  is  narrower  in  the  case  of 
the  linear  programming  rule. 

5.  Maximal  basins  of  attraction 

In  the  evaluation  of  the  linear  programming  rule,  it  is  presumed  that  the  weights 
for  patterns  are  equal.  As  a  result  the  basins  of  attraction  around  each  one  of  the 
patterns  are  likely  to  be  approximately  of  the  same  size.  Given  a  set  of  patterns,  the 
inter-pattern  distances  in  the  state  space  of  the  network  are  bound  to  have  a  spread. 
Therefore  so*ne  patterns  can  afford  to  have  larger  basins  of  attraction  compared  to 
others  so  that  the  state  space  is  more  optimally  utilised.  However,  with  equal  weights 
for  all  patterns  such  an  eventuality  may  not  be  explicitly  realized.  Ideally,  the  basins  of 
attraction  for  various  patterns  should  close  out  any  gap  between  them  and  be 
tangential  to  each  other.  Such  an  ideal  situation  may  be  obtained  as  follows. 

Let  the  p  patterns  be  conceived  as  p  points  in  the  state  space.  Then  the  task  is  to 
find  maximal  hyperspheres  centred  around  each  point  such  that  these  hyperspheres 
do  not  intersect  each  other  but  are  just  mutually  tangential.  The  radii  of  these 
hyperspheres  can  be  characterized  as  follows.  Let  hyperspheres  with  zero  initial  radii 
around  each  one  of  the  p  points  expand  at  a  uniform  rate.  Once  two  or  more 
hyperspheres  come  in  contact,  they  shall  stop  expanding  and  their  radii  reach  their  final 
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venues,  t^nce  all  expansion  stops,  the  radii  ol  the  Irozen  hyperspheres  oecome  me 
maximal  radii  associated  with  the  corresponding  points.  The  assumption  of  uniform 
rate  of  expansion  ensures  equal  rights  for  all  patterns  to  the  space  around  them.  The 
following  procedure  which  is  a  heuristic  one  finds  close  approximations  to  these  radii 
using  suitable  metric. 

Firstly,  for  each  point,  its  nearest  neighbour  is  found.  The  initial  approximation  to 
the  radi  us  of  the  maximal  hypersphere  at  that  point  is  assigned  as  half  the  distance  to  its 
nearest  neighbour.  After  this  step,  all  the  points  are  sorted  with  respect  to  their  initial 
radii  in  ascending  order.  For  each  point  j,  beginning  with  the  first  point  in  the  sorted 
array,  it  is  checked  whether  that  point  and  its  nearest  neighbour  are  mutual  nearest 
neighbours.  If  yes,  then  the  initial  approximate  radii  are  taken  as  the  final  radii  for  both. 
If  not,  maximum  permissible  radius  for  the  point  i  is  computed  as  the  distance  to  its 
neighbour  minus  the  current  radius  associated  with  the  neighbour.  This  maximum 
permissible  radius  is  checked  against  the  current  radii  of  all  other  points  to  see  if 
corresponding  hyperspheres  intersect.  During  this  check,  the  maximum  permissible 
radius  is  updated  if  necessary.  At  the  end  of  this  check,  the  final  value  of  maximum 
permissible  radius  is  assigned  as  the  radius  of  the  maximal  hypersphere  at  point  i.  Once 
all  the  points  are  assigned  their  radii,  the  procedure  stops.  More  details  of  this 
procedure  are  given  in  appendix- 1. 

To  verify  the  above  procedure,  maximal  circles  for  several  sets  of  50  points  within 
a  square  have  been  computed.  A  sample  output  of  the  algorithm  is  illustrated  in 
figure  5.  The  radii  of  these  maximal  circles  can,  of  course,  be  computed  by  brute  force 
simulation  of  the  problem  scenario  assuming  a  small  though  finite  incremental  step  for 
expanding  the  radii.  In  practice  it  has  been  found  that  the  above  heuristic  algorithm 
could  compute  about  95%  of  the  radii  correctly;  also,  the  error  in  the  remaining  5% 
cases  was  found  to  be  within  20%  of  the  correct  values. 
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Figure  5.  A  sample  set  of  fifty  random  points  inside  a  square  on  a  plane  with 
maximal  circles  around  each  one  of  them.  The  radii  of  these  circles  are  computed  by 
an  algorithm  described  in  text. 
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Let  the  radii  for  maximal  hyperspheres  around  patterns  be  denoted  by  -f.  Using 
these  /'s  as  weight  factors  for  the  patterns,  condition  (10)  may  be  written  as 

N 

The  linear  programming  rule  may  be  employed  to  maximise  k  subject  to  (16)  and  (6). 
The  outcome  is  bound  to  be  the  maximum  possible  utilization  of  the  state  space  to 
carve  out  the  various  basins  of  attraction.  To  verify  this  assertion,  the  Monte-Carlo 
method  has  been  employed  to  compute  the  fractional  volume  of  basins  of  attraction. 
A  total  of  10000  recall  experiments  are  carried  out  starting  from  random  input 
patterns.  Each  time  the  recalled  state  is  compared  with  all  the  stored  patterns  and  the 
maximum  overlap  is  computed.  If  the  overlap  is  above  or  equal  to  a  preselected  threshold 
the  random  input  pattern  is  deemed  to  be  inside  the  basin  of  attraction;  else  outside. 
The  fractional  volume  of  basins  attraction  is  then  computed  as  the  ratio  of  the  random 
input  patterns  that  are  found  to  be  inside  the  basins  to  the  total  number  of  experiments. 
Employing  this  method  the  fractional  volumes  of  basins  of  attraction  have  been 
computed  for  three  cases;  firstly  for  the  linear  programming  rule  with  radii  of  maximal 
hyperspheres  around  patterns  as  their  weights;  secondly  for  the  linear  programming 
rule  with  equal  weights  for  the  patterns;  thirdly  for  the  iterative  rule.  Hamming  metric  is 
used  while  computing  the  radii  of  maximal  hyperspheres.  The  fractional  volumes  are 
computed  as  a  function  of  a  from  a.  =  0-05  to  a  =  0-5.  The  value  of  a  is  increased  by 
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Figure  6.  Fractional  volume,  /,  of  basins  of  attraction  for  stored  patterns  as 
a  function  of  storage  capacity  a  is  shown  here.  Line  A  corresponds  to  the  case  of 
linear  programming  rule  with  radii  of  maximal  hyperspheres  around  stored  pat- 
terns as  weights  for  these,  patterns.  Line  B  corresponds  to  the  case  of  the  same  rule 
with  equal  weights  for  the  patterns.  Line  C  corresponds  to  the  iterative  rule.  Figure 
6a  represents  the  case  of  basins  whose  minima  have  100%  overlap  with  stored 
patterns.  Figure  6b  represents  the  case  of  basins  with  one  or  more  local  minima 
whose  overlaps  with  stored  patterns  are  greater  than  or  equal  to  95%. 
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me  icquncu  numoer  01  additional  pauerns  to  me  sei  01  aireauy 
The  results  of  the  Monte-Carlo  experiments  are  presented  in  figure  6.  Figure  6a 
corresponds  to  the  overlap  threshold  of  value  1-0  while  figure  6b  corresponds  to  a  value 
of  0-95.  As  remarked  earlier,  the  first  case  utilizes  the  state  space  most  optimally.  Also, 
the  difference  between  the  second  and  first  cases  is  quite  pronounced,  implying  that  the 
use  of  maximal  radii  as  weights  for  the  patterns  further  improves  the  sizes  of  the  basins 
of  attraction  wherever  possible. 

6.  Discussions  and  conclusions 

A  striking  feature  of  the  LP  rule  is  the  sharp  transition  exhibited  by  its  overlap  surface. 
In  contrast,  the  overlap  surface  corresponding  to  the  iterative  rule  shows  a  gradual 
downward  trend  as  IN  and  a  increase.  Though  a  steady  degradation  in  performance  is 
desirable,  a  sharp  fall  in  the  recalling  ability  has  its  advantage.  If  an  input  key  does  not 
match  with  any  of  the  stored  patterns  then  the  linear  programming  rule  is  capable  of 
reporting  a  clear  failure.  Such  an  ability  may  be  valuable  in  cognitive  modelling. 
Another  important  aspect  of  the  linear  programming  rule  is  the  associated  linear 
normalizing  bounds  on  the  {J..}.  This  aspect  will  be  useful  in  designing  VLSI  neural 
chips  with  finite  precision  synaptic  strengths.  Limiting  the  magnitude  of  interaction 
strengths  is  also  biologically  meaningful  as  biological  synapses  cannot  represent 
strengths  of  arbitrary  precision. 

Results  of  the  computer  simulations  suggest  that  the  basins  of  attraction  in  case  of 
linear  programming  rule  have  smooth  shapes  converging  to  unique  local  minima. 
Thus,  when  an  input  key  falls  within  a  specific  basin  of  attraction  it  leads  to  the  recalling 
of  the  corresponding  pattern  without  any  error.  Both  the  flatness  on  the  overlap  surface 
in  figure  2  and  the  high  values  of  nu  in  table  1  corroborate  this  fact.  Further  evidence  is 
available  from  data  presented  in  figure  6.  The  dependence  of/  on  a  in  cases  A  and  B  as 
shown  in  the  figure  has  been  observed  to^be  invariant  with  respect  to  the  two  values  of 
threshold  overlaps  implying  the  presence  of  unique  local  minima.  With  the  iterative 
rule,  however,  /as  a  function  of  y.  has  been  found  to  depend  on  threshold  overlaps. 

The  radius  of  the  maximal  hypersphere  around  a  pattern  as  defined  in  the  previous 
section  gives  an  estimate  of  noise  which  the  pattern  can  tolerate  without  losing  its 
uniqueness.  In  that  sense  these  radii  could  be  useful  in  the  study  of  associative  memory 
and  pattern  recognition  models.  The  procedure  outlined  earlier  for  computing  radii  of 
the  maximal  hyperspheres  requires  computing  time  of  the  order  of  O(p2)  where  p  is  the 
number  of  patterns.  It  is  not  an  exact  algorithm,  though  its  solutions  in  hundreds  of  test 
cases  in  two-dimensions  have  proved  to  be  satisfactory.  Developing  an  exact  algorithm 
based  on  either  an  'incremental'  or  "divide  and  conquer'  approach  may  prove  to  be 
a  worthwhile  exercise. 

Maximal  possible  utilization  of  the  state  space  of  the  network  is  demonstrated 
employing  linear  programming  rule  with  differentiated  weight  factors  for  the  patterns. 
A  trivial  approach  to  find  the  closest  match  is,  of  course,  to  compute  distances  between 
the  input  and  all  the  patterns  to  be  stored  and  then  select  one  of  the  latter  that  is  least 
distant  from  the  input.  It  is  instructive  to  know  that  the  basins  of  attractions  in  such  an 
approach  are  nothing  but  Voronoi  polytopes  which  cover  the  state  space  completely. 
This  is  a  most  ideal  situation,  which  may  not  be  achieved  by  the  Hopfield  network. 
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As  pointed  out  earlier,  the  computational  approach  adopted  here  for  implementing 
the  linear  programming  rule  makes  the  sizes  of  the  basins  of  attraction  depend  on  the 
neuronal  index  i.  During  computer  simulations,  however,  this  dependence  was  found 
to  be  small.  As  the  standard  Hopfield  network  is  homogeneous,  it  is  necessary  that  its 
performance  parameters  are  independent  of  neuronal  indices.  In  its  extensions  where 
homogeneity  may  not  be  desirable,  making  the  sizes  of  basins  of  attraction  depend  on 
neuronal  indices  may  be  functionally  necessary.  The  human  brain,  it  may  be  noted,  is 
highly  differentiated  even  in  regions  dedicated  to  specialized  tasks  and  any  attempt  in 
understanding  it  should  consider  heterogeneous  networks  sooner  or  later. 
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Appendix  1 

Procedure  FindRadiiMaxHyperspheres  (p) 
prNumber  of  points 

{ 

For  each  point  i 

{ 

Radii AssignedFlags  [i]  =  —  1; 
For  each  point  j^i 

Compute  Distance  [/,j]; 
Compute  its  nearest  neighbour,  /c; 
NearNeighbour  [i]  =  k; 

Set  the  initial  approximation  to  the  radius  of  hypersphere  at  i  as 
Radii  [/]  =  Distance  [i,  /c]  *O5; 

1 

Sort  the  points  with  their  associated  radii  in  ascending  order; 
Initialize  a  pointcounter  to  1; 

Step  L:     Pickup  the  current  point,  i,  from  the  sorted  points  array 
If  RadiiAssignedFlags  [i]  =  —  1)  then{ 

Neighbour  =  NearNeighbour  [i]; 

If  (/  =  NearNeighbour  [Neighbour])  Then 

{ 

RadiiAssignedFlags  [/]  =  1; 
RadiiAssignedFlags  [Neighbour]  =  1; 

} 
Else 


MaxRadius  =  Distance  [f,  Neighbour]  —  Radii  [Neighbour]; 
For  each  point;  ^/ 

If    (Distance  [i,;]  <  MaxRadius  +  Radii  [/']) 
MaxRadius  =  Distance  [/J]  —  Radii  [;']; 
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1 
< 
) 


Radii  [f]  =  MaxRadius; 
RadiiAssignedFlags  [/]  =  1; 


j 

Increment  pointcounter  by  1; 
If  (pointcounter  <  =  p)  Go  To  Step  L; 
Else 
STOP: 
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THIRD  WORKSHOP  ON 
HIGH  ENERGY  PARTICLE  PHYSICS 


Editor 
S  Uma  Sankar 


PREFACE 

The  Third  Workshop  on  High  Energy  Particle  Physics  (WHEPP-3)  was  held 
at  the  Institute  of  Mathematical  Sciences,  Madras  during  January  10-22,  1994. 
This  is  the  third  in  the  series  of  workshops;  the  first  one  was  held  at  T.  I.  F.  R,., 
Bombay  (January  1989)  and  the  second  one  was  held  by  S.  N.  Bose  Centre, 
Calcutta  (January  1991).  About  100  particle  physicists  have  attended  the  work- 
shop. 

As  in  the  previous  two  workshops,  four  broad  topics  were  chosen  for  the 
formation  of  working  groups.  These  are 

1.  Collider  physics  and  structure  functions  (coordinated  by  Amitava  Datta 
and  Rohini  Godbole) 

2.  Beyond  standard  model  (coordinated  by  Anjan  Joshipura  and  Probir  Roy) 

3.  Heavy  ion  physics  (coordinated  by  Sourendu  Gupta  and  J.  C.  Parikh) 

4.  Heavy  flavour  physics  (coordinated  by  S.  Uma  Sankar) 

The  first  week  was  devoted  to  review  talks  on  various  topics  of  current  interest 
and  to  the  formation  of  working  groups.  The  second  week  was  devoted  to  talks 
by  participants  on  their  work  and  the  discussion  of  the  problems  that  were  raised 
in  working  group  sessions.  The  group  coordinators  presented  the  results  of  the 
discussions  on  the  last  day  of  the  workshop. 

These  proceedings  contain  the  written  version  of  the  review  talks  given  dur- 
ing the  first  week.  In  addition  to  the  contributions  included  here,  the  following 
talks  were  also  presented: 

1.  Top  quark  search  at  tevatron  . . .  N.  K.  Mondal 

2.  Searches  for  new  physics  at  LEP200  . . .  Sudhakar  Katta 

3.  Neutrino  Masses  and  Mixing  Angles  . . .  K.  S.  Babu 

4.  Lepton  Number  Violation  . . .  J.  W.  F.  Valle 

Most  of  the  support  for  the  workshop  has  come  from  S.  N.  Bose  National 
Centre  for  Basic  Sciences,  Calcutta.  Additional  support  was  also  given  by  the 
Department  of  Science  and  Technology,  Government  of  India,  New  Delhi,  The 
Institute  of  Mathematical  Sciences,  Madras  and  the  International  Center  for 
Theoretical  Physics  (ICTP),  Trieste. 

I  thank  Prof.  R.  Ramachandran,  Director,  Institute  of  Mathematical  Sci- 
ences, for  the  enthusiasm  with  which  he  agreed  to  host  the  workshop  and  for 
providing  constant  support.  I  also  thank  Prof.  G.  Rajasekaran,  Dr.  M.  V.  N. 
Murthy  and  Dr.  Rahul  Sinha  for  offering  their  help  without  any  hesitation  at  all 
times.  My  sincere  thanks  to  all  the  office,  the  library  and  the  guest  house  staff 


of  the  Institute  of  Mathematical  Sciences,  who  worked  untiringly  to  take  care 
of  the  workshop  participants  and  made  the  workshop  a  success.  Finally  I  would 
like  to  thank  my  colleagues  at  T.  I.  F.  R.,  particularly  Dr.  Sourendu  Gupta  and 
Dr.  Suresh  Govindarajan  of  the  Theory  Group  and  Dr.  Kajari  Mazumdar  of 
the  E.  H.  E.  P.  Group,  for  their  help  in  preparing  the  proceedings. 


S.  Uma  Sankar 


B  physics  at  LEP 

TARIQ  AZIZ 

Tata  Institute  of  Fundamental  Research,  Bombay  400005,  India 

Abstract.  The  Z  factory  at  LEP  is  like  a  mini  B  factory.  About  20%  of  the  time  a  Z 
decays  to  a  pair  of  b  flavoured  hadrons.  As  a  result,  this  has  provided  a  major  boost  to 
B  physics  studies.  The  four  sophisticated  LEP  detectors,  with  millions  of  Z  events,  have 
provided  a  variety  of  interesting  results.  Only  a  snapshot  of  the  major  developments  can 
be  covered  in  one  review.  We  have  considered  the  results  related  to  Z  — »•  bb  coupling,  in 
particular,  the  measurements  of  Rb  =  Ph/Fi^d  and  the  forward  backward  charge  asym- 
metry. The  b  hadron  properties,  like  inclusive  and  exclusive  lifetime  measurements  are 
summarised  briefly.  Measurements  of  B°-B~°  oscillation  parameter  as  well  as  the  evidence 
for  time  dependent  evolution  have  been  discussed.  Studies  of  interesting  decay  modes 
have  been  mentioned. 


1  Introduction 

The  Z  factory  at  LEP  is  a  copious  source  of  b  flavoured  hadron  production  via  Z  — > 
bb.  About  20%  of  the  visible  Z  decay  to  this  channel.  As  a  consequence  B  physics 
at  LEP  has  become  one  of  the  most  interesting  issues  being  persued  vigorously. 
A  glimps  of  various  interesting  topics  is  shown  in  figure  1.  An  attempt  is  made 
to  cover  significant  part  of  this  exciting  area  with  a  minimum,  but  unavoidable, 
sacrifice  to  the  rest.  As  a  broad  line  of  separation,  there  are  two  specific  areas, 
the  one  which  has  to  do  with  Z  — >  bb  coupling,  like  partial  width  and  asymmetry. 
The  second  one  has  to  do  with  the  b  flavoured  hadron  properties,  like  lifetime  of 
hadrons,  neutral  meson  mixing  and  branching  ratios.  Another  issue  that  deals  with 
the  strong  interaction  feature  of  b  quark,  like  fragmentation  parameter  and  strong 
coupling  constant,  atl  are  not  touched  at  all. 

2  Measurement  of 

Within  the  SM  framework  Z  — >  bb  partial  width  is  an  important  measurement,  both 
at  the  tree  level  and  beyond.  A  precise  measurement  is  essential  for  understanding 
top  mass  dependent  weak  radiative  correction  in  Z  — »  bb  vertex.  The  virtual  effects 
of  fermions  and  bosons  lead  to  the  following  weak  corrections  [1]. 

•  The  propagator/oblique  correction,  called  Ap,  where  relevant  contribution 
comes  from  top  quark  and  Higgs  boson.  All  partial  widths  and  asymme- 
tries increase  with  increasing  top  mass  while  due  to  Higgs  boson  the  effect  is 
opposite. 
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Figure  1:  Different  topics  at  LEP 
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The  vertex  correction  where  the  relevant  effect  is  in  Z  —  >  bb  vertex  due  to 
massive  top  quark,  called  6vtb-  Due  to  this  effect  the  Z  —  »  bb  partial  width 
decreases  with  inceasing  top  mass.  The  b  quark  asymmetry  however  remains 
unaffected  [2]  . 


As  a  result  of  the  above  two  corrections,  the  ratio  Rb  =  Tb/Fhadi  wnere  ^b  ^  *ne 
Z  — »  bb  partial  width  and  Fhad,  the  partial  width  due  to  all  five  quark  flavours 
(u,d,s,c,b),  depends  only  upon  the  top  quark  mass.  All  common  corrections  in  the 
propagator  get  cancelled,  making  Rb  a  clean  top  mass  indicator,  free  from  Higgs 
and  QCD  effects  [3].  Specifically, 

Rb  —  Rbo(l  ~h  5vtb(l  —  Rbo))» 

where  Rb0  =  0.2197  ±  0.0001  is  estimated  switching  off  the  vertex  correction,  6vtb. 
For  top  mass  above  100  GeV  [4], 

20  a    m? 

X  «5    7\        lliry 


131 

6 


m 


Due  to  this  clean  dependence  on  top  quark  mass,  any  possible  deviation  from  this 
trend  will  also  be  a  clean  indication  for  physics  beyond  SM  (see  figure  2).  Thus  an 
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Figure  2:  Rb  as  a  function  of  top  mass.  The  MSSM  (Minimal  Supersymmetric 
Standard  Model)  calculations  are  performed  with  a  choice  of  parameters  to  show 
the  maximal  effect  [5]. 


accurate  measurement  of  Rb  at  LEP  is  one  of  the  topmost  priorities,  exploiting  all 

possibilities  of  tagging  b  quark. 

Three  different  techniques  have  been  used  io  isolate  the  signal. 


Tariq  Aziz 

Using  large  momentum,  p,  large  transverse  momentum,  pr,  leptons  from  SL 
decays.  Where  p?  is  with  respect  to  the  nearest  candidate  jet  axis. 

Exploiting  the  multidimensional  event  shape  in  a  sophisticated  neural  network 
analysis. 

Exploiting  lifetime  and  hence  displaced  decay  vertex  of  b  flavoured  hadron  in 
a  sophisticated  Silicon  Microvertex  Detectors  (SMD). 


2.1  Rb  from  Lepton  tag 

The  massive  (~  5  GeV)  b  flavoured  hadron  leads  to  large  PT  of  the  charged  lepton 
in  a  semileptonic  decay.  Due  to  the  hard  b  quark  fragmentation  (suppressed  gluon 
emission)  the  b  falvoured  hadron  is  energetic.  About  70%  of  b  quark/beam  energy 
is  carried  by  the  b  hadron.  This  leads  to  the  large  momentum  lepton.  Thus 
restricting  to  large  p,pr  leptons  it  is  possible  to  select  high  purity  b  sample.  By 
applying  a  typical  cut  on  p  >  3  GeV/c  and  pr  >  1  GeV/c  a  sample  purity  of  80% 
for  b  quark  can  be  achieved.  The  relevant  backgrounds  to  the  signal  come  from 
charm  semileptonic  decays  and  the  fake  leptons.  The  fake  p,  are  the  punch-throughs 
pointing  to  the  vertex,  whereas  fake  elections  are  the  the  charged  hadrons  matched 
wrongly  to  an  electromagnetic  shower.  Thus  the  important  systematics  come  partly 
from  the  knowledge  of  SL  branching  ratios  and  partly  from  MC  modelling  of  the 
event  in  the  detector.  A  substantial  control  to  these  systematics  is  achieved  by 
tagging  both  the  b  quarks  using  two  high  p,  pr  charged  leptons,  called  the  double 
tag.  The  double  tag  to  single  tag  ratio  is  used  to  minimise  the  MC  dependence. 
However  this  is  achieved  at  the  cost  of  losing  statistics.  The  results  from  the  four 
LEP  experiments,  ALEPH  [6],  DELPHI  [7],  L3  [8]  and  OPAL  [9]  based  on  this 
technique  are  summarised  in  table  1. 

2.2  Rb  from  Event  shape  Anaylsis 

The  Z  — »  bb  event  is  identified  using  general  properties  of  the  event  as  a  whole. 
A  sophisticated  neural  network  technique  is  used  to  exploite  the  multidimensional 
correlations  in  several  eventshape  variables.  This  method  has  the  advantage  of 
being  least  sensitive  to  the  semileptonic  branching  ratio.  Even  though  heavy  b 
quark  radiates  less  energy  as  gluons,  the  resulting  b  flavoured  hadron  being  short 
lived  decays  to  several  particles.  The  energy  sharing  among  the  secondaries  is,  thus, 
more  even  compared  to  the  case  of  light  quark  where  the  resulting  leading  light 
hadron  being  stable  can  be  fairly  energetic.  The  mass  effect  of  b  quark  manifests 
itself  in  the  jet  boost(7/3  =  p/m]  which  tends  to  be  small.  With  these  features  a  set 
of  variables  are  defined  which  are  used  as  input  to  neural  network.  ALEPH  [10], 
DELPHI  [11]  and  L3  [12]  have  performed  such  analysis.  An  example  of  such  an 
analysis  is  considered  from  the  L3  experiment.  A  set  of  five  variables  for  each  of 
the  two  most  energetic  jets  of  the  event  are  defined: 


B  physics  at  LEP 

Table  1:          I^/T^  from  Lepton  Tagging 

f  '  •  •  ]  •  '  '  I  '  '  '  1  '  •  '  I 

OPAL  Single  leptons        m 0.220  i  0.002  ±  0.013 


OPAL  Dileptons 


.0.22310.01210.011 
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0.21810.00810.007 


ALEPH  Global  Fit 

-s-  Double  Tag 


0.22210.00410.006 
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0.22210.00910.004 
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In  addition  there  is  one  global  variable  JE?ieft  defined  as  the  fraction  of  energy  outside 
the  two  most  energetic  jets  in  the  event 

Using  a  functional  expansion  neural  network  developed  by  the  L3  collaboration, 
they  achieve  a  35%  b  tagging  efficiency  with  a  sample  purity  of  60%.  The  discrim- 
inating strength  of  various  variables  is  confirmed  by  using  high  pr  lepton  tagged 
events  in  data  and  MC.  The  network  response  and  the  purity  versus  efficiency  plot 
for  b  tagging  are  shown  in  figure  3.  The  network  response  is  checked  using  lep- 
ton tagged  events  to  confirm  that  the  correlations  are  well  simulated  in  the  MC. 
Finally  a  fit  to  the  data  response  using  b  and  non-b  from  MC  deduces  the  value  of 
Rb-  Figure  4  shows  the  network  response  for  lepton  tagged  events  and  the  overall 
distribution  to  determine  Rb-  A  summary  of  all  the  measurements  at  LEP  using 
neural  network  technique  is  given  in  table  2.  The  major  systematics  in  this  kind 
of  analysis  come  from  the  understanding  of  the  fragmentation  behaviour  and  the 
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Figure  3:  (a)  The  network  response  for  b  events  and  non-b  events  separately,  (b) 
Purity  and  efficiency  of  the  classifier  function  for  b  events. 
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Figure  4:  The  network  response  for  high  pr  muon  tagged  events  for  which  b  purity 
is  80%  (left).  Fit  of  the  network  response  to  the  data  (right). 
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Table  2: 


Shape 


^M  '  '  '  I 

L3NN  ^fl^  0.222  ±0.003  ±0.007 


DELPHI  NN  Sp 0.232  ±  0.005  ±0.017 

ALEPH  NN  +  leptons  ^^^^^  0.228  ±  0.005  ±  0.005 

AVERAGE  mm^mmm.  0.226  ±0.006 

I   ...    i   ...   i   ...    i   ...   I 

0.210  0.220  0.230   0.240  0.250 
2.3     Rb  from  Lifetime  Tag 


The  b  hadron  lifetime  fa  ~  1.5p«)  is  exploited  for  impact  parameter  measure- 
ment or  secondary  vertex  reconstruction  using  a  high  resolution  vertex  detector. 
ALEPH  [13],  DELPHI  [14]  and  OPAL  [15]  have  used  this  technique  to  measure 
Rb«  ALEPH  has  used  three  dimensional  (3D)  signed  impact  parameter,  5,  using 
silicon  microvertex  detector.  (For  illustration  of  impact  parameter  see  later  in  life- 
time section.)  DELPHI  has  used  2D  signed  impact  parameter  alongwith  p\  of 
charged  tracks  with  respect  to  jet  axis  as  well  as  decay  distance  in  a  multidimen- 
sional analysis  (MDA).  OPAL  has  used  2D  signed  impact  parameter  to  count  the 
so  called  forward  multiplicity  (tracks  originating  from  secondary  vertex,  forward  to 
the  primary  vertex).  Initially  they  used  their  conventional  tracking  chamber  and 
added  SMD  later.  The  results  being  consistent.  To  reduce  the  MC  dependence,  the 
double  tag  technique  is  applied.  The  event  is  divided  in  to  two  hemispheres  using 
the  event  thrust  axis. 

The  relevant  systematics  here  come  from  the  knowledge  of  lifetime,  mixtures  of 
mesons,  baryons  and  the  decay  braching  ratios  to  charged  particles  as  well  as  the 
ratios  like  D+/D°  etc.  To  give  an  example,  we  describe  the  ALEPH  analysis.  The 
typical  features  of  ALEPH  silicon  micorovetex  detector  are  the  following.  The  ver- 
tical resolution  is  ~  10  £tm;  horizontal  resolution  is  ~  20  /Am,  with  an  RMS  for 
interaction  region  •—  110  pm.  The  significance  for  tracks  originating  from  a  sec- 
ondary vertex  is  defined  as  S  =  8/<r(6],  where  tr(8)  is  the  measurement  uncertainty. 
For  tracks  originating  from  b  hadron  decay,  S  ~  4.5.  A  further  cut  of  6  <  2.5mm 
is  applied  to  remove  K°  decay  and  7  conversion  background.  A  joint  probability 
PH  for  a  given  hemisphere  tracks  is  calculated  using  5  values  such  that  none  of  the 
tracks  are  from  secondary  vertex.  A  small  value  of  PH  means  large?  signal  proba- 
bility. A  cut  on  PH(<  10~4)  is  used  to  select  good  purity  b  sample  (~  96%)  with 
a  good  tagging  efficiency  (~  26%).  Using  the  active  detector  acceptance,  the  net 
tagging  efficiency  (~  15%)  is  still  signifcantly  large.  A  plot  of  purity  and  efficiency 
as  function  of  PJJ  is  shown  in  figure  5.  Using  cut  on  PH  from  the  two  hemispheres 
the  single  and  double  tag  probabilities  are  defined.  For  single  tag 
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Figure  5:  Hemisphere  tag  efficiency  and  signal  purity  as  a  function  of  PH  cut. 


Fl  = 


F2  =  Rbeb  +  (1  -  Rb)4  + 


and  for  double  tag 


Here  Ab  is  the  correlation  coefficient  between  the  two  hemisphere  tag  probabilities 

and  it  is  determined  using  MC.  e\>  and  ex  are  tagging  efficiencies  for  b  and  non-b. 

The  two  expressions  are  solved  for  Rb  and  Cb  iteratively  whereas  ex  is  determined 

using  MC.  The  important  sources  of  correlations  are: 

The  detector  dependent  correlation  where  both  jets  are  back  to  back;  being  both 

either  in  good  or  bad  measurement  regions  due  to  detector  symmetry. 

The  physics  processes,  for  example  energetic  gluons  making  third  jet  and  in  that 

case  the  event  thrust  axis  and  the  hemispheres  are  affected.   The  net  systematic 

uncertainties  from  all  sources  are  still  the  smallest  in  this  measurement.  A  summary 

of  all  measurements  using  this  technique  is  given  in  table  3. 

2.4     Summary  of  Rb  from  different  methods 

The  results  from  the  three  methods  for  all  the  experiments  al  LEP  are  summarised 
in  table  4.  The  average  Rb  =  0.2207  ±  0.0027  is  somewhat  large  when  compared 
with  the  SM  prediction.  To  give  some  idea,  for  an  optimistic  top  mass  of  ~  160 
GeV  the  expected  value  of  Rb  =  0.2162.  Since  the  errors  are  still  large,  it  is 
premature  to  talk  of  any  deviation  from  SM.  On  the  other  hand  an  interpretation 
in  the  framework  of  SM  will  indicate  a  top  mas  <  185  GeV  at  Iff  level  if  no  other 
information  from  LEP  or  Tevatron  is  used.  An  improved  measuremnet  of  Rb  with 
better  understanding  of  systematics  is  likely  to  prove  very  exciting  from  the  point 
of  view  of  new  physics,  once  top  mass  is  known  from  Tevatron  direct  searches. 


3     Forward-Backward  Charge  Asymmetry 

The  forward-backward  charge  asymmetry  of  b  quark  provides  a  sensitive  measure- 
ment of  weak  mixing  angle.    For  the  process  Z  — »  f  f  (f:   fermion)  .where  final 


a  pnystcs  at 


Table  3: 


ALEPH  (3D  Imp) 


fr°m  Lifetime  Tag 


0.21  92  ±  0.0022  ±  0.0030 


DELPHI  (L  +  Imp) 


0.2060  ±0.0060  ±0.0060 


DELPHI  (MD  A.  vtx) 


0.2220  ±0.0070  ±0.0070 


OPAL  (Imp,  FWML) 


0.2200  ±0.0040  ±0.0060 


AVERAGE 


Table  4: 


0.21 84  ±0.0036 


0.195  0.205  0.215  0.225  0.235 


rbb/Thad  from  all  methods 


Leptons 


0.222  ±  0.006 


Event  Shape 


0.226  ±0.006 


Lifetime 


0.218410.0036 


AVERAGE 


0.2207  ±0.0027 


I    ...    I    ...   \   ...   I   ...   I 

0.200  0.210  0.220  0.230  0.240 


state  fermion  angle  9  is  measured  with  respect  to  the  election  beam  direction,  the 
differential  cross-section  is  given  by 

=  -(1 
V 


deos0       8V  yv  4 

where  <rTT  and  <r**  are  symmetric  vector/axial  vector  coupling  cotributions  to  the 
cross-section  and  crn  is  the  interference  term  and  it  is  asymmetric.  The  forward 
backward  asymmetry  is  given  by 

3  &** 

Aft,  =  7  -  5  <r-r  =  cr"  +<r** 

4  <TT 

On  the  Z  peak 


For  charge  Q  =  1/3  quark  (b,s,d)  V{  is  large  which  leads  to  large  Aft.  Furthermore, 
because  of  large  Vf  the  above  expression  simply  reduces  to  a  convenient  form 

Aft,  =±  C  ve  =  C(l  -  4sin2  0W) 

where  ve  =  (1  —  4sin20w)  is  the  vector  coupling  of  electron  and  sin3  &w  the  ef- 
fective weak  mixing  angle.  An  important  consequence  of  this  simplicity  is  that  b 
quark  asymmetry  is  independent  of  Z  — *  bb  vertex  corrections  [2],  To  an  excellent 
approximation  C  ~  1.395,  The  QED  and  QCD  effects  tend  to  dilute  the  asymme- 
try. Taking  these  effects  into  account  C  ~  |j  to  be  compared  with  the  measured 
asymmetry  on  the  peak.  A  word  of  caution  is  in  order.  The  QED  effects  are  easier 
to  understand,  however  QCD  effects  need  better  understanding  keeping  realistic 
limitations  of  the  detectors  and  the  fact  that  the  event  thrust  axis  is  used  to  rep- 
resent the  b  quark  direction.  Also  the  effect  of  missing  neutrinos,  which  is  more 
important  for  lepton  tag  analysis  needs  to  be  taken  into  account  carefully.  Any  poor 
understanding  of  detector  as  well  as  physics  process  only  dilutes  the  sensitivity  of 
the  angular  distribution  under  consideration  and  hence  asymmetry. 
The  apparently  observed  bb  asymmetry  as  measured  using  charged  lepton  tag  is 
further  diluted  due  to  B0-B0  mixing  as  the  sign  of  tagging  charged  lepton  is  nipped. 
Thus 

Afbtroe  =  A<bob.erved/(l  -  2*B) 
where  XB(=  0.115)  is  measured  B°-B°  mixing  at  LEP. 

3.1       Asymmetry  measorement  using  lepton  tag 

This  is  the  standard  procedure.  The  b  enriched  sample  is  tagged  by  lepton  and  the 
b  quark  charge  is  assigned  using  lepton  charge  keeping  decay  sequence  in  mind.  The 
event  thrust  axis  is  used  to  approximate  the  b  quark  direction  and  the  hemisphere 
is  assigned  to  be  the  lepton  hemisphere.  A  plot  of  angular  distribution  for  the 
tagged  events  is  shown  in  figure  6.  A  summary  of  the  measured  b  quark  asymmetry 
from  these  experiments  is  given  in  table  5.  The  results  are  based  on  the  data  of 
ALEPH(90-92)  [16],  DELPHI(92)  [17],  L3(90-92)  [18]  and  OPAL(90-91)  [19],  In  all 
these  cases  the  background  asymmetries  are  estimated  using  MC.  The  important 
systematics  as  reported  by  these  experiments  are  based  on  b  hadron  decay  models 
and  Br(b  — »  c  — *•  t+)  and  Br(b  — »  c  — *  t~). 
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;ure  6:  The  distribution  of  —  Qcosdthnut  for  prompt  muon  and  electron  tagged 
nts  taken  from  L3.  The  curves  are  the  result  of  the  fit. 


Table  5: 


ALEPH  (c,n) 
DELPHI  OJL) 


OPAL(c,fi) 


for  b  quark  at  91.27  GeV 


0.081  ±0.010  ±0.003 
0.102  ±0.016  ±0.012 
0.091  ±0.010  ±0.006 
0.091  ±0.01 8  ±0.007 


AVERAGE  (lepton) 


0.088  ±0.007  ±0.002 


ALEPH  (Vrtx  Charge) 
DELPHI  (Vrtx  Charge) 


0.109  ±0.012  ±0.005 
0.1 16  ±0.019  ±0.020 


AVERAGE  (Vrtx  Charge) 


0.1 10  ±0.011  ±0.003 


AVERAGE  (ALL)  «@«  0.094  ±0.006 
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The  secondary  vertex/impact  parameter  is  used  to  tag  b  hadron.  The  momentum 
weighted  jet  charge  using  all  charged  particles  in  the  jet  is  used  to  assign  the  b 
quark  charge.  The  relevant  systematics  here  are  due  to  fragmentation  and  decay 
models  assumed  to  describe  the  data  as  well  as  meson/baryon  composition.  The 
results  are  however  competitive  to  the  lepton  tag.  Moreover  the  effect  of  missing 
neutrino  is  less  relevant  in  this  case. 

ALEPH  [20]  and  DELPHI  [21]  have  used  this  technique  and  their  results  are  shown 
in  the  table  5.  Averages  for  lepton  tag,  vertex  and  jet  charge  tag  as  well  as  overall 
are  shown  in  the  same  table.  These  results  are  already  corrected  for  the  mixing 
parameter.  Since  these  measurements  are  performed  slightly  above  the  Z  peak,  a 
small  correction  is  applied  and  the  final  measured  peak  asymmetry  is 

A^  =  0.092  ±  0.006 

which  leads  to  sin3  iV  =  0.2322  ±  0.0011  to  be  compared  with  the  LEP  average 
sin2  9w  =  0.2321  ±  0.0006  from  all  methods  including  this  measurement.  Conser- 
vatively one  would  expect  true  sin2#tv  to  be  smaller,  if  all  the  effects  mentioned 
above  are  taken  fully  into  account. 

4     b  Hadron  Lifetimes 

The  measurement  of  b  hadrons  lifetimes  at  LEP  has  the  advantage  of  relatively 
larger  boost  making  it  a  cleaner  measurement  without  any  mixing  of  the  tracks 
from  the  two  jets  in  Z  —  *  bb.  Excellent  measurements,  both  inclusive  as  well  as 
exclusive,  for  b  hadrons  have  been  performed.  Real  justice  can  not  be  done  in  this 
kind  of  review.  Only  a  gilmps  of  the  overall  situation  is  given.  Being  of  fundamen- 
tal importance,  these  measurements  are  relevant  not  only  for  the  understanding  of 
b  hadron  decay  mechanisms  and  the  Cabbibo-Kobayashi-Maskawa  (CKM)  matrix 
elements  in  cosequence,  but  also  from  the  fact  that  b  tagging  using  lifetime  tech- 
nique requires  an  accurate  measurement  to  minimise  the  systematics,  in  particular 
for  Rb.  The  most  naive,  yet  simplest  to  connect  the  lifetime  with  CKM  elements, 
is  the  so  called  spectator  model.  In  this  scheme,  the  light  partner  is  ignored  and 
the  b  quark  is  treated  to  decay  analogous  to  a  muon. 
For  muon  decay 


where  f^  is  the  phase  space  factor,  and  Gp  the  Fermi  coupling  constant.  In  a  similar 
way  for  b  hadron  decay 

r(b  -*  *'x) 


_  Br(b-*/,X) 

•n, 

where  fc  and  fu  are  combined  phase  space  and  QCD  correction  factors  correspond- 
ing to  the  relevant  decay  modes. 
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is  obvious  to  see  that  in  such  a  scheme  the  lifetimes  for  different  mesons  (charged, 
neutral,  strange)  and  baryons  should  be  the  same.  There  axe  several  relevant  di- 
agrams like  W  annihilation,  W  exchange  and  quark  interference  which  need  to  be 
considered  depending  upon  the  quark  content  of  the  b  hadron.  Consequently  one 
would  expect 

TQ+  >  TBO  w  TBI  >  rAb 

to  be  the  more  realistic  case  within  10-20%  [22].  Exact  predictions  are  still  difficult 
as  it  requires  to  understand  hadronic  decay  matrix  (quark  masses  ,  QCD  effects 
etc).  The  experimental  measurements  can  help  a  great  deal  in  this  direction. 

4.1     Inclusive  lifetime  measurements 

The  4  LEP  exp  riments  ALEPH  [23],  DELPHI  [24],  L3  [25]  and  OPAL  [26]  have 
performed  this  measurement.  The  impact  parameter  of  the  lepton  (//  and  e)  are 
used  for  this  purpose.  The  method  is  illustrated  in  figure  7  .  An  example  for 
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Figure  7:    Definition  of  the  transverse  impact  parameter  and  various  resolution 
effects. 


the  actual  fit  to  the  data  taken  from  the  L3  experiment  is  shown  in  figure  8  . 
The  important  systematics  for  this  measurement  come  from  the  understanding  of 
semileptonic  branching  ratios,  fragmentation  parameters  for  heavy  and  light  quarks 
and  the  measurement  resolution  function. 
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detectoi  to  reconstruct  the  invaiaint  mass  of  the  charged  tracks  in  order  to  remove 

the  non-b  background  using  mass  cut  and  also  to  calculate  the  boost  for  the  b 

hadron.   The  secondary  vertex  is  used  for  the  decay  length  and  thus  the  proper 

time  is  extracted. 

ALEPH  has  also  used  charged  tracks  to  reconstruct  the  two  secondary  vertices 

(dipoles)  and  the  distance  between  them  is  exploited  to  infer  lifetime. 

A  summary  of  the  inclusive  T\,  measurements  from  4  LEP  experiments  is  given  in 


Table  6: 


Inclusive  B  lifetime  (ps) 


ALEPH  (Leptons) 


1.49  ±  0.03  ±  0.06 


ALEPH  (Chrd  Trks) 


1.51  ±  0.02±  0.08 


DELPHI  (C3wJ  Titos) . 


1.49  ±  0.1 1±  0.12 


L3  (Leptons) 


1.54±  0.04  ±  0.03 


OPAL  (Leptotw) 


1.52±0.03±  0.04 


AVERAGE 


1.52±  0.02  ±  0.02 


1.30     1.40     1.50     1.60     1.70 

table  6.  We  have  avoided  those  measurements  which,  in  our  opinion,  are  'too  pre- 
liminary'. A  look  at  the  average  T\>  given  here  and  the  averages  in  PDG  for  the  last 
few  years  will  show  that  T\,  has  been  increasing  with  time.  Whether  it  has  reached 
to  the  asymptote  is  not  very  obvious. 


4.2     Exclusive  Lifetimes 

The  exclusive  lifetimes  for  B^,  B°,  B°,  Ab  are  measured  by  looking  for  correlations 
like  (D*,£),  (D°,£),  (Da,£)  and  (A,  t)  with  the  right  sign  combination  for  a  desired 
state.  In  the  following  we  describe  one  example  each  from  the  three  experiments. 

ALEPH  anaylysis  for  B~  and  B°  life  time:  B~  is  inferred  from  a  (D0,^") 
correlation  whereas  B°  is  inferred  from  (D*~*",£~)  correlation.  A  D*+  is  unfavoured 
in  B~  decay.  It  is  idetified  using  the  mass  difference  technique  with  the  condi- 
tion AM  =  M(D°ir)  -  M(D°)  ~  145  MeV  where  D°  is  reconstructed  from  K*  and 
K3ir  mass  combinations.  The  (D°,  I)  flight  intersection  is  used  to  extract  the  decay 
length  and  the  invariant  mass  of  the  combination  is  used  to  extract  the  b  hadron 
boost.  The  effect  of  the  missing  neutrino  is  convoluted  in  the  fit  using  MC  mod- 
elling of  the  data.  The  important  systematics  are  due  to  fake  D*  or  fake  lepton 
or  both  apart  from  MC  modelling  for  semileptonic  decay  as  well  as  D*  and  D** 
relative  contributions.  A  summary  of  all  tae  measurements  for  B*  and  B°  lifetimes 
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Table  7: 


B+  lifetime  (ps) 


ALEPH  (DJ) 


M7+OJZ+0.15 
'•*  '  -  0.1?  -  0.14 


DELPHI  (D°,l) 


.+0.33 
'-  039 


1.30 +*"±  0.16 


OPAL(D,1) 


1.66  ±  030  * 


ALEPH  (D 


1.77  :S£±  0.14 


DELPHI  (Vertex  Charge) 


1.81  ±  0.12  ±  0.19 


1.63  ±  0.21  ±  0.16 


AVERAGE 


1.68  ±  0.12 


0.6       1.1       1.6       2.1       2.6 

by  ALEPH  [27],  DELPHI  [28],  OPAL  [29]  including  result  from  CDF  [30]  is  given 
in  tables  7  and  8. 

DELPHI  B°  lifetime  analysis:  In  this  case  (Ds,£)  correlation  is  used.  The 
D,  is  reconstructed  from  <£*•"  or  K*°K~  combinations.  The  decay  length  is  con- 
sidered between  primary  vertex  and  D,  vertex.  Again  missing  neutrino  effect  is 
convoluted  using  MC.  The  signal  quality  of  this  analysis  for  <£ir~  channel  is  shown 
in  figure  9.  A  summary  of  the  measurements  by  ALEPH  [31],  DELPHI  [32],  and 
OPAL  [33]  is  given  in  table  9. 

OPAL  Ab  lifetime  analysis:  The  analysis  is  based  on  their  central  tracking 
chamber.  It  is  assumed  that  Ag  —  +  AC£~PX  and  that  Ac  —  »•  A(p*~)X.  The  analysis 
ignores  the  production  of  Sb.  Here  (A,/)  flight  intersection  is  taken  as  the  decay 
vertex.  Again  the  effect  of  missing  neutrino  is  convoluted  using  MC.  The  largest 
systematics  to  the  lifetime  estimate  come  from  decay  length  estimate  as  the  open- 
ing angle  between  (A,  t}  is  small.  This  is  simply  because  of  small  reaction  energy 
available  in  a  baryonic  decay.  An  example  of  the  signal  and  the  fit  quality  is  shown 
in  figure  10.  A  summary  of  all  the  measurements  by  ALEPH  [34],  DELPHI  [35] 
and  OPAL  [36]  is  given  in  table  10.  Some  preliminary  results  from  ALEPH  and 
DELPHI  using  (Ac,  t)  correlation  exploiting  their  SMD  are  also  included. 
Finally  a  summary  of  average  B  hadron  lifetimes  based  mainly  on  LEP  is  given  in 
table  11.  To  conclude  this  table,  exclusive  lifetimes  follow  the  trend: 


TB+  >  TB» 


O  >  rAb 


and  considering  the  extremes  we  can  say  that  spectator  model  is  not  enough  and 
other  diagrams  can  not  be  ignored.  Considering  the  situation  a  few  years  back,  we 


B  phyaict  at  LEP 


Table  8: 


ALEPH  (D  .leptoas) 


DELPHICD  .Icpton) 


OPAL  (D  .lepton) 


ALEPH  (D  .nit) 


DELPHI  (Vrtx  Charge) 


CDF(J/y,X) 


AVERAGE 


B°  lifetime  (ps) 


1.52 


+  0.20  +  0.07 
-  0.18  -  0.13 


r+0^9 
023 


1.17  +£!±  0.16 


1.63  ±au:ti; 


1.19^±  0.14 


1.37±  0.15  ±0.21 


1.54  ±  0.2210.10 


1.49  ±  0.10 


0.4       0.9       1.4       1.9       2.4 
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Figure  9:  Proper  time  distribution  for  B8  candidates.    The  inset  show  the  signal 
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Tabfe  9: 


ALEPHflyj 


Bf  lifetime  (ps) 


a96±037 


OPAL(DJ) 


1.131035 
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Figure  10:  A  signal  for  right  sign  (A,*)  combination  (261  events)  as  well  as  the 
wrong  sign  (104  events)  for  the  background  (left).  Fit  to  the  decay  length  distri- 
bution to  extract  the  A*  lifetime  (right). 


I  a  Die  iu; 


ALEPH  (A,l) 
OPAL  (A.1) 
DELPHI  (AJ) 
ALEPH  (Ac,l) 
DELPHI  (Ac,l) 
AVERAGE 


1.11  ±0.15 


Table  11: 


Inclusive 


0.5       1.0       1.5       2.0       2.5 


Summary  B  lifetime  (ps) 
i  i  i  i  I  i  i  i  '  I  '  "  '  i  '  '  "  'I 


1.52  ±  0.03 


,1.68±  0.12 


1.49  ±  0.10 


1.40±  0.23 


1.11±  0.15 


.  1  i  f  •  '      '  '  '  '      '  ' 


0.80     1.05     1.30     1.55     1.80 
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can  say  that  major  progress  has  been  made  in  this  sector,  thanks  mainly  to  LLP. 

5     B°-B°  Oscillation/Mixing 

Like  K°,  the  B°  weak  eigenstates  are  not  the  same  as  the  mass  eigenstates,  rather 
a  linear  combination  of  them.  Thus, 

I  Bl  >  +    B2  >          ,     .  _-  Bl  >  -    B2  > 


where  the  two  states  have  mass  difference  Am  =  Ml  —  M2  and  similarly  a  lifetime 
difference  which  is  considered  to  be  small  and  neglected  in  this  case.  Thus  a  beam 
of  pure  B°  studied  as  a  function  of  time  evolves  into  two  components,  a  fraction 
of  which  is  B~°.  This  can  be  inferred  from  the  final  state  decay  products.  This 
probability  can  be  expressed  as  follows: 


P(B°  -+  B"°)t  =      e~tr(l  -  cos  Amt/ft) 

i 

P(B°  -»  B°)t  =  «  e-'r/*(l  +  cos  Ami/ft) 

L 

Where  F  =  (*)  is  the  decay  width.    For  the  detector  resolution  where  only  time 
integerated  measurements  are  possible: 


Where  x  is  called  the  mean  mixing  parameter  and  ^j?-  =  x,  the  oscillation 
frequency.  For  reasonably  large  oscillation  frequency  (z  >  5),  the  time  integerated 
measurement  loses  its  sensitivity  as  x  —  1/2,  the  mixing  is  maximal. 

Mixing  measurements  are  relevant  for  several  reasons.  Within  SM  framework, 
second  order  weak  transitions  lead  to  e  neutral  meson  oscillation  (illustrated  in 
figure  11).  Considering  the  B°-B°  oscillation,  the  strength  of  oscillation  is  related 
to  mass  difference; 


m3 


?  F(-f  )  TJQCD  B  fl  |  Vt*bVt 


QCD  tbtq 

rt  iTiir 

where  Vqq/  are  CKM  elements;  F  gives  functional  dependence  on  mt;  TJQCD  is  the 
QCD  correction  factor;B  the  bag  factor  and  fe  the  decay  constant   [37].  From  this 
one  would  expects  ^j^«  I  $£  I  '   ^sm8  unitarity  arguments  |  Vti  |  «  |  Vcb  |  and 
!  ^  |  <  sin  Oc  +  |  ^J  |  <  0.35(at  90%  CL).  This  translates  to  |gj  >  8(actually  ~  20), 
indicating  large  mixing  for  Ba  system    [38]. 

5.1     Measurements 
Time  Integerated 

The  simplest  signature  of  B°-B~°  mixing  is  the  presence  of  same  sign  dilepton  events. 
In  the  normal  course  B  —  »  /+X  and  B  —  »  ^~X  and  thus  for  no  mixing  in  a  B°- 
B°  event  one  should  expect  only  unlike  sign  dileptons.  However,  owing  to  mixing 
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Figure  11:  Mixing  diagrams  within  the  framework  of  SM.  The  simplest  signati 
for  B°-B°  is  the  same  sign  dilepton  event. 
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one  can  observe  even  the  like  sign  dileptons.  A  counting  of  like  sign  dileptons,  thus 
provides  a  measure  of  mixing; 


Which  gives  x  =  i+yfiVft  '•  F°r  large  enough  PT  (>  IGeV/c)  cut,  the  relevant 
background  that  fakes  mixing  is  the  cascade  decay  (b  —  *  c  —  *  t)  since  in  this  process 
also  the  sign  of  lepton  is  reversed.  Thus,  the  true  mixing  is  given  by  Xtrue  = 

Xobterved       Xcatca.de 

At  LEP,  all  kind  of  B  hadrons  are  produced  and  hence  one  measures  an  average 
mixing  for  both  d  type  and  s  type  neutral  B  mesons; 

X  =  fdXd  +  £** 

Where,  U  =  P(b  -»  Bd)  and  &  =  P(b  —  >  Bg)  are  the  probabilities  to  observe  the 

two  kind  of  mesons  in  an  overall  hadronic  b  sample.    The  four  LEP  experiments 

have  measured  the  average  mixing  parameter  x  *&  the  follwoing  way. 

ALEPH  [39]:  They  count  dileptons  after  PT  cut  and  correct  for  backgrounds. 

DELPHI  [40],  OPAL  [41]:  They  perform  two  dimensional  maximum  likelihood  fit 

using  P«>m6(l,2)  (denned  later). 

L3  [42]:  Perform  4D  ML  fit  using  P(l,  2),PT(1,2). 

ALEPH  and  DELPHI  have  also  tried  another  method  where  one  jet  charge  is  as- 

signed by  using  the  momentum  weighted  charge  of  the  tracks  in  the  jet,  while  the 

second  is  tagged  by  lepton.    This  improves  the  statistics,  however  at  the  cost  of 

increased  systematics.  We  have  ingnored  this  measurement  at  the  moment. 

To  illustrate  the  measurement  of  x  we  have  considered  one  example  from  OPAL 

experiment.  A  special  variable  : 


is  defined  to  fit  the  dilepton  spectrum.  The  example  of  plot  for  smaller  of  the  two 
Pcomb  is  shown  in  figure  12  alongwith  the  fits.  The  cases  for  mixing  and  no  mixing 
are  illustrated.  The  largest  systematics  are  from  Br(b  -*•  c)  and  Br(b  ->•  c  -*  t).  A 
summary  of  all  the  results  from  the  four  experiments  is  shown  in  table  12.  The 
LEP  average  x  =  0.115  ±  0.011  can  be  compared  with  the  world  average  x  — 
0.121  ±  0.010  that  includes  the  CDF  and  DO  collaboration  results.  Using  the  value 
of  Xd  —  0.155  db  0.026  from  ARGUS  and  CLEO  and  assuming  Ci  =  0.39  and 
f,  =  0.12,  we  get  x«  =  0.45  ±  0.09,  a  number  consistent  with  maximal  mixing  [43]. 


Time  Differential  Measurement 

The  way  to  extract  B,  mixing  parameter  is  to  separate  the  slow  and  fast  compo- 
nents of  the  oscillation  by  studying  the  time  dependent  behaviour  or  use  exclusively 
identified  states.  As  a  first  step  to  this  goal,  ALEPH  has  demonstrated  the  mea- 
surement of  B  £  frequency.  This  has  been  done  by  two  methods.  In  one  case  (D* ,  t) 
correlation  has  been  studied  as  a  function  of  time  [44].  In  the  other  case  (t,l)  cor- 
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Figure  12:  Determination  of  mixing  parameter  by  OPAL  in  different  channel 

Tabk  12:       Mixing  Parameter  %  from  dileptons 
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(D%  £)  Analysis 

One  side  of  the  event  is  tagged  by  lepton  with  PT  >  0.75  GeV/c.  In  this  case, 
the  D*  seen  on  the  other  side  comes  mainly  from  B°  —  »  D*+,X.  Since  the  recon- 
struction of  D*  vertex  is  difficult,  owing  to  low  momentum  of  the  pion  and  the 
small  opening  angle  in  D*t  —  »  D°T+,  the  D°  vertex  is  used  instead  for  decay  length 
and  time  information.  Thus  the  total  decay  length  is  the  sum  of  B°  and  D°  decay 
lengths.  To  determin  B°  boost,  MC  modeling  is  required.  The  unlike  sign  pair  of 
(D*  ,  I)  signals  mixing. 


Ntike(t)  -  Nunlike(t)  __  jfunmi"d(i)  -  Nmi*ed(i) 

jy-Kfcejt)  +  jy«nUfc«(j)   ~  jy«nnu««<i(t)  +  jVmi*«d(i) 

In  an  ideal  situation 

RtD(i]  <*  cos  Ami 

However  in  the  real  situation  oscillation  amplitude  is  damped  due  to  backgrounds 
and  detector  resolution  effects.  The  various  effects  are  illustrated  in  figure  13  where 
study  is  made  as  a  function  of  decay  length.  The  charge  asymmetry  is  shown 
in  figure  14.  An  unbinned  maximum  likelihood  fit  demonstrates  that  the  time 
independent  fit  is  excluded  at  3.2<r  level  and  leads  to 

Am,  =  (3.44J3;£i8:S)  X  1(T4  eV. 
Using  their  measured  TBd  =  1.44:±:0.15ps  they  determine 

_  n   .TK+0.15+0.08 

=  0.75I0.14I0  05 

(t,t)  Analysis 

Both  ALEPH  and  DELPHI  have  tried  this  analysis  where  the  latter  analysis  is  very 
preliminary.  In  case  of  ALEPH  the  event  topology  defines  the  secondary  vertex  and 
the  lepton  serves  to  tag  the  charge  as  well  as  the  purity.  The  energy  flow  analysis 
is  used  to  extract  the  boost  for  the  secondary  vertex  and  hence  extract  the  proper 
time  using  the  intersection  of  a  bunch  of  tracks  and  the  lepton.  The  like  sign 
dilepton  is  studied  as  a  function  of  time.  In  an  ideal  situation, 

Ru(t)  a  (1  -  cos  Ami) 

However,  due  to  background  and  resolution  effects,  the  amplitude  is  damped.  The 
result  of  ALEPH  analysis  is  shown  in  figure  15.  The  time  independent  fit  is  excluded 
at  99%  CL.  They  extract 

Am*  =  (3.5  j£J  ±  0.7)xl(T4eV 

which  leads  to  x«j  =  0.76  ±  0.10  ±  0.16  consistent  with  the  previous  measurement, 
but  with  improved  error.  Further,  these  results  are  also  consistent  with  the  results 
6om  ARGUS  (xd  ~  0.72  ±  0.15)  [46]  and  CLEO  (xd  -  0.65  ±  0.10)  [47].  It  is 
hoped  that  this  kind  of  study  will  be  able  to  reveal  the  second  component  of  the 
oscillation  with  increase  in  statistics  and  should  provide  the  measurement  of  x,  for 
B8. 
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Figure  13:  Detector  resolution  and  background  effects  on  the  oscillation  amplitude. 
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Figure  14:  Determination  of  oscillation  frequency,  x*  =  ^fr*  by  ALEPH.  The  time 
independent  alternative  is  shown  by  dashed  line  (left).  Schematic  illustration  of  the 
event  topology  for  the  study  of  time  dependent  mixing  using  (D*,£)  events  (right). 
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Figure  15:  Determination  of  oscillation  frequency  (left).  Schematic  illustration  of 
the  event  toplogy  for  the  study  of  time  dependent  mixing  using  dilepton  events 
(tight). 


Prospects  for  Ba  Oscillation 

The  possibilities  to  see  B,  oscillation  in  fully  identified  events  are  not  veiy  exciting. 
It  is,  however,  expected  that  with  increased  statistics  in  (I,  t)  events,  it  should  be 
possible  to  see  the  second  fast  component  in  the  time  dependent  analysis  in  near 
future.  In  the  meantime,  to  our  satisfaction,  ALEPH  [31]  has  been  lucky  to  see 
one  fully  reconstructed  event,  out  of  the  two,  that  exhibits  even  oscillation.  This 
event  is  shown  in  figure  16  with  zoomed  up  view  of  the  vertex  region.  This  is  a  low 

tttttm  MB*  1*0  14:15:12  b)r  HtfiVHUtAKMvi*  D*U_D1. 


Figure  16:  B,  oscillation  event  seen  by  ALEPH. 


multiplicity  event  where  a  K+  (us)  is  seen  from  the  primary  vertex.  The  particle 
reconstructed  at  the  secondary  vertex  using  (D+(cl),  ir~)  is  Bs(bs).  This  is  possible 
only  if  the  Bs{bs)  which  was  produced  at  primary  vertex  in  association  with  K+ 
has  oscillated  to  Bs  by  the  time  it  decays.  Since  there  are  no  other  tracks  other 
than  accounted  for,  any  other  explanation  is  unlikely. 


o     interesting  o 

B  decay  characteristics  have  been  studied  both  at  low  energies  as  well  as  at  LEP. 
Significant  contributions,  specially  from  LEP,  are  in  the  case  of  Br(B  — >  r  -f  X)  and 
Br(B  — v  Xc  +  X).  We  discuss  here  an  example  of  Xc  mode.  This  decay  mode  is  inter- 
esting to  understand  the  colour  suppression  mechanism  [48].  Moreover  a  knowledge 
of  the  branching  ratio  is  important  for  optimising  CP  violation  experiments. 
For  the  first  time  at  LEP,  L3  [49]  has  exploited  its  high  resolution  photon  detec- 
tion to  extract  Xc  signal  from  its  decay  to  (J/V>,7)  where  J/V"  is  reconstructed  from 
dimuons.  The  mass  difference 

AM  =  M(J/V>,7)  -  M(J/VO  is  used  to  demonstrate  the  signal,  shown  in  figure  17. 
Assuming  all  to  be  Xci  &*  Xco,a  are  suppressed,  they  determine  Br(B  — *  Xci  +  X) 
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Figure  17: 
using  MC. 


signal  seen  by  the  L3  detector.  The  background  shape  is  estimated 


=  (2.4±  0.9±  0.2)  x  10~~2.  The  only  other  measurement  for  this  mode  comes  from 
low  energy  experiment,  ARGUS  [50].  Their  value  being  Br(B  — •»  Xci  +  X) 
=  (1.23  ±  0.41  ±  0.29)  x  10~3.  For  any  realistic  use  these  numbers  have  to  be  im- 
proved. With  more  and  more  data  being  collected,  LEP  experiments  are  likely  to 
improve  this  result  considerably. 
Studies  of  rare  decays  are  being  persued  vigorously  at  LEP,  however,  the  excite- 
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7     Conclusions 

LEP  has  contributed  in  an  impressive  way  to  the  B  physics  sector.  Notable  among 
the  results  are: 

«  Using  three  different  techniques  for  b  tagging,  namely  the  lepton  tag,  the 
eventshape  tag  using  neural  networks,  and  the  lifetime  tag,  Rb  =  Fb/Fh&d  has 
been  measured.  The  average  of  the  three  methods  is  Rb  =  0.2207  db  0.0027. 
Compared  to  this  value  the  SM  prediction  for  a  top  mass  of  160  GeV  is 
=0.2162.  If  the  systematic  effects  from  QCD  considerations  are  understood 
and  the  overall  errors  could  be  brought  down,  this  may  well  be  an  interesting 
result  in  near  future  for  new  physics. 

*  The  Z  peak  forward  backward  charge  asymmetry  of  b  quark  is  measured  to 
be 

A£,  =  0.092  ±0.006. 

This  leads  to  sin20iv  =  0.2322  ±  0.0011  after  QED  and  QCD  corrections 
are  applied.  We  believe  that  QCD  effects  need  to  be  better  understood, 
in  view  of  the  fact  that  the  event  thrust  axis  is  used  to  approximate  the  b 
quark  direction.  The  sensitivity  of  the  angular  measurement  depends  upon 
the  hadronisation  effects,  missing  neutrino  in  case  of  leptonic  tag,  as  well  as 
detector  energy  and  angular  resolution  where  usually  calorimetry  plays  a  vital 
role. 

«  Impressive  progress  has  been  made  in  the  lifetime  measurements  for  b  hadrons, 
both  inclusive  as  well  as  exclusive.  For  the  inclusive  measurements 

rb  =  1.53  ±0.03  ps. 
The  exclusive  lifetimes  follow  the  trend: 

rB+   >  "Ta»   «  7fi°    >  TAb. 

•  d 

Some  of  the  exclusive  lifetimes  have  been  measured  at  LEP  for  the  first  time. 

•  B°-B°  oscillation/mixing  parameter  has  been  measured  with  better  precision. 
The  LEP  average  x&  ~  0.115  ±0.011  is  essentially  the  world  average.    The 
time  differential  measurement  of  B°-B°  oscillation  has  been  demonstrated 
only  at  LEP.  Both  (D°,£)  and  (1,1)  correlations  have  been  used  for  time 
dependent  analysis  using  silicon  microvertex  detector.     The  (£,  I)  analysis 
seems  to  be  promising  to  extract  the  second  oscillation  frequency  for  Ba, 
apart  from  that  observed  for  Bd  currently. 

®  Several  branching  ratios  have  been  measured,  including  (B  — *  r,  X)  and 
(B-*Xc,X). 

In  short  LEP  has  produced  plenty  of  exciting  B  physics  and  much  more  is  expected 
in  near  future. 

Pramaaa-  J.  Phys.,  Supplement  Issue,  1995  29 


I  am  very  much  indebted  to  Maneesh  Wadhwa  for  his  untiring  help  during  the 
preparation  of  this  talk  and  many  stimulating  discussions  and  suggestions.  I  am 
also  grateful  to  the  organisers  of  the  workshop  for  providing  me  local  hospitality 
and  partial  travel  support. 

References 


[1]     M.  Consoli,  W.  EoUik  and  F.  Jegerlehner,  Z  Physics  at  LEP,  CERN  report:  CERN 
89-08,  Vol  1.,  p  7.  Eds:  G.  Altareffl  etal. 

[2]     T.  Aziz,  Some  interesting  approximations  of  AJTB  on  the  Z°  peak,  TIFR-EHEP 
90/6,  November  1990. 
L3  Collaboration,  Phys.  Lett.  B252,  713  (1990). 

[3]     T.  Aziz,  Rb  =  rb/rkid  as  a  measure  of  Z  -+  bb  vertex  correction,  TIFR-EHEP, 
90/7,  December,  1990. 
T.  A»is,  Heavy  Flavour  Physics  Symposium,  Montreal,  July,  1993. 

[4]     F.  Boudjema,  A.  Djouadi  and  C.  Veraegnassi,  Phys.  Lett.  B238,  423  (1990); 
F.  M.  Renard  and  C.  Verzegnassi,  Phys.  Lett.  260,  225  (1991). 

[5]     M.  Boulware  and  D.  Finnel,  Phys.  Rev.  D44,  2054  (1991). 

Djouadi,  G.  Girardi,  W.  Hollik,  F.  M.  Renaid  and  C.  Ventegnassi,  High  Luminosity 
at  LEP,  CERN  Yellow  report,  CERN  91-02/91  1991,  pp.  135.  Eds:  E.  Blucher  et 

al 

[6]     ALEPH  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 
[7]     DELPHI  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 
[8]     L3  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[9]     OPAL  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993;  CERN, 
PPE/93-46,  193. 

[10]  ALEPH  Colaboration,  Phys.  Lett.  BS1S  (1993)  549. 

[11]  DELPHI  CoUaboration,  Phys.  Lett.  B205  (1992)  383. 

[12]  L3  Collaboration,  Phys.  Lett.  BS07  (1993)  237. 

[13]  ALEPH  CoUaboration,  Phys.  Lett.  BS1S  (1993)  535. 

[14]  DELPHI  CoUaboration,  Contributed  to  EPS  Conference,  MarseiUe,  July  1993. 

[15]  OPAL  CoUaboration,  CERN,  PPE/93-155  (1993). 

[16]  ALEPH  CoUaboration,  Contributed  to  EPS  Conference,  MarseiUe,  July  1993. 

[17]  DELPHI  CoUaboration,  Contributed  to  EPS  Conference,  MarseiUe,  July  1993. 

[18]  L3  CoUaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[19]  OPAL  CoUaboration,  CERN,  PPE/93-78. 

[20]  ALEPH  CoUaboration,  Contributed  to  EPS  Conference,  MarseiUe,  July  1993. 

[21]  DELPHI  CoUaboration,  Contributed  to  EPS  Conference,  MarseiUe,  July  1993. 

[22]  D.  Karlen,  "B  lifetimes"  Heavy  Physics  Symposium,  Montreal,  July  1993. 

[23]     ALEPH  CoUaboration,  Phys.  Lett.  B266  (1992)  174 
ALEPH  CoUaboration,  CERN,  PPE/93-116. 

30  Pramana-  J.  Phys.,  Supplement  Issue,  1095 


[24]     DELPHI  Collaboration,  Contributed  to  LEPTON-PHOTON  Symposium  ,  Cornell, 
August  1993;  CERN,  PPE/93-80. 

[25]     L3  Collaboration,  CERN,  PPE/93-158. 
[26]     OPAL  Collaboration,  CERN,  PPE/93-92. 

[27]     ALEPH  Collaboration,  CERN,  PPE/93-42. 

ALEPH  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[28]     DELPHI  Collaboration,  Contributed  to  LEPTON-PHOTON  Symposium  ,  Cornell, 
August  1993;  Z.  Phys.  C57  (1993)  81. 

[29]     OPAL  Collaboration,  Contributed  to  LEPTON-PHOTON  Symposium  ,  Cornell, 
August  1993. 

[30]     CDF  Collaboration,  Contributed  to  LEPTON-PHOTON  Symposium  ,  Cornell,  Au- 
gust 1993. 

[31]     ALEPH  Collaboration,  CERN,  PPE/93-214. 
[32]     DELPHI  Collaboration,  CERN,  PPE/93-176. 
[33]     OPAL  Collaboration,  CERN,  PPE/93-95. 

[34]     ALEPH  Collaboration,  Phys.  Lett.  B207  (1992)  449. 

ALEPH  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[35]     DELPHI  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 
[36]     OPAL  Collaboration,  CERN,  PPE/93-132. 

[37]     K.  V.  L.  Sarma,  TIFR/TH/94-04;  A.  Ali  and  D.  London,  DESY-92-057  (1992)  and 
A.  Ali,  DESY-92-152  (1992). 

[38]  R.  Forty,  CERN  PPE/93-165. 

[39]  ALEPH  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[40]  DELPHI  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[41]  OPAL  Collaboration,  CERN,  PPE/93-106. 

[42]  L3  Collaboration,  Contributed  to  EPS  Conference,  Marseille,  July  1993. 

[43]  W.  Venus,  Review  Talk,  LEPTON-PHOTON  Symposium,  Cornell,  August  1993. 

[44]  ALEPH  Collaboration,  CERN,  PPE/93-99. 

[45]  ALEPH  Collaboration,  CERN,  PPE/93-105 

[46]     ARGUS  Collaboration,  Z.  Phys.  C55  (1992)  363;  EPS  Conference,  Marseille,  July 
1993. 

[47]     CLEO  Collaboration,  Heavy  Flavour  Physics  Symposium,  Montreal,  July  1993. 

[48]     J.  H.  Kuhn  and  R.  Rucld,  Phys.  Lett.  Bl*5  (1984)  477. 
P.  H.  Cox  et  al.  Phys.  Rev.  DS2  (1985)  1157. 

[49]     L3  Collaboration,  Phys.    Lett.    B$17  (1993)  467;  M.  Wadhwa,  Ph.    D.  Thesis, 
LAPP,  Annecy  (1993). 

[50]     ARGUS  Collaboration,  Phys.  Lett.  B277  (1992)  209. 

[51]     "Rare  B  decays",  Heavy  Flavour  Phsyics  Symposium,  Montreal,  July  1993. 


pp. 


Limits  on  3  boson  vertex  parameters  from  LEP200 

S.  BANERJEE 

Tata  Institute  of  Fundamental  Research,  Bombay  400  005 

Abstract.  LEP  will  be  operating  at  cm  energies  above  W  pair  threshold  from  1996.  The 
process  e+e~  —  »•  W^W"  will  provide  a  unique  opportunity  to  test  some  important  aspects 
of  the  Standard  Model.  The  methodology  of  studying  this  process  has  been  reviewed  in 
this  report.  The  study  of  the  process  will  probe  at  the  triple  vector  boson  coupling.  The 
sensitivity  to  anomalous  coupling  of  W  boson  has  been  discussed  in  detail. 


1.  Introduction 

One  of  the  main  motivations  of  LEP200  is  a  study  of  the  process  e+e~  —  »•  W+W~. 
The  reaction  e+e~  —  +  W+W~  has  contributions  from  ?/  exchange  in  the  t-channel 
and  7,  Z  exchange  in  the  s-channel.  Of  these,  the  v  exchange  gives  the  largest 
contribution  to  the  amplitude.  In  the  Standard  Model  [1]  large  cancellation  takes 
place  between  the  various  amplitudes  and  the  resultant  cross  section  is  small  and 
well  behaved  (see  figure  1)  at  high  energies.  As  a  result,  small  deviation  from  the 
Standard  Model  will  show  up  in  this  process  and  can  be  studied  at  LEP200.  The 
physics  possibilities  and  limitation  have  been  studied  in  several  earlier  reviews  [2,  3] 
and  some  of  those  results  will  be  highlighted  here. 

The  main  importance  of  studying  the  process  e+e~  —  >•  W+W~  lies  in  the  7\VW 
and  ZWW  couplings  (see  figure  2). 

The  interaction  vertex  is  given  by  j'gwwvTv  (q,  q,  p),  where  V  is  the  vector  boson 
7  or  Z. 

—  e;  gwwz   =  —  ecot#w 


involves  2x7  independent  form  factors  f?  and  f^.  The  three  form  factors  fv, 
are  related  to  magnetic  and  electric  dipole  moments  of  W.  One  can  write 


=     1  + 


2mw2 
\_. 

2 


f2v     =     AV 


=      1  +  KV  +  AV 

e  (1  +«7  +  A7) 


Qw     = 


2mw 
mw2 


The  other  form  factors  are  restricted  by  C,  P  and  CP  invariance 


80  90  100  110 

Eb  (GeV) 


Figure  1.  Total  cross  section  for  e+e~  -*•  W+W~  in  the  Standard  Model  as  a  functioi 
of  beam  energy  for  m\v  =  82  GeV 


Figure  2.  The  three  boson  coupling  Z(7)WW 
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CP  invaraince  =>     f4v  =  f6v  =  f  f  =  0 

C  and  P  invariance 


In  the  Standard  Model,  one  has  at  the  lowest  order 

Ky    =    Kl    —    I 

A7  =  Az  =  0 
f-r.z       _  A 

M,  5,6,7    —    u 

So  to  test  the  validity  of  the  Standard  Model,  one  needs  to  measure 


gwwz 
fz 

4,5,6,7 

In  the  various  study  groups  [4],  the  following  parameters  have  been  studied  : 
^gwwz,  A  (with  A7  =  AZ),  /CT  and  KZ. 

2.  Observables 

Experimentally,  one  can  make  the  following  measurements  : 

•  Total  cross  section 

«  Production  angle  of  W+  (W~) 

•  Decay  angular  distribution  of  W+  and  W~ 

As  seen  in  figure  1,  the  W  pair  cross  section  is  rather  small  (~  20  pb)  compared 
to  typical  LEP100  cross  sections.  It  is  approximately  flat  over  the  energy  range  of 
interest  (at  cm  energies  above  175  GeV).  One  can  expect  to  observe  about  4000 
events  per  year  in  one  experiment  if  the  LEP  luminosity  reaches  2x  1031  cm~2sec~1. 

In  the  standard  Model,  the  W  couples  universally  to  quarks  and  leptons.  One 
can  write  down  [2] 

rw  =  r(w  -  M  3 


This  leads  to  8.9%  of  W's  decaying  to  each  of  the  Icptonic  channels  (ei/,  >rtv  or  TV] 
and  the  remaining  73.3%  decaying  to  qq'  system  (ud  or  cs).  As  a  result,  out  of  all 
the  W+W~  events,  one  will  have  the  following  final  states 

54%  4  jets  (pure  hadronic) 

39%  jiJ2^^  (semi  leptonic) 

7%  t\  vi  £2  ^2  (p^re  leptonic) 

For  the  semileptonic  final  state  (ji  J2  ^  ^),  one  can  make  the  following  observa- 
tions 


•  ji  J2  wiH  determine  the  folded  direction  of  W 


•  t  determines  the  charge  of  W  unambiguously.  So  one  can  study  product 
angle  of  W+  (W~). 

•  One  can  study  the  complete  decay  angular  distribution  of  the  W  which  dec 

leptonically 

•  One  can  also  study  folded  decay  angular  distribution  of  the  W  deca? 
hadronically.    If  tagging  of  the  jet  charge  is  possible,  a  study  of  comp 
decay  angular  distribution  from  W  decay  can  be  done. 

For  pure  hadronic  final  states  (J!  j2  js  34),  the  jets  determine  W  direct 
However,  one  needs  some  measure  of  jet  charge  to  decide  about  the  charge  of  W 
in  principle,  one  can  always  study  folded  production  angle  of  W+  (W~).  Simil 
one  can  study  the  folded  decay  angular  distribution  of  Ws. 

For  pure  leptonic  final  states  (£\  v\  t^  1/2)5  the  momenta  of  the  two  z/'s  cai 
calculated  within  a  twofold  ambiguity  if  W  width  and  photons  from  initial  s 
radiation  are  ignored.  It  is  nevertheless  a  very  clean  sample.  But  it  is  difficu] 
obtain  the  production  or  decay  angular  distribution  uniquely. 

Production  of  transversely  polarized  W  (A  =  ±1)  dominates  over  that  of 
gitudinally  polarized  W  (A  =  0)  for  all  values  of  production  angle.  Howevei 
the  backward  region,  the  dominant  AA  =  —2  amplitude  from  v  exchange  vanis 
So  this  region  is  more  sensitive  in  measuring  the  gauge  couplings.  Decay  ang 
distribution  of  W  depends  on  W  polarization.  So  a  study  of  the  decay  distribu 
helps  in  separating  contributions  from  different  W  helicity  components. 


ex 
b 


200   240   280   320   360  400   440   480 

Energy  (GeV) 

Figure  3.  Difference  between  the  SM  and  non-SM  total  cross  section  for  e+e~  — t-  W 
as  a.  function  of  cm  energy  [4] 

Figure  3  shows  a  plot  of  deviation  of  total  cross  section  of  e+e~  — »•  W1 
from  Standard  Model  value  as  a  function  of  cm  energy  for  two  sets  of  anom 
3  boson  coupling  (*7  =  KZ  =  0.2  and  A7  =  Az  =  0.2).  The  figure  clearly  s 
that  large  deviation  from  the  Standard  Model  is  seen  only  at  very  high  cm  en 


so  L,£,r2UU  operating  near  the  WTW      tnresnold  will  not  be  able  to  detect  any 
significant  deviation  from  the  Standard  Model  from  this  measurement. 

Figure  4  shows  distribution  of  production  angle  of  the  W's  as  expected  in  the 
Standard  Model  and  also  for  some  non-standard  couplings  of  the  3  boson  system. 
One  sees  large  deviation  from  the  Standard  Model  only  in  the  very  backward  di- 
rections. So  to  be  sensitive  to  anomalous  W  couplings,  the  detectors  should  have  a 
large  angular  acceptance  and  also  high  precision  in  the  forward  backward  direction. 


Figure  4.   Effect  of  the  angular  distribution  for  W- production  due  to  different  values  of 
the  parameters  «-,,  KZ,  A7,  AZ  [3] 


3.  Event  Classification  and  W  Reconstruction 

It  is  necessary  to  classify  the  W  pair  events  into  three  different  categories  (1) 
four  jet  events  (ji  J2  ja  J4);  (2)  two  jet  plus  two  lepton  events  (ji  J2  t  v)  and  (3) 
four  lepton  events  (^i  v\  ti  1/2)  from  the  general  event  characteristics  and  to  select 
these  events  from  a  background  of  other  physics  events  and  noise.  A  first  attempt 
in  this  direction  has  been  carried  out  in  [5]  by  looking  into  fully  simulated  and 
reconstructed  events  from  W-pair  generator  GENTLE  [6]. 

Leptcnic  decays  of  W  give  rise  to  events  with  low  multiplicity  and  sizeable 
missing  energy.On  the  other  hand,  hadronic  decays  of  W  lead  to  high  multiplicity 


vift-    T      T*lr 


nt  T 


events  with  small  missing  energy.  So  global  event  shape  variables  like  visible  energy, 
energy  imbalance,  total  multiplicity  would  be  useful  variables  for  event  classification 
alongwith  particle  identification  capabilities. 

Four  jet  events  are  classified  by  selecting  events  with  high  multiplicity  (N  > 
45),  large  visible  energy  (Evj8/Vs  >  0.72),  small  energy  imbalance  in  the  transverse 
direction  (Ej./Evi8  <  0.25).  Leptonic  decays  of  W  to  ev  or  fiv  are  removed  by 
rejecting  events  with  electron  candidates  with  energy  above  30  GeV  or  with  muon 
candidates  with  momentum  above  20  GeV.  This  set  of  criteria  leads  to  an  efficiency 
of  (94±1)%  with  (98.9±0.5)%  purity. 

Four  lepton  events  are  classified  by  selecting  events  with  low  multiplicity  (N  < 
12)  and  the  presence  of  two  high  energy  leptons  (electrons  or  muons).  High  energy 
electrons  are  selected  with  energy  above  25  GeV  and  muons  with  momentum  above 
20  GeV.  The  selection  efficiency  and  purity  for  the  three  channels  ei/ei/,  /u//"', 
as  obtained  in  [5]  are  summarized  in  table  1. 


Table  1.  Selection  efficiency  and  purity  of  the  four  lepton  final  state 


Efficiency 

Purity 

ei/ef 
fiffii/ 

ZV\1V 

(79±11)% 
(95±5)% 
(62±13)% 

(92±8)% 
(95±5)% 
(83±15)% 

For  events  with  2  jets  and  2  leptons,  one  uses  the  following  criteria  :  (1)  moderate 
multiplicity  (12  <  N  <  60),  (2)  large  missing  energy  (Ej_/Evis  >  0.1-0.15),  (3) 
moderate  visible  energy  (Evis/\/s  <  0.88).  In  addition,  separate  selection  criteria 
(on  energy  of  identified  particles)  were  applied  to  classify  these  into  electron,  muon 
or  tau  category.  The  resulting  efficiency  and  purity  of  the  samples  are  summarized 
in  table  2. 

Table  2.   Selection  efficiency  and  purity  of  ji  J2  t  v  final  state 


Efficiency 

Purity 

JiJ2e^ 

JlJ2/«> 
J1J27-V 

(74±4)% 
(69±4)% 
(80±3)% 

(87±3)% 
(94±2)% 
(60±3)% 

The  stable  leptons/hadrons  are  detected  in  the  detector.  For  W's  decaying 
hadronically,  one  needs  to  reconstruct  jets.  For  both  W's  decaying  hadronically, 
the  number  of  jets  to  be  reconstructed  is  4.  For  only  one  W  decaying  hadronically, 
the  number  of  jets  to  be  reconstructed  is  2  (excluding  the  lepton  when  associated 
with  e//x)  or  3  (including  the  lepton  when  associated  with  T).  Several  jet  recon- 
struction algorithms  [7,  8]  have  been  tried  out  [9].  In  most  of  these  algorithms 


are  combined  into  clusters  until  only  four  clusters  are  left.  The  difference  in  these 
algorithms  lie  in  the  choice  of  the  closeness  variable  yjj  as  summarized  in  table  3. 


Table  3.  Closeness  variable  yij  for  various  jet  algorithms 


Algorithm 


yy 


JADE 
Durham 
Geneva 
Lund 


2Mm(Ep,Ep(l-cos0ij)/Evi82 

SEiEjCl-cos^ygtEi  +  Ej)2 

2|pi||pj|sin(0ij/2)/iPi  +  Pjl 


In  generalized  thrust  approach  [8],  one  defines  the  generalized  thrust  as 


™  v 

=  max 


by  dividing  the  set  of  particles  into  4  groups.  T^  is  maximized  by  regrouping 
particles  and  finding  the  four  jet  axes  from  the  particle  4-momenta  in  each  group. 
Technically  this  is  achieved  by 

•  Defining  event  plane  according  to  the  2  largest  eigenvectors  of  the  momentum 
tensor 

•  Maximizing  TS  on  the  event  plane  by  dividing  the  particles  in  zones  of  <j>  angle 

•  Removing  the  nicest  of  the  three  jets  (one  with  the  largest  thrust) 

•  Boosting  the  rest  of  the  particles  in  the  reduced  cm  system 

•  Applying  the  same  3  jet  algorithm  to  divide  these  particles  to  a  set  of  3  jets 

•  Checking  if  the  particles  in  each  of  the  jet  has  the  largest  py  with  respect  to 
the  jet.  If  not,  reclassification  is  required. 

Jet  energies  are  recalculated  using  the  jet  angles  as  measured,  assuming  0  parton 
mass  and  energy  momentum  conservation. 

The  energy  and  angular  resolution  of  the  jets,  thus  obtained,  are  summarized 
in  table  4. 

This  study  suggests  that  Durham  or  Lund  algorithm  is  most  suitable  for  jet 
reconstruction. 

Further  constraint  can  be  put  by  imposing  energy-momentum  conservation  in 
the  W-pair  production  process.  These  conservation  relations  as  well  as  the  mass 
shell  criteria  for  the  W's  can  be  written  down  as 

0 


(Pji  +Pj3+P3+P4(+P-r))    ~ 


0 
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Reconstruction 
method 

<TB 
(GeV) 

v» 
(rad) 

0>i 
(rad) 

JADE 

6.13 

0.113 

0.121 

Durham 

5.96 

0.110 

0.118 

Geneva 

6.16 

0.115 

0.126 

Lund 

5.58 

0.107 

0.115 

Generalized 

8.50 

0.143 

0.145 

Thrust 

(Pji  +Pj2)2  - 
(P3  +  P4)2  - 


=  fs 
=  f6 


p3  =  pjs,  p4  =  Pj4  for  four  jet  final  states  and  ps  =  p/;,  p4  =  pv  for 
state.  For  perfect  measurements,  one  should  have 

fj   =  0 

However,  because  of  detector  resolution  and  acceptance,  the  constraint  relations  are 
strictly  speaking  not  satisfied.  Energy  and  direction  of  the  jets  and  the  charged  lep- 
tons  are  measured  with  precision  determined  by  detector  resolution  and  acceptance. 
After  mw,  FW  measurement,  mw!,  mw2  would  be  known  with  some  precision.  For 
ji  J2  f-v  final  state,  pv  is  unmeasured.  All  these  are  taken  care  of  by  using  the 
method  of  Lagrangian  multiplier.  Here  one  minimizes  x2  of  deviation  of  measured 
qualities  in  an  iterative  way. 


x2  = 


The  minimum  x2  should  occur  for  fj  =  0  so  that  the  choice  of  multipliers  £j  is 
arbitary.  Convergence  of  the  fits  are  decided  by 

•  the  maximum  constraint  value  after  fit  (e.g.  |fj|  <  10) 

•  X2  to  be  small  after  fit  and  change  of  x2  in  iteration  to  be  also  small 

The  effect  of  the  kinematic  fit  [10]  is  summarized  in  table  5  and  figure  5  where 
one  shows  the  resolutions  in  production  and  decay  angles  of  the  W's  before  and 
after  the  fit. 
The  fit  improves  resolution  in  all  measured  distributions. 

DELPHI  [4]  has  proposed  to  use  momentum  weighted  charge  of  the  decay  prod- 
ucts to  measure  the  charge  of  the  W. 
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Figure  5.  Prodxiction  (a-b)  and  decay  6  (c-d),  (f>  (e-f)  angular  distribution  of  W  before 
and  after  the  kinematic  fits 


Table  5.  Resolution  in  production  and  decay  angular  distribution  of  W  before  a 
the  kinematic  fit 


Before  fit 

After  fit 

cos#\v 

COS#d 

<£d  (rad) 

0.0134 
0.0181 
0.034 

0.0108 
0.0103 
0.021 

-10  0 

Momentum— weighted  charge 


Figure  6.  Distribution  of  momentum  weighted  charges  C_  (the  left  one)  and 
right  one)  for  jets  assigned  to  W~  and  W+  respectively 
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LUC  cvciiis. 


4.  Determination  of  the  3  Boson  Couplings 

DELPHI  [4]  has  studied  the  limits  which  can  be  realistically  set  to  the  pa- 
rameters of  the  3  boson  coupling  by  their  detector.  They  have  generated  events 
with  complete  detector  simulation.  After  the  jet  reconstruction  and  kinematic  fit- 
ting, they  correct  the  observed  W  production  angular  distribution  from  detector 
effects  by  deconvoluting  the  observed  distribution  with  a  binned  smearing  matrix 

W/-rvs  /scon   __„  aob\ 

f  ^COS  (7yy     ,  COS)  t/yy  ) 


d  cos  0 


•j 


_  i  d  cos 


F(cos  ^r,  cos  <)  -  e (cos  0#)  •  d  cos  0 


where  e(cos  6$)  represents  the  efficiency  of  the  selection  criteria  for  the  appropriate 
final  state.  For  4  jet  final  states,  they  apply  the  jet  charge  tagging  technique  to 
distinguish  between  W+  and  W~~. 

The  corrected  distribution  is  then  fitted  to  the  theory  by  varying  each  of  the 
parameters  (5gwwZ)  A,  ^7  and  KZ)  in  turn  for  500  pb"1  data.  The  results  obtained 
for  two  cm  energies  of  176  and  190  GeV  are  summarized  in  table  6. 


Table  6.   Precision  in  measuring  anomalous  3  boson  coupling  constant  for  single  param- 
eter fit 


v/s~  =  176  GeV 

Vs"  =  190  GeV 

<$gwwz 
A 

K7 

KZ 

0.051  ±0.151 
0.034  ±0.095 
1.07  ±0.20 
1.05  ±0.16 

-0.166  ±0.085 
-0.095  ±0.048 
0.86  ±0.09 
0.85  ±0.08 

Thus  the  anomalous  coupling  can  be  measured  with  a  precision  of  0.1-0.2  for 
LEP200  operating  at  a  cm  energy  of  176  GeV.  The  precision  is  improved  by  a  factor 
of  2  by  going  to  a  higher  cm  energy  of  190  GeV. 

Alternately,  one  can  fit  to  several  parameters  simultaneously.  This  will  give  rise 
to  contours  at  95%  confidence  level.  Figure  7  shows  such  contours  for  £gwwz  vs  A 
for  a  two  parameter  fit  to  generated  and  reconstructed  angular  distributions  from 
J1J2&/  events. 
One  can  draw  the  following  conclusions  from  these  fits 

•  The  constraint  on  A  is  tight  and  is  not  too  dependent  on  the  values  of  other 
parameters 
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Figure  7.  95%  Confidence  level  contour  obtained  for  6gwwz  and  A  from  generated  and 
reconstructed  production  angular  distributions  using  jijs^  events  [4] 


•  The  jet  charge  tagging  improves  the  limits  significantly 

•  One  obtains  better  precision  for  the  anomalous  couplings  by  going  to  higher 
cm  energies 

As  has  been  mentioned  earlier,  decay  angles  can  be  unambiguously  determined 
for.leptonic  decay  of  W's  in  jijitv  event  sample.  For  4  jet  final  state,  one  can  obtain 
folded  distribution  of  decay  angular  distributions.  The  charge  tagging  technique 
does  not  work  very  well  for  individual  jets.  Following  the  suggestions  of  Hagiwara 
et  al.  [2],  one  can  parametrize  the  triple  differential  cross  section  in  terms  of  a  com- 
plete set  of  known  orthogonal  functions  Li's,  which  are  used  to  produce  projections 
Fj  of  the  W+  and  W~  production  cross  sections  respectively. 


d  cos  0  w  d  cos  #d  d<£ 


t=l 


One  can  now  examine  a  set  of  linear  combinations  of  F^  and  F-,    to  put  limits  to 

nonstandard  3  boson  parameters.  DELPHI  [4]  have  examined  the  effect  of  including 

the  decay  angular  distributions  to  the  parameters  (see  figure  8). 

It  is  obvious  from  the  figure  that,  the  decay  angular  distributions  improve  the  limit 

on  5g\vwz  or  ^  substantially.  The  parameter  space  is  more  restricted  at  higher  cm 

energies. 


5.   Summary 

From  the  studies  made  so  far,  one  can  draw  the  follwing  conclusions  : 
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Figure  8.  95%  Confidence  level  contours  from  the  distributions  of  cos  ^w,  (cos#w, 
and  (cos#w,  cos^d,  ^d)  using  jiJ2^  events  [4] 


*  Techniques  have  been  developed  to  reconstruct  W  production  and  decay  an- 
gular distributions  from  the  jiJ2^  channel  as  well  as  the  jiJ2JaJ4  channel. 
The  use  of  4  jet  channel  has  the  potential  of  increasing  statistics  and  hence 
reducing  statistical  error. 

*  With  500  pb"1  integrated  luminosity  at  LEP200,  the  3  boson  coupling  param- 
eters can  be  measured  from  W  production  angle  with  a  precision  of  ±0.1-0.2 
at  176  GeV. 

*  Precision  of  the  coupling  paramter  determination  is  significantly  improved  by 
utilizing  the  decay  angular  distribution  of  W. 

*  The  precision  is  further  improved  by  operating  at  a  higher  energy.    Event 
selection  efficiency  as  well  as  angular  resolution  improves  with  LEP  energy. 
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Abstract.  Several  reasonably  model-independent  formulations  of  the  implications  of 
new  physics  for  precision  electroweak  measurements  have  been  developed  over  the  past 
years,  most  notably  by  Peskin  and  Takeuchi,  and  by  Altarelli  et  al.  These  formulations 
work  by  identifying  a  small,  but  useful,  set  of  parameters  through  which  new  physics  often 
enters  into  well-measured  physical  observables.  For  the  theories  to  which  such  an  analy- 
sis applies,  this  approach  greatly  streamlines  the  confrontation  with  the  data.  Since  the 
experimentally-allowed  range  for  these  parameters  has  been  determined  from  global  fits  to 
the  data,  theorists  need  only  compute  their  predictions  for  these  parameters  to  constrain 
their  models.  We  summarize  these  methods  here,  together  with  several  recent  general- 
izations which  permit  applications  to  wider  classes  of  new  physics,  and  which  include  the 
original  approaches  as  special  cases. 


1.  Introduction 

The  attainment  of  high  precision  in  measurements  of  Z-boson  properties  has  been 
perhaps  the  most  significant  experimental  result  in  high-energy  physics  over  the  last 
decade.  Besides  testing  the  Standard  Model  (SM)  to  high  precision,  experiments  at 
LEP  and  at  SLC  [1]  are  providing  the  first  experimental  winnowing  of  the  bumper 
crop  of  theories  that  hope  to  describe  the  physics  at  energies  well  above  100  GeV. 
Indeed,  the  data  on  the  Z  resonance  is  now  so  good  that  model  builders  ignore 
it  at  their  peril.  There  is  a  very  real  benefit  in  confronting  the  various  models 
of  current  theoretical  interest  with  the  constantly  improving  experimental  results. 
One  way  to  proceed  is  to  simply  compute  the  relevant  observables  explicitly  on  a 
model-by-model  basis,  and  to  fit  the  results  to  the  data  in  order  to  constrain  the 
model's  parameter  space.  Unfortunately,  this  is  a  time-consuming  procedure  and 
so  one  is  limited  in  the  number  of  models  which  can  be  treated  in  this  way.  For 
well-motivated  theories,  such  as  the  minimal  supersymmetric  generalizations  to  the 
standard  model  (MSSM)  for  instance,  such  detailed  calculations  may  be  worth  the 
effort  they  require,  although  even  here  it  is  impractical  to  explore  the  model's  entire 
parameter  space. 
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Happily,  there  is  another  way  to  proceed  which  can  substantially  reduce  the 
labour  that  is  required  to  confront  a  model  with  the  implications  of  the  data.  This 
alternative  is  based  on  the  realization  that  many  models  often  only  contribute  to 
deviations  from  the  SM  —  for  the  well-measured  observables  of  interest  —  in  a 
limited  number  of  ways.  For  instance,  this  could  happen  if  all  of  the  new  particles 
only  couple  to  the  presently-observed  ones  in  a  restricted  manner,  such  as  through 
the  exchange  of  electroweak  gauge  bosons.  Or  all  of  the  new  particles  could  be 
extremely  heavy.  In  either  case  it  is  typically  true  that  only  a  small  number  of 
independent  combinations  of  the  model's  coupling  constants  ever  appear  in  —  and 
so  are  well  constrained  by  —  the  observables  that  are  measured.  In  these  cases  it 
is  useful  to  confront  the  data  in  two  steps.  First,  one  can  parameterize  the  well- 
measured  observables  in  terms  of  a  few  independent  variables  which  can  then  be 
constrained,  once  and  for  all,  by  comparing  with  the  experiments.  Next,  constraints 
on  any  given  model  may  be  obtained  by  comparing  the  bounds  on  these  parameters 
with  the  model's  predictions  for  them  as  functions  of  its  underlying  couplings. 

Such  a  two-step  procedure  has  the  advantage  of  separating  the  statistical  fit  to 
the  data  from  model-dependent  calculations.  Since  the  data  can  be  fit,  once  and 
for  all,  to  a  general  set  of  parameters,  it  is  not  necessary  to  repeat  this  analysis 
separately  for  every  model.  Although,  in  principle,  the  permitted  parameter  space 
that  is  obtained  for  a  particular  theory  in  this  way  can  differ  somewhat  from  what 
would  be  obtained  by  a  direct  fit  to  the  model,  in  practice  the  two  procedures  turn 
out  to  give  constraints  which  are  essentially  equivalent.  Furthermore,  the  compar- 
ative model-independence  of  the  fit  .to  the  data  in  the  two-step  approach  permits 
an  efficient  comparison  of  many  models,  and  so  gives  a  reasonably  broad  picture  of 
the  kinds  of  new  physics  which  can  produce  deviations  in  different  observables. 

There  are  a  number  of  similar,  but  not  completely  equivalent,  examples  of  this 
type  of  reasoning  which  have  become  widely  used  in  the  literature  [2,  3,  4,  5,  6].  The 
most  widely  used  of  these  are  based  on  the  parameterizations  of  Peskin  et.al.  [2], 
and  of  Altarelli  et.al.  [5].  Both  of  these  formalisms  are  very  useful  for  describing  the 
implications  for  precision  electroweak  measurements  of  a  wide  class  of  new  physics. 
Neither  of  these  parameterizations  of  the  observables  can  encompass  all  models, 
however,  and  so  it  is  important  to  bear  in  mind  the  limits  to  their  applicability  when 
attempting  to  use  their  results  for  comparing  with  any  particular  theory.  Some 
well-motivated  models  and  observables  cannot  be  analysed  completely  within  the 
framework  of  either  approach,  motivating  their  extension  to  more  general  situations 
[7,  8].  The  purpose  of  this  review  is  to  describe  those  extensions  which  I  have  helped 
to  develop,  and  to  relate  these  extensions  to  earlier  approaches.  Some  applications 
of  the  resulting  techniques  are  also  described. 

This  article  is  organized  as  follows.  There  are  two  main  categories  of  new 
physics  that  have  been  considered  to  date:  those  which  dominantly  contribute  to 
observables  through  'oblique'  corrections,  and  those  for  which  the  new  physics  is 
heavy.  The  next  two  sections  are  devoted  to  describing  how  each  of  these  kinds 
of  corrections  can  contribute  to  electroweak  observables.  In  the  next  section  the 
case  of 'oblique'  corrections  is  considered,  while  heavy,  non-oblique,  new  physics  is 
considered  in  section  3.  Section  4  briefly  describes  a  number  of  the  applications  of 


Almost  all  of  the  observables  that  are  presently  amenable  to  accurate  measurement 
can  be  phrased  in  terms  of  the  two-particle  scattering  of  light  fermions.  This  is 
because  these  experiments  either  involve  the  scattering  of  two  quarks  or  leptons, 
or  the  decay  of  an  initial  fermion  into  three  lighter  ones.  There  are  three  ways  in 
which  new  physics  can  affect  such  experiments,  given  that  the  new  particles  are 
not  themselves  directly  produced.  It  can:  (a)  change  the  propagation  of  the  gauge 
bosons  that  can  be  exchanged  by  the  fermions;  (fe)  alter  the  three-point  fermion  - 
boson  couplings;  and  (c)  modify  the  four-point  direct  fermion  -  fermion  interactions 
(i.e.:  'box'-diagram  corrections). 

An  important  class  of  new-physics  models  contribute  dominantly  to  precision 
measurements  through  process  (a):  changes  to  the  vacuum  polarizations  of  the  elec- 
troweak  bosons.  Such  corrections  are  called  'oblique'  [9],  and  when  they  dominate 
they  imply  universal  modifications  to  light-particle  scattering,  in  the  sense  that 
the  changes  depend  only  on  the  electroweak  quantum  numbers  of  the  light  fermion 
involved.  Oblique  corrections  can  be  the  most  important  when  the  direct  couplings 
between  the  observed  light  fermions  and  any  new  particles  are  either  forbidden  or 
highly  suppressed. 

2.1  General  oblique  corrections 

The  effects  of  oblique  corrections  on  fermion  scattering  can  be  determined  by  ex- 
amining how  the  gauge  boson  vacuum  polarizations 

n;»(«)  =  n«6(g2)  iT  +  (« V  terms),  (1) 

(with  a,  b  —  7,  W,  Z)  appear  in  the  observables  of  interest  [9,  10].  The  contribution 
to  these  due  to  new  physics  we  denote  as  6Eab(q2),  so  the  full  vacuum  polarization 
is  given  by:  Kab(q*)  =  K™(q2)  +  Wab(q2). 

In  general,  each  of  the  8Uab(q2)  can  be  arbitrary  functions  of  g2,  and  so  the 
vacuum  polarizations  in  principle  contain  several  unknown  functions,  each  of  which 
can  potentially  enter  into  all  physical  observables.  Any  attempt  to  extract  general 
information  by  fitting  these  functions  to  the  data  might  therefore  seem  to  be  doomed 
because  of  the  large  number  of  unknown  quantities  in  comparison  to  the  amount  of 
data  that  is  available.  This  turns  out  to  be  too  pessimistic  a  view,  however,  because 
currently  accurate  measurements  are  performed  either  at  very  low  energies,  or  on 
the  Z  resonance:  i.e.  only  for  g2  =  m2  and  q2  «  0.  (g2  —  m2w  should  also  be. 
considered  to  the  extent  that  the  mass  and  width  of  the  W  boson  are  thought  to 
be  sufficiently  well  measured.)  As  a  result,  the  unknown  functions,  &Tlai,(q2),  are 
presently  only  accurately  sampled  at  these  few  values  of  four-momentum  transfer. 

This  restriction  —  that  precision  observables  only  probe  q2  w  0  and  q2  —  m| 
and  m2^  —  implies  that  all  oblique  corrections  to  electroweak  observables  can  be 
expressed  in  terms  of  six  independent  combinations  of  the  various  <5ITs  [7].  The 
counting  proceeds  as  follows. 

1.  Inspection  of  the  graphs  with  vacuum  polarization  insertions  shows  that,  a 
priori,  there  are  ten  quantities  to  consider.  For  neutral-current  processes  at 
q2  =  0  and  q2  =  m2,  these  are:  8Hri(q2)/q2t  8RZJ(q2}/q2  and  6Uzz(q2). 
(Notice  that  electromagnetic  gauge  invariance  ensures  that  both  SU-rf(q2)/q2 
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denoting  differentiation  with  respect  to  g  ,  also  contributes  at  q  —  mz. 
Finally,  charged-current  observables,  or  those  involving  physical  W  particles, 
involve  6nww(q2)  at  q2  =  0  and  m^,,  as  well  as  tfTI^^m^,). 

2.  Of  these  ten  possible  parameters,  three  combinations  can  never  lead  to  ob- 
servable deviations  from  the  SM,  since  they  can  be  absorbed  into  SM  renor- 
malizations.  For  instance,  they  can  be  absorbed  into  renormalizations  for  the 
electroweak  gauge  potentials,  W^  B^,  and  of  the  Higgs  vev,  {^).  This  reduces 
the  total  number  of  possible  oblique  parameters  to  seven. 

3.  Finally,  at  the   present   levels  of  accuracy,   new-physics   contributions  to 
<$II77(m|)  are  not  detectable.   This  is  because  this  term  contributes  purely 
through  photon  exchange,  which  is  not  resonantly  enhanced  when  g2  =  mz . 
As  a  result,  the  influence  of  <5TI77(m|)  is  suppressed  by  O(Tz/mz)  ~  0.03  in 
comparison  to  the  effects  of  the  Z-mediated  terms  ^^(mj.)  and  <$nzz(rn|). 
We  are  therefore  left  with  six  measurable  oblique  parameters. 

There  is  obviously  a  great  deal  of  freedom  in  how  to  parameterize  this  six- 
dimensional  parameter  space.  A  convenient  way  to  define  the  six  oblique  parameters 

is: 

(2  2  \ 

Cw         Sw  j    -^        ,    ,          .^        ,.-.-. 


_ 

oil     — 


m 

mz  j 

(2' 


av  =  «,.„ 

aW    -    ill' 

UCYY       —      u\iw 

aX     =     -swc 

where  sw  and  cw  denote  the  sin  and  cosine  of  the  weak  mixing  angle,  0«,  ,  and  <. 
represents  the  electromagnetic  fine-structure  constant.  For  numerical  purposes  w< 
later  use  a  (ml)  =  1/128  and  4  =  0-23.  The  quantity  6nab(q2)  denotes  the  rati 
<5IIaj(g2)/g2.  These  definitions  are  chosen  so  that  (i)  the  first  three  agree  wit: 
the  definitions  of  S1,  T  and  U  that  are  used  in  Ref.  [3],  («)  the  remaining  thre 
quantities,  V,  W  and  X,  vanish  if  6Hab(q2)  should  be  simply  a  linear  function  c 
q  ,  and  (m)  each  parameter  contributes  to  a  particular  kind  of  observable  (se 
below). 

A  straightforward  calculation  gives  expressions  for  the  corrections  to  the  variou 
electroweak  observables  in  terms  of  the  parameters  S  through  X.  One  complicatio 
arises  because  these  corrections  must  be  referred  to  the  corresponding  (radiativelj 
corrected)  SM  prediction.  Because  of  the  necessity  for  performing  radiative  corre< 
tions,  the  expressions  for  the  SM  predictions,  in  turn,  depend  on  (t)  which  thre 

50  Pramana-  J.  Phys.,  Supplement  Issue,  1995 


observables  are  used  to  infer  the  experimental  values  for  the  SM  couplings,  and  («") 

unknown  quantities,  such  as  the  masses,  m*,  mw,  of  the  t  quark  and  Higgs  boson. 

We  follow  the  universal  practice  of  using  the  three  best-measured  observables 

—  a  from  low-energy  electron  properties,  GF  from  muon  decay,  and  mz  from  LEP 

—  as  inputs  for  fixing  the  values  of  the  SM  couplings  from  experiment.  How  the 
parameters,  S  through  X,  appear  in  the  final  expressions  depends  in  detail  upon 
this  choice,  since  new  physics  also  affects  these  input  observables,  and  so  shifts  the 
inferred  values  for  the  SM  couplings.  With  these  inputs,  and  with  the  definitions  of 
5"  through  X  given  above,  U  and  W  only  enter  into  'charged-current'  quantities  like 
the  mass  and  width  of  the  W  boson,  and  of  these  W  appears  only  in  the  W-boson 
width.  Purely  neutral-current  data  depend  only  on  5,  T,  V  and  X,  and  of  these 
only  S  and  T  appear  in  observables  at  low  energies  (for  which  g2  «  0).  V  and  X 
arise  only  in  observables  defined  at  the  Z  resonance.  Of  these  two,  X  enters  as  a 
correction  to  the  effective  weak  mixing  angle,  and  so  contributes  to  asymmetries 
such  as  ALR  or  AFB.   V,  on  the  other  hand,  drops  out  of  these  asymmetries,  but 
instead  changes  the  Z  partial  widths,  since  it  alters  the  normalization  of  the  Z- 
fermion  couplings.  The  numerical  expressions  for  these  observables  as  functions  of 
S  through  X  are  summarized  in  Table  1. 


Table  1.  Oblique  Contributions  to  Observables 

The  dependence  of  some  electroweak  observables  on  S,  T,  U,  V,  W  and  X .  The  nu- 
merical values  o(m|)  =  1/128  and  s^,  =  0.23  are  used  in  preparing  this  table.  The 
precise  definitions  of  the  observables  can  be  found  in  Refs.  [7]. 


Tz  =  (rz)SM  -  0.009615+  0.0263T  +  0.0194V  -  0.0207*  (GeV) 

rtl  =  (riT)SM  -  0.001715  +  0.00416T  +  0.00295V  -  0.00369*  (GeV) 

r,+,-  =  (r,+,-  )SM  -  0.0001925  +  0.000?90T  +  0.000653V  -  0.000416*  (GeV) 

That  =  (Thad)sM  -  0.009015  +  0.0200T  +  0.0136V  -  0.0195*  (GeV) 

AFB(n]  =  (AFB(fi))SM  -  0.006775  +  0.00479T-  0.0146* 

Apol(r)  =  (Apoi(r})SM  -  0.02845+  0.0201T-  0.0613* 

Ae(PT)  =  (Ae(PT))SM  -0.02845  +  0.0201T- 0.0613* 

AFB(b)  =  (AFB(b))SM  -  0.01885+  0.0131T-  0.0406* 

AFB(c)  =  (AFB(Cy)SM  -  0.01475  +  0.0104T  -  0.03175X 

ALR  =  (ALR)SM  -  0.02845+  0.0201T-  0.0613* 

m2w  =  (m2w)SM(l  -  0.007235+  0.011ir  +  0.00849(7) 

Tw  =  (rw)SM(l  -  0.007235  +  0.0111T  +  0.00849(7  +  0.00781Vvr) 

9l  =  (0?  W  ~  0.002695  +  0.00663T 

9\  =  (9l)SM  +  0.0009375-  0.000192T 

gev(ve  -+  i/c)  =  (gev )SM  +  0.007235  -  0.0054ir 

geA (i/c  -» i/e)  =  (g\)SM  -  0.00395T 

Qw(W*Cs)  =  Qw(Cs)SM  -  0.7955  -  0.0116T 


The  expressions  from  Table  1  can  now  be  compared  with  the  data  to  obtain 
bounds  on  the  phenomenologically  allowed  range  for  the  parameters  5  through  X. 
In  making  this  comparison  we  use  the  values  mt  =  150  GeV  and  mH  =  300  GeV  for 


addressed  in  section  4.  The  one-<r  allowed  ranges  for  the  oblique  parameters  which 
result  from  the  fit  to  the  data  of  Ref.  [7]  are  then  given  by: 

S  =  -0.93  ±  1.7;        T  =  -0.67  ±  0.92;         U  =  -0.6  ±  1.1; 

V  =  0.47  ±1.0;        W  =1.2  ±7.0;        X  =  0.10  ±0.58.  (3) 

Notice  that  since  the  parameter  W  only  appears  in  the  width  of  the  W  boson,  it  is 
the  most  poorly  constrained. 

2.2  When  the  new  physics  is  heavy 

There  is  an  important  special  case  for  which  the  above  oblique  analysis  simplifies 
considerably.  This  is  when  the  lightest  mass,  M,  for  all  of  the  new  particles  is 
much  larger  than  mz.  Since  the  scale  over  which  q2  varies  appreciably  in  6Tlab(q2) 
is  set  by  M ,  in  this  case  these  functions  may  be  well  approximated  by  the  first 
terms  in  their  Taylor  expansion  in  powers  of  q2/M2.  Since  current  measurements 
are  restricted  to  the  regime  q2  &  m|,  this  approximation  is  controlled  by  powers  of 
the  small  parameter  m^/M2. 

The  leading  contributions  in  this  limit —  i.e.  those  which  are  not  suppressed 
by  inverse  powers  of  M2  -r-  are  simply  linear  functions  [2,  3,  4].  of  q2: 

6Hab(q2}KAab  +  Babq2.  (4) 

(Higher-order  terms  have  also  been  considered  in  the  literature.  [11])  With  this 
assumption  three  of  the  oblique  parameters,  V,  W  and  X,  vanish  identically  and 
so  all  new  physics  effects  are  described  by  the  three  parameters  5,  T  and  U.  The 
expressions  for  observables  in  terms  of  these  three  parameters  may  therefore  be 
obtained  simply  by  setting  V  =  W  =  X  =Q'm  Table  1.  A  fit  to  the  data,  [7]  with 
V,  W  and  X  constrained  to  vanish,  then  gives  the  following  one-cr  allowed  ranges 
for  S,  T  and  U: 

S  =  -0.48  ±0.40;        T  =  -0.32  ±  0.40;         U  =  -0.12  ±0.69.  (5) 

Not  surprisingly,  the  allowed  range  for  S  and  T  in  this  two-parameter  fit  is  smaller 
than  was  permitted  in  the  six-parameter  fit  whose  results  are  given  above. 

Since  the  neutral-current  data  are  completely  controlled  by  the  two  parameters 
S  and  T,  it  has  become  conventional  to  display  the  results  of  fits  to  this  data 
by  plotting  the  ellipses  of  constant  confidence  interval  in  the  two-dimensional  S-T 
plane.  Figure  1  displays  the  results  of  the  two  fits  described  above.  Notice  that, 
since  mw  can  depend  on  the  parameter  U  as  well  as  on  5  and  T,  mw  should  only  be 
included  in  such  a  plot  if  there  are  a  priori  reasons  for  believing  U  to  be  negligibl> 
small. 

2.3  Using  only  the  Z  resonance 

In  recent  years  the  electroweak  data  on  the  Z  resonance  has  become  more  accurate 
than  are  the  older  low-energy  measurements.  As  a  result  it  is  now  possible  [5]  tc 
usefully  constrain  new  physics  using  only  the  data  at  ga  =  mj.  As  is  clear  frorr 
the  earlier  counting  of  parameters,  such  a  restriction  to  only  one  value  of  four- 
momentum  transfer  permits  a  description  of  the  data  in  terms  of  fewer  oblique 
parameters  than  the  six  that  were  required  when  q2  «  0  was  also  considered.  This 
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Figure  1.  Constraints  on  5  and  T  from  a  fit  to  both  high-  and  low-energy  electroweak 
measurements.  The  solid  line  represents  the  68%  C.L.  setting  VWX  to  zero,  the  dashed 
line  represents  the  90%  C.L.  Setting  VWX  to  zero,  the  dotted  line  represents  the  68% 
C.L.  allowing  VWX  to  vary,  and  the  dot-dashed  line  represents  the  90%  C.L.  allowing 
VWX  to  vary. 


As  is  easily  verified  by  repeating  the  counting  argument  given  earlier,  only  three 
parameters  are  required  to  describe  the  general  oblique  corrections  in  this  case.  A 
convenient  choice  for  these  three  parameters  is:  [12] 


5'     =     £  +  4447  +  4(4-4)*, 

,  (6) 


With  this  choice,  the  parameters  5',  T'  and  V  reduce  to  5,  T  and  U  in  the 
limit  where  V  =  W  =  X  ~  0.  This  ensures  that  the  dependence  of  all  Z-pole 
observables  on  5',  T  and  U'  can  be  simply  read  off  from  Table  1  by  replacing 
(5,  T,  UtV,  W,  X)  -  (S>,  r,U',  0,0,0). 

As  before,  since  the  data  on  the  Z  resonance  only  depends  on  two  parameters 
S'  and  T',  it  is  convenient  to  display  the  result  of  a  fit  of  these  two  parameters  to 
the  data  as  a  plot  in  the  S'-T1  plane.  The  result  of  such  a  fit,  including  the  more 
recent  1994  data,  [13]  is  displayed  in  Figure  2. 

It  is  noteworthy  that  the  constraints  on  S'  and  T'  from  Figure  2  are  comparable 
to  those  on  S  and  T  using  the  larger  data  set,  including  also  the  q1  «  0  observables, 
that  are  used  for  Figure  1.  This  illustrates  the  high  quality  of  the  data  that  has 
been  obtained  over  recent  years  at  the  Z  resonance.  The  conceptual  difference 
between  Figures  1  and  2  remains  crucial,  however.  Whereas  a  two-parameter, 
S'  —  T",  description  of  the  Z-pole  data  relies  only  on  the  assumption  that  oblique 
corrections  dominate,  the  extension  of  such  a  description  to  include  also  neutral- 
current  measurements  at  q2  ?a  0  relies  on  the  additional  assumption  that  all  of  the 
new  physics  is  sufficiently  massive  to  permit  the  neglect  of  V  and  X. 

2.4  A  cultural  aside 

Treating  the  Z-pole  data  by  itself  is  very  much  in  the  spirit  of  the  approach  of 
Ref.  [5].  The  formalism  of  these  authors  differs  from  that  described  so  far  in  the 
following  twd  ways,  however. 

1.  These  authors  introduce  three  parameters,  ei,  62  »  and  £3,  which  broadly  cor- 
respond to  the  three  parameters  5',  T'  and  U'  .  Their  definitions,  however, 
are  made  directly  in  terms  of  the  observables,  and  do  not  separate  the  new- 
physics  contributions  from  those  due  to  SM  radiative  corrections.  As  a  result 
only  the  deviation,  6e»  =  €{  —  eSM,  from  the  SM  predictions  —  using  the 
fiducial  choices  for  mt  and  mH  —  are  directly  related  to  5',  T'  and  U'.  The 
connection  between  the  two  sets  of  parameters  is  [5,  14,  15]: 

6c2  =  -^,         fcs  =  £^.  (7) 

v 


2.  A  second  important  difference  in  the  approach  of  Altarelli  et.al.  is  to  permit 
one  non-oblique  correction,  parameterized  by  e\,,  to  the  Zbb  vertex.  The  inclu- 
sion of  this  correction  is  motivated  by  the  large  mt-dependent  contributions 
it  receives  from  SM  radiative  corrections,  and  potentially  from  other  kinds  of 
new  physics.  We  return  to  this  type  of  term  in  the  next  section. 
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Figure  2.  Constaints  on  Sr  and  T"  from  &  global  fit  of  pretioon  electreweak  measurements 
at  the  Z  resonance  only.  TKe  two  lines  represent  the  64%  C.L.  and  90%  C.L.  ellipses. 
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3.  Nonoblique  new  physics 

Although  many  kinds  of  new  physics  dominantly  produce  oblique  corrections  to 
electroweak  observables,  this  is  certainly  not  true  for  all  kinds.  Examples  include 
any  new  physics  which  preferentially  couples  to,  say,  the  heavy  generations.  An- 
other worthwhile  generalization  of  the  previous  formalism  is  therefore  to  extend  it 
to  include  nonobliqne  corrections.  In  order  to  be  practical,  however,  such  a  gener- 
alization cannot  be  permitted  to  introduce  too  many  new  parameters,  or  else  the 
utility  of  the  confrontation  with  the  data  will  be  lost.  Some  criterion  is  necessary  to 
limit  and  organize  the  number  of  independent  interactions  that  need  be  considered. 
A  very  natural  way  to  provide  the  required  organization  is  to  assume  that  all 
of  the  new  physics  is  much  heavier  than  the  electroweak  scale,  mz .  In  this  case 
the  implication  of  such  new  physics  for  current  experiments  can  be  parameterized 
in  terms  of  a  low-energy  effective  lagrangian  [16]  such  as  would  be  obtained  by 
integrating  out  all  of  the  presently-undiscovered  heavy  particles.  In  this  section  the 
results  of  such  an  analysis  [8]  are  summarized. 

The  limit  of  large  masses,  M,  for  all  hypothetical  new  particles  allows  their  low- 
energy  interactions  to  be  organized  according  to  dimension,  with  operators  having 
a  higher  mass  dimension  being  more  suppressed  by  inverse  powers  of  M.  Simple 
dimension  counting  need  not  be  the  whole  story,  of  course,  as  low-energy  selection 
rules  and  symmetries  can  also  help  to  determine  the  relative  size  of  the  various 
effective  interactions.  For  a  more  complete  discussion  of  the  issues  involved  see 
Ref.  [8]. 

3.1  The  effective  interactions 

Consider,  therefore,  the  most  general  effective  interactions  that  are  consistent  with 
the  particle  and  symmetry  content  that  is  appropriate  for  applications  to  processes 
having  energies  £100  GeV.  Since  our  intention  is  to  study  current  experiments  in 
this  energy  range,  we  take  our  particle  content  to  include  only  those  which  already 
have  been  detected.  This  includes  most  of  the  SM  particles,  including  precisely 
three  left-handed  neutrinos,  but  does  not  include  the  Higgs  boson  and  the  top 
quark,  which  we  take  to  have  been  integrated  out  (if  they  indeed  exist  [17]).  Due 
to  the  absence  of  these  particles,  the  electroweak  gauge  group  must  be  nonlinearly 
realized  on  the  given  fields,  and  so  in  practice  it  can  be  completely  ignored  [18] 
(apart  from  the  unbroken  electromagnetic  subgroup)  in  what  follows.  The  price 
for  choosing  this  particle  content  is  that  the  resulting  effective  lagrangian  has  to 
violate  unitarity  at  energies  at  or  below  the  TeV  range. 

Typically,  any  such  lagrangian  contains  a  great  many  effective  interactions.  For- 
tunately, there  is  a  great  deal  of  latitude  in  how  the  interactions  can  be  written, 
since  there  is  considerable  freedom  to  redefine  fields  to  simplify  terms  in  the  la- 
grangian. We  choose  to  work  with  fields  for  which  the  kinetic  and  mass  terms  take 
their  standard  diagonal  forms.  The  nonstandard  interactions  which  arise  in  the 
most  general  effective  lagrangian  up  to  mass  dimension  five  can  then  be  written  in 
the  following  way.  [8] 

1.  Fermion  Masses:  The  only  possible  effective  interactions  havi-  g  dimension  two 
or  three  are  an  arbitrary  set  of  masses  for  the  W  and  Z  bosons  and  for  each  of 
the  low-energy  fermions.  The  fermion  mass  terms  so  obtained  are  indistinguishable 


Tk_nnn^-__         T       T»l C« 1 i 


A  neutrino  mass  matrix  would  have  two  effects.  It  would  (t)  give  the  neutrino 
mass  eigenstates  nonzero  masses,  and  (if)  it  would  introduce  unitary  Cabbibo- 
Kobayashi-Maskawa  (CKM)  style  mixing  matrices,  V}3  into  the  various  charged- 
current  neutrino  couplings.  (Flavour  off-diagonal  neutrino  kinetic  terms,  such  as 
can  arise  when  sterile  neutrinos  are  integrated  out,  can  also  introduce  off-diagonal 
neutral-current  interactions,  and  can  make  the  charged-current  mixing  matrices 
nonunitary.)  Notice  that,  unlike  most  other  new-physics  corrections,  these  mixing 
angles  need  not  be  small,  even  if  the  neutrino  masses  are. 

2.  Electromagnetic  Couplings:  The  total  electromagnetic  couplings  of  fermions  are 
straightforward  to  write  down: 


Qi  /,-  A,  +  7X"(#7,  +  <#7«)  fj  *w  >  (8) 

where  the  indices  i  and  j  are  to  be  summed  over  all  possible  flavours  of  light 
fermions,  /,-.  Qi  represents  the  electric  charge  of  /,-,  in  units  of  the  proton  charge. 
JL  and  TR  denote  the  usual  projection  matrices  onto  left-  and  right-handed  spinors. 
Linear  combinations  of  the  effective  coupling  matrices,  <f£  and  <f^,  represent  non- 
standard  magnetic-  and  electric-dipole  moment  interactions. 

3.  Charged-  Current  Interactions:  The  fermion  charged-current  interactions  be- 
come: 


X*(c?7L  +  ejfriO  fj  w;v]  +  cc,  (9) 


where  W^v  —  D^WV  —  DVW^  is  the  W  field  strength  using  electromagnetic  covariant 
derivatives,  D^.  For  leptons, 

hvLtl>     =    6h"L^ 

4.V«  (^  _         *S         ,       c*aT       ,^L_  4(A,  +  A^A 

l  \     4(4  _  4)  +  2(4  _  4)  +  84      2(4  -  4)  )' 
h?'  =  sly1'  ,  (io) 

while  for  quarks: 


/    4(4  -  4) +  2(4  -  4)  +  84      "2(4  -  4J"  J  ' 
hlidi   =  thyd>.  (ii) 

Here  V*t3  and  V^  respectively  denote  the  unitary  CKM  matrices  for  the  left-handed 
charged  current  interactions  of  the  leptons  and  quarks.  The  coefficients  f>huL\^  and 
C*L(R)  represent  a  set  of  arbitrary  nonstandard  fermion-W  couplings,  and  5,  T  and  U 

are  defined  in  terms  of  the  vacuum  polarizations  as  in  the  previous  section.  Finally, 

/  ^ 

A/  (with  /  =  e,  //  or  r)  denotes  the  following  quantity:  A/  = 
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4-  The  W  Mass:  Although  the  SM  contains  gauge  boson  mass  terms,  these  arise 
only  in  a  particular  linear  combination,  leading  to  a  calculable  mass  relation  for 
mw  in  terms  of  the  three  inputs,  mzt  Gf  and  a:  mw  =  m%*  =  mzcw+(radiative 
corrections).  This  relation  is  ruined  by  the  generic  effective  gauge-boson  mass 
terms,  leading  to  the  following  nonstandard  contribution  to  the  W  mass:  [8] 


(12) 

5.   Neutral- Current  Couplings:  The  general  interactions  between  the  light  fermions 
and  the  Z  boson  can  be  written  as  follows: 

+  #7«)  /;  Zp  +7X"(n?7i.  +  ' 


-  Ae  -  A,)] 


r2    —  jf^  li          r2    _  ,2 

cw         **J          c»        *ic  w          tf 

(15) 

As  before,  Qt-  represents  here  the  electric  charge  of  fermion  /»,  and  TS,-  is  its  eigen- 
value for  the  third  component  of  weak  isospin.  Z^v  denotes  the  abelian  curl: 
dpZy  —  dvZp.  The  effective  coupling  matrices  69^,  R,  and  n'LJ(w)  represent  arbitrary 
sets  of  nonstandard  couplings  between  the  fermions  and  the  Z  boson. 

These  expressions  may  now  be  used  to  compute  electroweak  observables.  Work- 
ing to  linear  order  in  the  effective  couplings  permits  the  neglect  of  all  interactions 
which  cannot  interfere  with  the  corresponding  SM  contribution.  This  eliminates  a 
good  many  of  the  effective  vertices  that  are  listed  above,  including  most  flavour- 
changing  interactions.  (Eef.  [8]  gives  a  more  general  discussion  which  also  includes 
the  strongly  bounded  flavour-changing  couplings.)  The  results  for  a  number  of  low- 
energy  observables  (those  for  which  q2  «  0)  are  listed  in  Table  2,  and  those  for 
observables  at  the  weak  scale  in  Table  3. 

The  parameters  which  appear  in  Tables  2  and  3  can  be  fit  to  the  precision 
electroweak  data,  just  as  was  done  for  the  oblique  parameters  S  through  X.  The 
results  of  such  a  fit  [8]  are  quoted  in  Tables  4,  5  and  6.  In  these  tables  the  results  of 
two  types  of  fits  are  presented.  In  one  of  these  (the  Individual  Fit')  the  parameter 
in  question  has  been  considered  in  isolation,  with  all  of  the  other  parameters  set 
to  zero  by  hand.  This  kind  of  fit  is  not  realistic,  but  has  often  been  considered  in 
the  literature.  The  second  fit  (the  'Global  Fit')  allows  all  of  the  parameters  to  be 
varied  in  fitting  the  data.  Perhaps  surprisingly,  the  resulting  bounds  on  the  various 
parameters  are  nevertheless  quite  good, 


Table  2.   Low-Energy  Observables 

The  contributions  to  low-energy  (q2  »  0)  electroweak  observables  that  are  generated 
by  the  various  interactions  of  the  general  non-oblique  effective  lagrangian.  The 
precise  definitions  of  the  observables  can  be  found  in  Ref.  [8]. 


R,  =  T(v  -+  ei/)/r(T  -+  pv)  =  R*»  (1  -f  2Ae  -  2A^) 
RT  =  T(r  -*  evv)/T(iA  ->  ei/F)  =  fl*"  (1  4  2Ar  -  2AP) 
/V  =  T(r  -+  jii/P)/r(Ai  -*  ei/17)  =  HJJf  (1  +  2Ar  -  2AC) 
p  =  1  +  c*T 

=  <rSM(^N-  -*  /4~X)  [l  +  2A^  -  2Ae  -  2  (Re  (6hl 
=  (ffJ)SM-0.002695+0.00663T-  1.452AM  -  0.244AC 

4-0.620  Re(6fc^)  -  0.856  6g*d  +  0.689  «^u  4-  1.208  S^*" 
=  (^)SAf-f  0.000937  S1-  0.000192  T  +  0.085Ae-0.0359A^ 
+0S0*620  Re(«h«rf)  -f  0.156  «y?  -  0.311  6fli«u  +  0.121  J^* 
(^v)SAf  +  0.00723  5  -  0.00541  T  +  0.656  Ae  j-  0.730AP 

+^'e  +  Sff  -  0-074  ^?"*  -  0.037  Rc(^) 
(<7a,)SM  -  0.00395  T+  1.012  A,  -f  ^  -  ^e  -  1.012  69 

0.0506  Re  (6h%*) 
=  Cfuw  -f-  0.004825  -  0.00493  T  +  0.631(Ae  -f-  Ap) 

+0.387  5^  -  ^«u  -  0.387  6g«  -  6g™ 
=  Cf^  -  0.00241  S  -f  0.00442  T  -  0.565(  Ae  4-  AM) 

-0.693  ^««  -  8g*d  -f  0.693  ^  -  6g*d 
=  Cf^  +  0.00723  5  -  0.00544  T  +  0.696(Ae  4-  A^) 

-  0.08  6^u  4  6~geae  4  0.08  6j|«« 
-  0.00723  5  4-  0.00544  T  -  0.696(AC  4  Ap) 

r  -  0.08  ^f  -  *if«-  4  0.08  6g" 
=  [Qw(£3Cs)]SM  -  0.7965-  0.0113T4  1.45(A«  4 
4147  (6g?  -  6g«)  4  422  (6g*d  4  *yf)  4  376  (6g™  4 
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Table  3.  Weak-Scale  Observables 

The  contributions  to  weak-scale  (q2  =  m|  or  rn^r)  electroweak  obseryables  that  a 
generated  by  the  various  interactions  of  the  general  non-oblique  effective  lagrangia 
The  precise  definitions  of  the  observables  can  be  found  in  Ref.  [8]. 


=  (mi,)SM[l  -  0.00723  5  +  0.0111  T  +  0.00849  U  -  0.426(Ae  +  A,,)] 
-  =  (IY^-)S»,  [l  -  0.0023C  S  +  0.00944T  -  1.209(AC  +  A,,) 

-4.29  «§?  + 3.66  65?] 

=  (rufl)SM  [l  -  0.00649  5+  0.0124T-  1.59(Ae  +  A,,) 
+4.82  6<ftu- 2.13  6jH 

,[l  -  0.00452  S  +  0.0110T-  1.41(Ae  +  AM) 
-4.57  $$?  +  0.828  6g*f] 

5)SM  [l  -  0.00452  5  +  O.OllOr -  1.41(Ae  +  AM) 
-4.57  5^  +  0.828$$!?] 

had)sM  [l  -  0.00518S+  0.0114T-  1.469(Ae  +  A 
-1.01  (6g*d  +  ^f  +  «ff*»)  +  0.183  (^ 
+0.822 («^«  +  f>~g?)  -  0.363 (^-  +  6 
,,^  =  (r^^JsM  [l  +  0.00781T-  (Ae  +  AM) 

z  =  (rz)SM  [l  -  0.00385  S  +  0.0105  T  -  1.35(Ae  +  Ap)  +  0.574  (6g™  + 
-0.254  (6J««  +  6ff«)  +  0.268  (*%•"•  +  «jf?1'- 
-0.144  (6fifie  +  6^  +  OBBSg?)  +  0.123  (5y« 
-0.707  (6#*  +  fiy"  +  ^»»)  +  0.128  (6~g<!*  +  65 

AX.H  =  Af  %  -  0.0284  5  +  0.0201  T  -  2.574(Ae  +  A,)  -  3.61  Sg?  -  4.238  8g 
A£*~  =  (AFB)SM  -  0.00677 S-f  0.00480 T-  0.6l4(Ae  +  AM) 

_    -0.430  (6^«  +  6ff)  -  0.505  (^«  +  6ff) 
APB(bb)  =  (yiFBTO)SM  -  0.0188  5  +  0.0133  T  -  1.70(Ae  +  A,,) 

-2.36  6g?  -  2.77  5g«  -  0.0322  6g?  -  0.178  ^j* 
APB(CC)  =  (AFB(cc))5M  -  0.01475  +  0.0104 T  -  1.333(Ae  +  A,,) 
-1.69  65"  -  1.99  6g«  +  0.175  6g?  +  0.396  6  ft 
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Table  4.  Oblique  Parameters 

Results  for  the  oblique  parameters  5,  T  and  U  obtained  from  the  fit  of  the  new- 
physics  parameters  to  the  data.  The  second  column  gives  the  result  for  the  (unreal- 
istic) case  where  all  other  parameters  are  constrained  to  vanish.  Column  three  gives 
the  result  of  a  global  fit  in  which  all  of  the  parameters  of  the  effective  lagrangian 
are  varied. 


Parameter  Individual  Fit  Global  Fit 

5  -0.10  ±0.16  -0.2  ±1.0 

T  +0.01  ±0.17  -0.02  ±0.85 

U  -0.14  ±0.63  +0.3  ±1.2 


3.2  The  Zbb  vertex 

An  important  special  case  of  the  above  analysis  is  to  simply  add  ~  nonstandard 
dimension-four  coupling  between  the  Z  boson  and  the  6  quark,  in  addition  to  the 
oblique  corrections  S,  T  and  U.  This  corresponds  to  the  approach  of  Ref.  [5],  with 
the  parameters  €1,62  and  €3  given  as  in  the  previous  section,  and  ej  given  by: 

6cb  =  6ft  -  ef  =  -2  6~gbLb.  (16) 

Notice  that,  like  all  of  the  flavour-diagonal  neutral-current  couplings,  the  reality  of 
the  lagrangian  implies  that  6gbLb  must  be  real.  In  this  case,  the  formulae  of  Tables 
2  and  3  simplify  considerably. 

4.  Applications 

With  the  above  results  in  place  for  the  experimental  limits  on  the  various  effective 
parameters,  the  next  step  is  to  compute  the  values  of  these  parameters  in  terms  of 
the  couplings  and  masses  of  various  underlying  models.  We  therefore  now  turn  to  a 
brief  summary  of  some  of  the  applications  to  which  the  above  formalism  has  been 
made. 

4.1  mt  dependence 

The  starting  point  for  the  comparison  with  experiment  is  the  choice  of  a  set  of 
fiducial  values,  mt  and  mH,  for  the  unknown  masses,  mt  and  mH,  which  appear 
in  the  SM  contributions  to  various  observables.  We  have  chosen  mt  =  150  GeV 
and  rhff  =  300  GeV  in  performing  the  fits,  but  it  is  natural  to  wonder  how  the  SM 
predictions  change  as  these  masses  vary. 

Perhaps  the  simplest,  and  most  useful,  application  of  the  techniques  described 
in  earlier  sections  is  to  approximately  determine  this  mt  and  mH  dependence.  To  do 
so,  imagine  both  the  top  quark  and  the  Higgs  boson  to  be  much  heavier  than  mz. 
In  this  case,  all  of  the  implications  for  lower-energy  observables  due  to  these  heavy 
particles  can  be  phrased  in  terms  of  the  effective  lagrangian  obtained  by  integrating 
them  out  of  the  SM.  It  can  therefore  be  mimicked  by  choosing  an  appropriate  m* 
and  mH  dependence  for  some  of  the  effective  couplings  of  section  3. 


Table  5.  Charged-Current  Parameters 

More  results  of  the  fits  of  the  new-physics  parameters  to  the  data.  The  quan- 
tities Li  and  £2  arise  in  tests  for  the  unitarity  of  the  CKM  matrix,  and  are 
defined  as:  Si  =  Re  («&£')  +  [Re (Vub)Re («fc£fc)  +  Im (Vub)lm (6huLb)}  /|V«,|  and 
E2  =  Re(6AcLd)  +  |Ve.]R*(fifci'  +  6~h%)/\Ve*\  +  \Veb\K*(6~h?)/\Ved\.  Blanks  indicate 
where  the  corresponding  fit  would  be  inappropriate,  such  as  for  when  a  parameter 
always  appears  in  a  particular  combination  with  others,  and  so  cannot  be  individ- 
ually fit. 


Parameter 

Ae 
A, 


Individual  Fit 

-0.0008  ±.0010 
+0.00047  ±.00056 

-0.018  ±0.008 
-0.00041  ±  .00072 
-0.00055  ±  .00066 

0  ±  0.0036 

-0.0018  ±.0032 

-0.00088  ±.00079 

0  ±  0.0008 

-0.09  ±.16 


-f  0.11  ±.98 

+0.022  ±.20 

+0.022  ±  .20 

+0.5  ±4.6 


Global  Fit 

-0.0011  ±.0041 
+0.0005  ±  .0039 
-0.018  ±.009 
+0.0001  ±  .0060 
+0.0003  ±.0073 
-0.0036  ±.0080 

+0.0007  ±.0016 
-0.0004  ±.0016 

+0.005  ±.027 


+0.11  ±0.98 
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Table  6.   Neutral-Current  Parameters 

Still  more  results  of  the  fits  of  the  new-physics  parameters  to  the  data.  As  before, 
blanks  indicate  where  the  corresponding  fit  would  be  inappropriate,  such  as  for 
when  a  parameter  always  appears  in  a  particular  combination  with  others,  and  so 
cannot  be  individually  fit. 


Parameter 


Individual  Fit 


Global  Fit 


69% 


Sgl" 


+0.0016  ±.0015 

+0.003  ±.012 

+0.0037  ±  .0038 

+0.007  ±.015 

-0.0003  ±.0018 

-0.002  ±0.014 

+0.0032  ±  .0032 

-0.003  ±.010 

-0.0009  ±.0017 

-0.003  ±.015 

-0.0052  ±  .00095 

+0.002  ±.085 

-0.0011  ±.0021 

+0.001  ±.018 

+0.0028  ±  .0047 

+0.009  ±.029 

-0.0005  ±.0016 

-0.0015  ±.0094 

+0.0019  ±  .0083 

0.013  ±.054 

-0.0048  ±.0052 

— 

-0.0021  ±  .0027 

+0.0023  ±.0097 

-0.0048  ±.0052 

— 

&~9 


-0.00029  ±.00043 
-0.00014  ±.00050 
+0.0040  ±.0051 
-0.0003  ±.0047 
-0.0021  ±  .0032 
-0.0034  ±.0028 


-0.004  ±.033 

-0.0001  ±  .0032 

+0.0001  ±  .0030 

+0.005  ±  .032 

+0.001  ±  .028 

0.000  ±.022 

-0.0015  ±.019 


Evaluating  the  one-loop  vacuum  polarization  graphs  [2,  3,  4,  5]  containing  top 
and  Higgs  loops  gives  the  following  large-mass  dependence  for  the  oblique  param- 
eters: 


h  +        In 

or          mt          DTT 


L  (?•)_,„(?« 

L     \mtj          \mH 


f,JJ            ~      _  _   In      — 1  1  (17} 

°VSM     — in  I  ^r^  I  \LI ) 

IT       \fnt  J 
Similarly,  the  Zbb  vertex-correction  graph  involving  a  virtual  top  quark  gives  [5, 19] 


These  are  the  only  virtual  contributions  which  can  both  (i)  depenu  strongly,  at  one 
loop,  on  m*,  and  (ft)  contribute  appreciably  to  electroweak  observables. 

4.2  Exotic  fermions 

One  application  to  which  the  above  formalism  has  been  applied  [8]  is  the  determina- 
tion of  the  constraints  on  the  masses  and  couplings  that  are  possible  for  hypothetical 
exotic  fermions  which  can  mix  with  the  ordinary  ones.  For  the  present  purposes  we 
take  'exotic'  to  mean  new  fermions  which  transform  under  the  SUL(2)  gauge  group 
as  either  left-handed  singlets  and/or  right-handed  doublets.  Because  of  their  mix- 
ing with  ordinary  fermions,  exotic  fermions  of  this  sort  can  change  the  couplings 
of  the  ordinary  fermions  to  the  W  and  Z.  Constraints  on  these  mixings  can  be 
inferred  from  present  precision  measurements  of  these  couplings. 

One  motivation  for  doing  this  analysis,  is  that  it  also  has  been  performed  [20] 
by  fitting  these  models  directly  to  the  data.  By  comparing  the  bounds  obtained 
here  with  those  of  the  direct  fit  to  the  model  we  can  learn  how  closely  they  agree 
with  one  another.  One  would  expect  the  present  approach  to  give  bounds  that  are 
marginally  weaker  than  those  of  a  direct  fit  to  the  model,  since  there  are  typically 
more  parameters  in  the  general  effective  lagrangian  than  there  are  couplings  in  a 
particular  underlying  theory.  In  this  sense  our  results  can  be  considered  the  most 
conservative  bounds  possible.  The  interesting  point  is  that  the  limits  we  obtain 
are  not  very  much  weaker  at  all,  and  so  very  little  information  is  lost  by  using  the 
simpler  effective-lagrangian  fit.  For  simplicity  of  presentation  we  restrict  ourselves 
here  to  charged  quarks,  although  neutrino  mixing  can  be  handled  in  much  the  same 
way  [8]. 

In  general,  mixing  between  the  known  fermions  and  exotic  ones  induces  flavour- 
changing  processes  into  the  neutral  current  interactions  of  ordinary  fermions 
(FCNC's).  For  simplicity  we  ignore  these  types  of  induced  couplings  here,  al- 
though they  can  be  treated  in  a  similar  manner.  FCNC's  are  avoided  if  every 
known  electroweak  multiplet  of  fermions  mixes  separately  with  its  own  exotic  part- 
ner, in  which  case  the  mixing  can  be  characterized  in  terms  of  a  mixing  angle,  0^R-v, 
*  =  e1p,r,«,cf,s, .... 

To  make  contact  with  our  general  formalism,  simply  integrate  out  all  of  the 
undiscovered  exotic  particles  to  produce  the  low-energy  effective  theory.  Sufficient 


ive  «»e  oj  precision  eieciroweaK  measurements 


accuracy  is  obtained  by  working  at  tree  level,  and  so  it  is  easy  to  integrate  out  the 
heavy  fermions:  one  transforms  to  a  basis  of  mass  eigenstates,  and  sets  all  heavy 
fields  equal  to  zero.  As  a  result  we  find  S=T=U=Q,  and  the  nonstandard  neutral 
current  and  charged-current  couplings  of  the  ordinary  fermions  become  modified 
by  [8]: 


(19) 
and 

-          i  1  r  .1  .  j     **, 

-  cu«  e  j rnJ  f)r\\ 

-  SR  SR  UH   .  (<M) 

in  which  [s\ (a ^ }     =  1  —  (c1. /_\)     =  sin2  #',„,,,  and  L/LJ  is  a  unitary  CKM-type 
\  L\.H)J  \  L\.")/  L\")  *  J  r 

matrix  for  the  right-handed  charged-current  couplings.  Similar  expressions  also 
hold  for  Icptons  [8]. 

It  is  now  a  simple  matter  to  bound  the  mixing  angles  using  these  expressions 
together  with  the  constraints  of  Tables  5  and  6.  We  find  the  following  limits  at 
90%  c.I.  (defined  as  1.64<r): 

Ae,/i  :         K)2,«e)2  <  0.016;         (s£)2  ,  (s£")2  <  0.012 

L'\I*/'\L/'  \         / 

<VL,R  :        «)2  <  0-01;     K)2  <  0-09;     (*«)2  <  0.03; 
(sj)2  <  0.05;      (s*  )2  <  0.05;      (.<?*) 2  <  0.03, 

A  comparison  of  the  above  numbers  with  those  in  the  literature  [20]  confirms  that 
they  are  very  similar  to,  but  marginally  weaker  than,  those  found  by  fitting  directly 
to  the  mixing  angles  themselves. 

4.3  Light  exotic  particles 

Another  application  that  has  been  examined  with  this  formalism  [12,  14,  21]  is  the 
case  of  new  exotic  particles  whose  quantum  numbers  preclude  their  mixing  with 
ordinary  fermions  [22].  Such  particles  arise  in  a  great  many  types  of  theories  for  new 
physics,  including  supersymmetric  models,  theories  with  additional  generations, 
and  technicolour  models.  Tn  these  examples  the  new  particles  need  not  be  much 
heavier  than  the  weak  scale,  and  so  we  do  not  make  this  assumption  here. 

Unlike  for  the  previous  example,  in  this  case  the  absence  of  tree-level  mixing 
implies  that  the  dominant  effects  of  the  new  physics  arise  through  its  one-loop 
contributions  to  well-measured  observables.  This  is  in  most  cases  dominated  by 
the  contributions  to  gauge  boson  vacuum  polarizations,  and  so  we  consider  here 
only  oblique  corrections. 

Formulae  for  the  one-loop  contributions  to  the  six  oblique  parameters,  S  —  X, 
by  scalars  and  fermions  in  general  representations  of  the  electroweak  gauge  group 
have  been  given  in  the  literature  [7,  23,  24].  For  example,  the  one-loop  vacuum 
polarization  due  to  afermion  with  left-  and  right-handed  coupling  constants,  &£(„), 
to  gauge  boson  'a'  (with  a  =  7,  W,  Z]  is: 


fab(q2,z)  In 


(22) 
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Here  m^x)  =  m2(l  -  x)  +  rnjx,  where  mt-  and  mj  are  the  masses  of  the  two 
fermions  which  appear  within  the  loop,  and 


" 


(*) 


*i*a      i    Kf,Kr        /17ljfn«\  /no\ 

+     L     R     ^      "    L       (-^J-)  (23) 


For  example,  a  standard-model  doublet  would  have  kl  =  Jb^  =  eQ,,  kzL  =  (e/swcw) 
[Tat  ~  Q.-4L  fcJS  =  (*/*»c«)[-Q.-4],  *r  =  e/v^*  and  ^  =  0.  /i2  denotes  the 
renormalization  point,  where  we  have  renormalized  using  dimensional  regularization 
and  MS. 

Expressions  such  as  these  permit  a  survey  of  the  couplings  and  masses  that 
are  permitted  for  exotic  particles  transforming  in  a  variety  of  electroweak  gauge 
representations  [12],  For  the  purposes  of  illustration,  Figure  3  displays  the  range  of 
values  for  5'  and  T'  that  are  produced  by  an  additional  generation  of  quarks  and 
leptons.  Notice  that  this  region  does  not  necessarily  include  the  origin  (5'  =  T'  =  0) 
in  the  limit  that  the  members  of  this  additional  generation  become  very  heavy.  This 
is  because  heavy  fermions  like  these  need  not  decouple  from  5'  and  T'  in  this  limit. 
This  makes  the  oblique  parameters  especially  sensitive  i  probes  for  these  types  of 
new  particles. 

4.4  Is  technicolour  dead? 

The  above  observation,  that  some  heavy  particles  need  not  decouple  from  oblique 
parameters  like  5*  and  T  as  their  masses  get  very  large,  has  been  used  to  argue  that 
technicolour-like  models  for  dynamical  electroweak  symmetry  breaking  are  ruled 
out  by  the  current  electroweak  data  [2,  3,  4,  5].  Various  estimates  for  the  oblique 
parameters  in  these  theories  predict  5"  £  O(+ 1),  and  the  two-parameter  5  -  T  fit 
to  the  Z-po\e  data  excludes  these  values  at  roughly  the  three-<r  level.  (A  similar 
conclusion  has  been  reached  by  considering  the  anomalous  Zbb  coupling  as  well 
[25].) 

This  conclusion  would  be  inescapable  provided  that  these  parameters  could 
be  reliably  computed  to  be  O(-fl)  in  technicolour  models.  Unfortunately,  since 
these  are  strongly  coupled  theories,  the  robustness  of  the  various  estimates  is  not 
clear  [26].  For  example,  a  potential  loophole  may  exist  when  light  particles  are 
present  in  the  technispectrum  —  such  as  often  happens  in  these  theories  due  to 
the  appearance  of  pseudogoldtone  bosons.  In  this  case  the  analysis  requires  the 
additional  parameters  V  through  X,  and  the  contributions  of  the  light  particles 
can  be  made  to  contribute  negatively  to  5"  and  T'  [24,  12],  for  some  choices  for 
their  masses  and  mixings. 

These  loopholes  are  difficult  to  completely  close  due  to  the  difficulty  in  comput- 
ing with  these  theories.  In  the  meantime,  it  must  be  conceded  that  the  case  against 
these  theories  remains  persuasive,  albeit  circumstantial. 

4.5  TGV's 

As  a  final  application  we  consider  how  the  oblique  analysis  of  section  2  can  be  used 
to  bound  [27]  the  potential  existence  of  nonstandard  self- couplings  for  the  elec- 
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Figure  3.  The  contribution  of  an  extra  SM  family  to  the  oblique  parameters  5"  and  T'. 
Solid  line:  a  colour-triplet,  Y  =  |  quark  doublet  with  masses  (mb/,mt/)  as  indicated  in 
brackets.  Dotted  line:  a  colour-singlet,  Y  =  |  lepton  doublet  with  masses  (m,/,m(>).  The 
grid  spacing  represents  steps  of  25  (resp.  30)  GeV  for  the  solid  (resp.  dotted)  plots. 
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The  starting  assumption  is  that  the  lightest  mass,  M,  of  any  undiscovered  par- 
ticle is  high  in  comparison  with  the  weak  scale,  mz.  Integrating  out  this  heavy 
physics  gives  an  effective  lagrangian,  denned  at  the  scale  M,  which  we  assume  to 
include  the  five  CP-conserving  TGV's,  whose  coefficients  are,  in  a  standard  [28] 
notation:  AK7,  AKZ,  A0i2,  Az,  and  AT.  The  goal  is  to  constrain  the  coefficients 
of  these  effective  interactions  using  precision  electroweak  measurements  at  lower 
energies,  E  ^  mz. 

In  order  to  do  so  we  imagine  running  the  effective  lagrangian  defined  at  the  scale 
M,  down  to  the  scale  mz  where  current  measurements  are  made.  Since  anomalous 
TGV's  are  not  directly  probed  at  these  energies,  the  bounds  come  from  the  other 
effective  interactions  which  are  generated  by  the  TGV's  during  the  running  down 
from  M  to  mz  .  We  therefore  compute  all  of  the  effective  interactions  at  the  weak 
scale  that  get  generated  (at  one  loop)  by  loops  containing  TGV's. 

There  are  two  kinds  of  effective  interactions  that  are  produced  by  TGV's  in  this 
way:  (i)  oblique  parameters,  and  (ii)  nonstandard  fermion  /gauge-boson  vertices. 
The  fermion/gauge-boson  vertices  themselves  come  in  two  types:  (a)  those  which 
are  universal,  in  that  they  are  independent  of  the  masses  of  the  SM  fermions,  and 
(6)  those  which  depend  strongly  on  the  top-quark  mass,  mt.  Interestingly,  we  find 
that  all  of  the  vertex  corrections  of  type  (a)  that  are  generated  by  TGV's  can  be 
rewritten  as  oblique  corrections  by  performing  a  suitable  field  redefinition  [27].  The 
entire  analysis  can  therefore  be  done  using  the  oblique  parameters,  together  with  a 
small  number  of  m£-dependent  terms. 

The  TGV-generated  oblique  parameters  that  are  produced  by  this  analysis  are 
displayed  in  Table  7,  where  the  numerical  values  a(mj)  =  1/128  and  si  =  0.23 
are  used.  We  also  take  M  —  1  TeV  and  the  effective  couplings  are  evaluated 
at  the  low-energy  scale  jz  =  100  GeV.  The  quantities  V  and  X  incorporate  the 
nonuniversal,  mt-dependent,  contributions,  and  are  to  be  used  instead  of  V  and  X 
in  the  expressions  for  any  observables  which  are  based  on  the  process  Z  —  »  bb. 

The  expressions  from  Table  7  can  then  be  used  in  Table  1,  for  the  electroweak 
observables  in  terms  of  the  oblique  parameters.  When  all  five  TGV  coefficients 
are  left  free  in  the  resulting  fit  to  the  data,  no  useful  bound  is  obtained,  beyond 
the  ever-present  ones  like  perturbative  unitarity.  This  shows  that  the  data  is  not 
yet  sufficiently  accurate  to  constrain  these  quantities.  A  common  procedure  in 
the  literature  is  to  instead  bound  each  TGV  separately,  with  all  of  the  others 
constrained  to  vanish.  This  type  of  analysis  gives,  in  our  case,  the  following  one-cr 
allowed  ranges  [27]: 


=  -0.033  ±0.031; 

A«7  =  0.056  ±  0.056;         AKZ  =  -0.0019  ±  0.044;  (24) 

A7  =  -0.036  ±  0.034;         Az  =  0.049  ±  0.045. 
Although  the  bounds  obtained  in  this  way  are  more  restrictive,  they  are  also  not 
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Table  7.  TGV  Contributions  to  Oblique  Parameters 

One-loop  results  for  the  induced  parameters  5,  T,  U,  V,  W  and  X,  defined  at 
H  =  100  GeV,  in  terms  of  the  various  TGV  couplings  defined  at  M  =  1  TeV.  As 
usual,  a(mz)  -  1/128  and  s£,  =  0.23. 


Parameter  One-Loop  Result 


S  2.63A0iz  -  2.98A/c7  +  2.38A«Z  +  5.97A7  -  4.50AZ 

T  -1.82A0H  +  0.550A«7  +  5.83A/cz 

U  2.42A0iz  -  0.908A«7  -  1.91A«Z  +  2.04A7  -  2.04AZ 

V  0.183A«:Z 

W  0.202A5iz 

X  -0.0213AK7-0.0611A/cz 

V  0.183A«Z  -  (3.68A5lz  +  0.797  AK*)(mJ/mi,) 

X  -0.0213A/c7  -  0.0611AK2  +  (0.423A5lz  +  0.0916AKz)(m2/m2w) 


realistic  for  any  underlying  theory,  for  which  all  couplings  would  be  expected  to  be 
generated  together. 

5.  Conclusions 


This  article  reviews  two  approaches  to  parameterizing  the  effects  of  new  physics 
for  precision  electroweak  measurements.  The  two  approaches  are  based  on  one  of 
the  following  two  assumptions.  Either:  (f)  the  new  physics  is  assumed  to  be  very 
heavy  in  comparison  with  the  weak  scale,  mz,  or:  (»)  it  is  not  assumed  to  be 
heavy,  but  it  is  assumed  to  dominantly  contribute  to  observables  through  oblique 
corrections.  These  two  approaches  contain  the  popular  formalisms  of  Peskin  et.ai, 
and  of  Altarelli  et.ai,  as  important  special  cases. 

The  advantage  of  parameterizing  the  data  in  this  way  is  the  efficiency  with 
which  it  permits  the  comparison  of  specific  models  to  the  data.  Rather  than  having 
to  perform  a  detailed  fit  to  the  data  of  every  proposed  theory,  it  is  possible  to  fit  the 
data  once  and  for  all  to  the  proposed  parameters.  To  constrain  any  particular  model 
it  is  then  simply  necessary  to  compute  these  parameters  in  terms  of  the  couplings 
of  the  underlying  theory.  A  conservative  estimate  of  the  allowed  range  for  these 
couplings  can  be  found  by  simply  using  the  appropriate  confidence  intervals  for  the 
basic  parameterization.  The  bounds  that  are  obtained  in  this  way  turn  out  to  be 
remarkably  similar  to  those  which  are  obtained  from  model-by-model  fits. 

A  number  of  phenomenological  analyses  have  been  performed  using  this  pro- 
cedure, some  of  which  have  been  briefly  summarized  here.  These  calculations  il- 
lustrate the  potential  applications  of  the  method,  and  the  simplicity  with  which  it 
may  be  carried  out. 
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Abstract. 

We  study  CP  violation  in  a  multi-Higgs  doublet  model  based  on  a  83  x  Z$  horizontal 
symmetry.  We  consider  two  mechanisms  for  CP  violation  in  this  model:  a)  CP 
violation  due  to  complex  Yukawa  couplings;  and  b)  CP  violation  due  to  scalar- 
pseudoscalar  mixings.  We  find  that  the  predictions  for  e'/e,  CP  violation  in  B 
decays  and  the  electric  dipole  moments  of  neutron  and  electron  are  different  between 
these  two  mechanisms.  These  predictions  are  also  dramatically  different  from  the 
minimal  Standard  Model  predictions. 

1.  Introduction 

The  origin  of  CP  violation,  is  one  of  the  outstanding  problems  of  particle  physics 
today.  So  far  CP  violation  has  only  been  observed  in  the  neutral  Kaon  system.  The 
observed  CP  violation  can  be  explained  in  many  models.  It  is  therefore  important 
to  study  other  CP  violating  experimental  observables  and  compare  the  results  with 
different  model  predictions.  Such  study  may  reveal  the  real  origin  of  CP  violation. 

In  the  minimal  SC7(3)c  x  SU(1}L  x  U(l)Y  Standard  Model  (MSM),  there  is  only 
one  Higgs  doublet.  When  the  Higgs  develops  valcuum  expectation  value  (VEV)  v, 
all  fermion  receive  masses.  In  the  mass  eigenstate  basis,  the  Higgs  coupling  to 
fermions  is  diagonal,  it  does  not  mediate  CP  violating  interaction.  However,  the 
coupling  of  the  charged  current  to  the  W-boson  becomes  complex.  It  is  given  by 


LC  =  -~UiVij7>>-djW+  +  H.C.  ,  (1) 

where  the  matrix  Vij  is  the  CKM  matrix  VJKA/[I]-  For  three  generations  of  quarks, 
there  is  a  non-removable  phase  in  the  matrix.  This  is  the  source  of  CP  violation  in 
the  MSM.  This  matrix  is  conveniently  parametrized  as,  following  Wolfenstein  [2] 

/         l-A2/2  A  A\3(p-irj)  \ 

VKM=\  -A  l-A2/2-M2A4r7  AA2  ,  (2) 

\  A\3(l-p-irj}  -A\2  1  / 

where  A  =  Vu,  =  0.221.  If  77  ^  0,  CP  is  violated.  Unitarity  constraints  on  the  matrix 
elements  provide  very  powerful  and  interesting  realtions.  The  most  interesting  one 
is  the  triangle  defined  by 


.  def5nes  the  triangle  shown  in  Figure  1  with  three  angles  or,  /?  and  7.  The 

f  Ih    triangle  is  given  by  42A6i//2.  CP  violation  in  the  neutral  Kaon  system 

h     e  lained  by  the  "box"  interaction[3j.  If  CP  violation  due  to  the  phase  ii 

PKM  matrix  is  the  only  source  for  CP  violation,  experiments  at  B  factories  w: 

able  to  determine  all  the  three  angles[4]. 


(AX3p,  AX3rt 


(0,0)  VusVcb  (A  X3 ,  0  ) 

Figure  1.  The  unitary  triangle 

Another  class  of  model  for  CP  violation  is  the  multi-Higgs  doublet  mod 
there  are  more  than  one  Higgs  doublets,  the  neutral  scalar  couplings  to  the  q 
are  not  necessarily  diagonal,  and  therefore  provide  new  sources  for  CP  violati 
CP  violation  can  arise  in  three  places  in  this  type  of  models:  1)  Non-trivial  phi 
the  VKM  mat"*;  2)Non-trivial  phases  in  the  Yukawa  couplings;  and  3)  Mixii 
scalar  and  pseudoscalar  Higgs  bosons.  In  cases  2)  and  3),  CP  violation  can  o 
at  the  tree  level  by  exchanging  neutral  Higgs  bosons.  In  this  talk  we  will  pr 
some  studies  of  CP  violation  in  multi-Higgs  models  with  flavour  changing  ne 
currents  at  the  trre  level  which  has  CP  violation  predominantly  throug  mechai 
2)  and  3). 

A  most  general  study  suffers  from  too  many  free  parameters.  To  have  a  de 
idea,  we  carry  out  the  study  in  a  S3  x  Z3  model  proposed  by  Ma[7].  This  mod< 
some  very  interesting  predictions  for  fermion  masses  and  their  mixings.  It  als 
interesting  predictions  for  CP  violation[8,  9,  10].  We  study  the  predictions  fc 
c'/e;  (ii)  CP  violation  in  the  neutral  B  system;  and  (iii)  the  neutron  and  ele 
electric  dipole  moments  (EDM).  We  compare  these  predictions  with  those  i 
MSM. 

2.  Yukawa  couplings  in  the  £3  x  Z3  model 

In  the  £3  x  Z3  model,  there  are  four  Higgs  doublets,  0i ,2,3,4.  The  quark 
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where  w  9^  1,  w3  =  1  is  the  £3  element.  The  Yukawa  couplings  consistent  with  the 
83  x  £3  symmetry  are  given  by 


—    f^qiL^R  +  qiL^n)  —  fs(q~3L<t>3dR  -f  £31,^45/0  —  hqzL^R  + 

where  <^,-  =  (<^f*,  ~4>~)T  .  Without  loss  of  generality  we  work  in  a  basis  where  all 
VEVs  are  real.  The  up-quark  mass  matrix  is  diagonal  :  Mu  =  Diag(f\vz,  f\v^,  /2fi), 
and  the  down-quark  mass  matrix  can  be  written  as,  with  a  suitable  choice  of  quark 
phases, 


Md=  la     0      b        ,  (7) 


with  a,  b,  c,  d  real  and  £  =  \£\e*ff  complex.  Md  can  be  diagonalized  by  a  bi-unitary 
transformation  Md  =  Vx,A/rfV^.  Here  Md  is  the  diagonalized  down  quark  mass 
matrix.  VL  and  VR  are  unitary  matrices.  Because  the  up  quark  mass  matrix  is 
already  diagonalized,  VL  is  the  CKM  matrix  VKM- 

It  is  convenient  to  work  in  a  basis  of  the  Higgs  bosons  in  which  the  Goldstone 
bosons  are  removed.  To  this  end  we  define  the  following[9] 


-Vi/V\2      V2V4/VUV124       — 

0  0 

0 


where  v^  =  v  ^  +  v|,  t/i24  =  ^  +  v|  -f  v|,  and  v2  =  vjf  -f  t;2  +  v|  -f  v|.  The 
transformation  is  the  same  for  both  the  neutral  and  charged  Higgs  bosons.  For  the 
neutral  Higgs  bosons,  G  =  ha  -f  iGz,  where  GZ  is  the  Goldstone  boson  'eaten'  by 
Z,  and  h°  is  a  physical  field  whose  couplings  are  the  same  as  the  Higgs  boson  in 
the  MSM.  For  the  charged  Higgs  bosons  G  is  the  Goldstone  boson  'eaten'  by  W. 
In  this  basis,  we  have 


„  Rf>h° 

Ly      =      -(£LMdD*-     ""'    "" 


+  H.C.,  (9) 

where  /i,  are  the  component  fields  of  Hi  with  //i  =  (/i/",  /if/>/2).  f/i,R  =  (U)CI^)L,R> 
and  DL,/?  =  (rf,  5, 6)J  R.  The  couplings  Yi  can  be  easily  expressed  in  term  of  quark 
masses,  V£,,H,  and  VEVs. 
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(--i  i     —  uu        —  ui        f(j, 

DxiliO       \  I  t3l 

Hehk         -         QkQ     Qki     p 

\Irnhl)  Uo    <     ftjy   U 

(10) 

where  ^,  /i  and  77,-  are  the  mass  eigenstates,  the  matrix  (a/?)  is  a  7  x  7  othogonal 
matrix,  and  (7)  is  a  3  x  3  unitary  matrix. 

The  specific  values  for  the  mixings  depend  on  the  details  of  the  Higgs  potential 
Unfortunately  they  are  not  determined.  To  simplify  the  problem,  we  will  discuss 
two  cases:  a)  CP  violation  only  comes  from  complex  Yukawa  couplings;  and  b)  CF 
violation  only  comes  from  the  mixings  of  real  and  imaginary  /*°[10j.  Case  a)  can  b« 
realised  by  constraining  certain  soft  symmetry  breaking  terms  in  the  potential[9] 
We  further  assume,  for  simplicity,  that  Retf-  are  the  mass  eigenstate  Ri  and  considei 
their  effects.  The  same  analysis  can  be  easily  carried  out  for  Imh®  in  the  same  way 
The  source  for  CP  violation  is  the  non-zero  value  for  a  which  is  a  free  parameter 
We  will  present  our  results  for  cr  =  80°,  which  is  close  to  the  maximum  of  th< 
allowed  phase.  Case  b)  can  be  realised  by  requiring  spontaneous  CP  violation.  Th< 
value  of  a  will  be  zero  and  CP  violation  arises  due  to  scalar-pesudoscalar  Higg 
boson  mixing.  For  illustration,  we  consider  the  effects  of  a  neutral  mixed  state 


R  =  cosOReh^  +  sin6Imh%  ,  (11 

and  for  the  charged  Higgs  boson  we  consider  mixing 

r?+  =  722/J J  +  723/4  ,  (12 

where  7^  are  complex  numbers,  and  (722 12  +  [723 12  =  1. 

The  parameters  a,  6,  c,  and  d  are  constrained  from  the  down  quark  masse 
and  the  CKM  mixings.  We  take  as  input  parameters  a  -  0.04<7eV,  b  -  0.25<7eV 
c  =  2.66GeV  ,  d  =  4GeV '.  The  mass  eigenvalues  for  the  down  quarks  are  quit 
insensitive  to  the  phase  <r.  For  both  cases,  we  have  mb  =  4.8GW,  ms  =  149Mel 
and  md  =  9.5MeV.  These  values  are  well  within  the  allowed  regions[ll].  The  CKI\ 
matrix  for  case  a)  is 

/  0.975  -0.222  0.00476        \ 

VKM  =          0.221  +  t0.0033  0.974  +  iO.014          0.043  -  iO.0015  I    ,  (1 

\-0.014+il.2x  10~5     -0.041  -  i6.8  x  10~4     0.998  -  ?0.034  / 

and  for  case  b) 

/  0.975        0.22        0.0048  \ 

VKM  =      -0.219     0.975     -0.0436  (1^ 

\-0.014     0.0415       0.999    / 

The  values  for  the  VEV's  are  not  fixed,  we  only  know  u3/u4  =  mu/mc.  We  wi 
use  the  values:  Vl  =  v2  =  44GeK,  v3  =  0.9  GeV  and  v4  =  238GeV  for  illustratioi 
We  shall  comment  on  effects  of  changing  these  values  later. 
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J.  cjonstraints  on  toe  riiggs  masses  trom  tne  neutral  K.  and  Jo  mesons 

The  63  x  ^3  model  has  very  restrictive  allowed  values  for  the  non-trivial  CP 
violating  phase  in  the  CKM  matrix.  The  CP  violating  measure  J[12]  is  less  than 
2.5  x  10~6  which  is  too  small  to  explain  CP  violation  in  the  neutral  Kaon  sysytem. 
Therefore  in  this  model  CP  violation  due  to  Higgs  boson  exchange  has  to  be  con- 
sidered. 

The  CP  violating  parameter  e  is  given  by 

,»T/4 

1     '      , 

where  M^2  is  the  matrix  element  which  mixes  K°  with  K°  ,  and  Am#  is  the  mass 
difference  between  rriKL  and  m/^.  Experimental  value  for  e  is  2.3  x  10~3elir/4.  The 
AS"  =  2  Hamiltonian,  responsible  for  M/2,  generated  by  exchanging  neutral  Higgs 
bosons  Ri  is  given  by 


(aki  -  ia'ki)Yk%)~}s.  (16) 


We  obtain 

M&    =     <K°\Heff\K°> 


ml 


Here  we  have  used  the  vaccum  saturation  and  factorization  approximation  results 
for  the  matrix  elements[13].  The  contribution  to  the  mass  difference  Am/f  is  given 
by  2/JeMi2-  Similar  formula  holds  for  the  neutral  B  system. 

To  constrain  the  Higgs  boson  masses,  we  require  that  the  neutral  Higgs  boson 
contributions  to  the  mass  differences  in  the  neutral  K  and  B  systems  to  be  less 
than  the  experimental  values:  Am^/m.^  =  7  x  10~15,  and  Amfi/mfj  =  8  x  10~14. 
We  find  that  for  case  a)  the  tightest  constraints  on  the  masses  of  Reh®  2  are  from 
the  mass  difference  AMjg  of  the  neutral  B  mesons  which  gives  Mh1  >  2.9TeV  and 
M/,3  >  3.1Te  V.  With  these  masses,  Reh^>2  can  n°t  produce  large  enough  €.  Similar 
consideration  yields  Mhs  >  3.5TeV,  and  we  find  the  experimental  value  of  e  can 
now  be  produced  if  the  mass  is  about  5.6TeV.  The  mass  difference  AM^  of  the 
neutral  K  mesons  gives  weaker  bounds  in  all  cases.  For  case  b),  the  experimental 
value  of  AA/#  constrains  A//?  >  3TeV.  From  the  experimental  value  off,  we  obtain 
=  1.1  x  10~8GeV~2  which  implies  MR  <  7TeV. 


4.  Predictions  for  ('  /€. 

In  this  section  we  study  the  direct  CP  violation  in  KL,S  —  *  2ir  decays.    CP 
violation  in  these  processes  is  characterized  by  the  value  of  c'  /c.  c'/c  is  defined  as 


-  _T     T*Viv«.     ftnr\r«1omont  TABII**.    1  ftQFi  77 


In  the  MSM,  the  contribution  to  e'/e  is  donminantly  from  the  gluon  penguin. 
However,  for  large  top  quark  mass  of  order  200  GeV,  the  electroweak  penguin 
also  contribute  significantly  and  may  even  cancel  the  gluon  penguin  contribution. 
There  are  large  uncertainties  from  hadronic  matrix  evaluation,  A^  dependce 

and  errors  in  the  CKM  matrix.  The  range  for  e'/c  is  predicted  to  be  between 
10-4  to  10-3  for  A^QCf>)  =  300  MeV[14].  This  is  consisten  with  the  experimental 
constraints  from  Fermilab,  (7.4  ±  6.0)  x  10~4  and  CERN,  (23  ±  6.5)  x  10~4[15]. 

In  the  53  x  Z3  model  there  are  several  contributions  to  c'/e.  Due  to  large  neutral 
Iliggs  masses,  the  neutral  Higgs  boson  contributions  to  e'/€  are  verv  sma^-  However 
there  may  be  large  contributions  from  the  charged  Higgs  bosons.  The  dominant 
contribution  is  from  the  charged  Higgs  boson  mediated  gluon  penguin.  The  relevant 
AS  =  1  effective  Larangian  is  given  by 

/2iA^G    ,  (19) 


where  ££„  is  the  gluon  field  strength,  Aa  are  the  SU(3)c  generators,  and 
~h     = 


Jit  ht 

(-/n 


where  C;  =  (a.(mfc)/<*«GO)14/23  w  0.17  is  the  QCD  correction  factor,  and  /  is 
summed  over  u,  c  and  t.  We  will  use  a3(n)  w  47T/6  for  p  =  IGeV.  The  above  effec- 
tive Lagrangian  will  generate  a  non-zero  value  for  7m>4o[16].  Z-AS=I  also  generates 
a  non-zero  value  1LD  for  CP  violation  in  K°  -  KQ  mixing  due  to  long  distance 
interactions  through  KQ  and  TT,  77,  rj'  mixings[17].  One  obtains[17,  18] 


«    -0.196D, 


2mKImM&iLD     «     0.8  x  lO'^A  +  h)(GeV3)  ,  (21) 

where  D  is  a  supression  factor  of  order  0(m^,mJ)/A2.  £/eLD  is  of  order  -0.014  to 
-0.1. 

We  find  that  in  both  a)  and  b)  cases,  the  donimant  contributions  are  from  the 
top  quark  in  the  loop  arising  from  mixing  in  the  charged  Higgs  boson  couplings. 
For  case  a),  we  have 

mt      ,     m? 


with  01  =  18,  a2  =  25  and  a3  =  -7. 
And  for  case  b),  we  have 


(23) 
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The  contributions  to  e  can  be  significant  in  both  cases  depending  on  the  Higgs  boson 
masses  and  the  CP  violating  parameter  /m^aTjia)-  We  W*N  study  constraints  on 
these  parameters  in  Sec.VI.  When  these  constraints  are  taken  into  account,  CLD  is 
generally  constrained  to  be  less  than  3  x  10~5  for  case  a)  and  e'/f  to  be  less  than 
3  x  10~5.  However,  for  case  b),  CLD  can  still  be  as  large  as  10~3  and  e'/e  can  be 

io-3. 

5.  CP  violation  in  the  neutral  B  system. 

There  are  many  processes  which  can  test  CP  violation  in  the  neutral  B  system. 
Some  particularly  interesting  ones  are[4] 


Bd  -+  J/i/iKs  ,  Bd  ->  *+*-  ,  B3  -+  pKs  .  (24) 

The  differences  of  time  variation  of  decay  rates  for  the  above  processes  and  their 
CP  tranformed  states  are  given  by 


a  JCP     = 

JL  \u  ^t;  — r  JCP;  T  JL  {jj  ^j  — ••  JCP; 

=  THAP ^~" '  ^ 

where  fcp  indicates  the  final  states.  A  is  defiaed  as 

(26) 


where  (q/p)e  =  v/M^*/Af]^,  A  and  yl  are  the  decay  amplitudes.  If  the  final  state 
contains  K$,  S  =  (q/p)/c  which  has  a  phase  of  order  10~3.  For  other  cases  S  is 
equal  to  one. 

Non-zero  asymmetry  a/cp  signals  CP  violation.  If  |A|  is  not  equal  to  one,  it 
indicates  that  CP  is  violated  in  the  decay  amplitudes.  In  the  MSM  |A|  is  equal  to 
one  to  a  very  good  approximation  for  the  above  three  processes.  The  asymmetries 
are  proportional  to  JmA.  In  the  MSM,  the  processes  in  Eq.(24)  measure  the  three 
angles  a,/3  and  7, 


Im\(Bd 

Im\(Bd  ->7r+7r-)  =  sm2c*,  (27) 


In  the  63  x  Z$  model,  the  situation  is  very  different  .  Although  the  CP  violating 
decay  amplitudes  A  and  A  are  small,  the  phase  of  \/M^2*  /Mfy  in  the  B~B  mixing 
due  to  neutral  Higgs  boson  exchange  can  be  large.  In  case  a),  there  is  CP  violation 
arising  from  the  phase  in  Yukawa  coupling  of  Higgs  bosons,  as  well  as  CKM  matrix, 
but  the  former  is  much  larger.  The  three  meaurements  in  Eq.(24)  do  not  measure 
the  angles  a,  /?  and  7  anymore.  The  first  two  processes  will  mostly  measure  the 
phases  in  M12*  .  We  have 

Im\(B<;  -+  T+7r~)     «     7mA(Bd  -+  J/*l>Ks)  <  0.42  ,   from  fle/ij, 

Im\(Bd  ->  ir+ir~)     »     Im\(Bd  -*  J/J>KS)  <  0.19  ,   from  Reh%  ,  (28) 

Im\(Bd  -»  JT+  TT-)     «     7mA(5d-4  7/^5)  w  0.19  ,    form  Reh%  . 


Im\(Bd  -*•  *+O  w  JTnA(B<,  -*  J/V>tfs)  *  -0.25  .  (29) 


7mA  for  B,  -*  pKs  is  different  for  a)  and  b).  For  case  a),  the  neutral  Higgs 
boson  contributions  to  the  asymmetry  are  small.  However  7mA (B,  — «•  pKs)  due 
to  CP  violation  in  the  KM-matrix  can  be  about  0.1.  For  case  b),  7mA(B,  — »•  pKs} 
from  neutral  Higgs  boson  exchange  is  only  about  0.02. 

If  interpreted  as  in  Eq.(27),  we  find  for  case  a),  sm2a  =  -sm2/?,  sinf  =  0.05, 
and  a  +  (3  -j-  7  ^  180°.  For  case  b),  we  have,  stn2a  =  -sm2/?,  sinf  =  0.01.  We 
again  find,  a  +  ft  +  7  /  180°. 

6.     The  neutron  and  electron  electric  dipole  moments. 

The  EDMs  of  neutron  and  electron  in  the  MSM  model  are  extremely  small.  The 
neutron  EDM  Dn  can  only  be  generated  at  three  loop  level.  It  is  predicted  to  be 
less  than  10~31  ecm[19].  The  electron  EDM  is  even  smaller  (<  10~36  ecm)[20]. 
The  experimental  upper  bound  on  the  neutron  EDM  is  1.2  x  10~25  ecm[21].  For 
the  electron  the  bound  is  about  10~26  ecm[22].  If  future  measurement  will  obtain 
an  EDM  larger  than  the  MSM  model  prediction,  it  will  be  an  indication  for  new 
physics  beyond  the  MSM. 

The  prediction  for  the  EDMs  in  the  S3  x  Z3  are  very  different  from  the  MSM. 
They  may  reach  the  experimental  bounds. 

At  the  one  loop  level,  the  neutral  Higgs  contributions  to  the  neutron  EDM 
are  small.  For  case  a)  we  find  that  Dn  <  2  x  10~28  ecm  For  case  b),  we  have 
Dn(d)  fa  2  x  10~29ecm.  The  u  quark  EDM  is  zero  at  the  one  loop  level. 

There  may  be  large  contributions  to  the  neutron  EDM  at  the  two  loop  level 
from  the  Weinberg  operator[23]  Dn(W)  and  from  the  color  dipole  moment  of  gluon 
due  to  Bar-Zee  type  of  diagrams[24,  25]  Dn(BZ).  In  our  model,  we  have 

Dn(W)     t* 

U17I-  7U-Q  Trt^o 

m         f*  *->•»  *       /  %-M  J-      \ 

(30) 


where  C,w  «  6  x  10~6,  and  £iz  «  10~2  are  the  QCD  correction  factors,  cu  =  2  and 
cj  =  4,  and  A  w  IGe  V  is  the  chiral  symmetry  breaking  scale.  The  parameters  ImZ 
are  given  by 

ImZ         =       - 


ImZ'd     =  M^H-^jy^^o^  +  ia',,,)).  (31) 


For  case  a),  because  there  is  no  CP  violation  in  the  up  quark  sector  only  down 
quark  loops  contribute,  Dn(W]  from  the  Weinberg  operator  at  the  two  loop  level 
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is  small.  There  are  non-zero  Dn(BZ)  from  d-quark  due  to  Bar-Zee  mechanism. 
We  find  that  the  contributions  from  Reh\%  is  also  small  (<  4  x  10~28ecm).  Reh% 
contribution  is  even  smaller. 

For  case  b),  the  two  loop  contributions  to  the  EDM  are  significantly  larger 
because  in  this  case  there  is  CP  violation  in  the  top  quark  interaction.  We  have 

Dn(BZ,u]  «  (2  -  8)  x  l(T26ecm  , 

Dn(BZ,  rf)  «  (2  ~  8)  x  l(T27ecm  ,  (32) 

for  mt  between  100  GeV  to  200  GeV.  The  contribution  from  the  Weinberg  operator 
is  small,  Dn(W)  <  W~30ecm. 

The  charged  Higgs  bosons  can  also  contribute  to  the  neutron  EDM.  At  the  one 
loop  level,  the  u  and  d  quark  EDM  are  given  by 


dd    =  '"MTy.'Tj^V^jifyAiCVn")!,].  (33) 

£^iTi       i  ft    _L  *t4- 

For  du,  /  is  summed  over  d,  s,  and  b;  and  for  dd,  I  is  summed  over  u,  c,  and  t.  At 
the  two  loop  level,  there  is  a  large  contribution  from  the  Weinberg  operator, 


where  £'w  =  3  x  10~4  is  the  QCD  correction  factor,  and 

lmZ(\  =  Im^i^VjKMYMVKMY^as]  .  (35) 


We  find  that  in  case  a)  the  dominant  contributions  are  from  the  two  loop  Wein- 
berg operator.  We  have 


Dn(W)     a    biXW-ln_ecm,  (36) 


where  61  =  1.6,  &2  =  1.4  and  63  =  1.2  x  10~6. 

Requiring  the  contributions  to  be  less  than  the  experimental  value,  we  find  the 
masses  of  h^2  have  to  be  larger  than  2.5TeV.  There  is  no  constraint  on  ^3"  mass. 
Combining  this  information  with  those  from  Eqs.(22)  and  (23),  we  find  the  charged 
Higgs  boson  contributions  to  CID  is  less  than  3  x  10~5,  and  (!  jc  is  less  than  3  x  10~5. 

For  case  b),  we  find  the  dominant  contribution  is  from  the  one  loop  d  quark 
EDM.  We  have 


Dn(d]  *  5.4  x  lO-19/m(T2272*3)'n  ecm  .  (37) 


Requiring  Dn(d)  to  be  less  than  the  experimental  value,  CLD  is  constrained  to  be 
less  than  10~3,  and  c'/f  can  still  be  of  order  10~3.   Assuming  maximum  mixing, 


UJL      UlC 


the  charged  right  handed  leptons  as  their  quark  partnenrs[7]  .  The  mass  matrix 
and  Yukawa  couplings  for  the  charged  leptons  are  similar  to  the  down  quarks.  One 
simply  changes  the  parameters  (a,  b,  c,  d  ,  and  £)  for  quarks  to  (a/  ,  &/  ,  c\  ,  di  ,  and 
6  =  !£|elV)  for  leptons.  We  use[8j:  a,  =  0.106GeV  ,6,  =  0  ,c,  =  1.781GeK  ,cfj  = 
8.6  x  10~3(7eV.  For  this  set  of  parameters,  we  have  me  =  0.51  IMeV,  m^  = 
106MeV  and  mT  =  1784MeV  which  are  in  good  agreement  with  experimental 
data. 

The  calculation  for  the  electron  EDM  is  similar  to  the  neutron  EDM.  For  case 
a)  we  find  that  the  one  loop  contributions  are  small  (<  1029ecm)  with  ff'  =  80°. 
However  the  two  loop  contribution  due  to  Bar-  Zee  mechanism  [24,  26]  can  be  as 
large  as  10~27  ecm,  for  mt  <  ZQQGeV.  For  case  b),  we  find  that  the  one  loop  and 
two  loop  contributions  are  small  (<  10~33ecm). 

7.  Conclusions 


We  have  studied  in  detail  some  effects  due  to  two  different  CP  violating  mecha- 
nisms in  the  S3  x  Zs  model.  Both  mechanisms  discussed  in  this  paper  can  explain 
the  observed  CP  violation  in  the  neutral  K  system.  CP  violation  in  the  neutral 
K  system  and  the  mass  difference  in  the  neutral  B  system  constrain  the  neutral 
Higgs  boson  masses  to  be  in  the  multi  TeV  region.  In  the  previous  discussions  we 
have  chosen  a  particular  set  of  parameters.  The  detailed  predictions  will  depend 
on  all  the  parameters,  but  the  general  features  will  remain  to  be  the  same.  We 
have  checked  the  predictions  using  another  set  of  parameters,  we  find  the  changes 
are  not  significant  except  that  the  electron  EDM  for  case  b)  can  reach  10~28  ecm. 
The  predictions  presented  here  represent  the  typical  values  for  the  observables.  We 
summarize  our  results  for  e'/€i  CP  violation  in  B  decays  and  the  neutron  and  elec- 
tron EDMs  in  the  table  below.  It  is  clear  from  the  table,  that  the  predictions  for  the 
observables  considered  are  very  different  in  the  MSM  and  in  multi-Higgs  doublet 
models.  Future  experiments  will  be  able  to  rule  out  some  models. 


Observable 

MSM 

Case  a) 

Case  b) 

€ 

Input 

Input 

Input 

c'/e 

10~4  ~  10~3 

-  io-5 

~ur3 

B  Decay 
Asymmetry 

a  +  /?  +  7  =  180° 

a  +  /?  +  7  ^  180° 
sin27  w  —  sin2/3  « 
0.2  ~  0.4 
sirry  <  0.1 

a-f  £  +  7^180° 
sfn27  w  —  sin2/?  fa 
0.25 
st'rry  «  0 

jDn(ecm) 

10~31  ~  1CT33 

can  reach  10~25 

can  reach  10~25 

De  (ecm) 

<  10~36 

<  io-1J7 

<  io-28 
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Abstract.  The  minimal  grand  unified  supergravity  model  is  discussed.  Requiring  ra- 
diative breaking  of  the  electroweak  gauge  symmetry,  the  unification  of  b  and  r  Yukawa 
couplings,  a  sufficiently  stable  nucleon,  and  not  too  large  a  relic  density  of  neutralinos  pro- 
duced in  the  Big  Bang  constrains  the  parameter  space  significantly.  In  particular,  the  soft 
breaking  parameter  m^  has  to  be  less  than  about  130  GeV,  and  the  top  quark  Yukawa 
coupling  has  to  be  near  its  (quasi)  fixed  point.  The  former  condition  implies  m-g  <  400 
GeV  and  hence  very  large  production  rates  for  gluino  pairs  at  the  LHC,  while  the  latter 
constraint  implies  that  the  lighter  stop  and  sbottom  eigenstates  are  significantly  lighter 
than  the  other  squarks,  leading  to  characteristic  signatures  for  gluino  pair  events. 


1.  Introduction 

Supersymmetry  (SUSY)  [1]  now  seems  to  be  the  most  popular  extension  of  the 
Standard  Model  (SM).  There  are  several  reasons  for  this.  First  of  all,  SUSY  solves 
the  (technical)  hierarchy  problem  [2]  (also  known  as  finetuningor  naturalness  prob- 
lem), i.e.  stabilizes  the  weak  scale  against  radiative  corrections  that  otherwise  tend 
to  pull  it  up  towards  the  GUT  or  Planck  scale.  This  is  also  true  for  SUSY's  main 
competitor,  technicolor  (TC)  [3],  although  through  a  completely  different  mecha- 
nism. However,  it  seems  increasingly  difficult  to  find  realisations  of  the  TC  idea 
that  are  not  ruled  out,  or  at  least  strongly  disfavoured,  by  LEP  measurements,  in 
particular  of  the  so-called  5  parameter  [4]  and  of  the  Z  — >•  66  partial  width  [5].  In 
contrast,  sparticles  decouple  quickly,  i.e.  do  not  affect  predictions  for  LEP  observ- 
ables  noticably  if  sparticle  masses  exceed  100  GeV  or  so,  so  that  LEP  measurements 
can  only  rule  out  SUSY  if  they  also  exclude  the  SM  (with  a  light  Higgs  boson)  at  the 
same  time.  Moreover,  if  sparticles  are  light,  agreement  between  LEP  measurements 
and  predictions  is  often  (slightly)  improved  compared  to  the  non-supersymmetric 
SM  [6]. 

Another  strong  motivation  for  SUSY  is  that  the  minimal  supersymmetric  stan- 
dard model  (MSSM)  allows  for  a  predictive  grand  unification  of  all  gauge  inter- 
actions, in  contrast  to  the  SM  [7],  [8].  Actually  GUTs  were  in  better  agreement 
with  data  [9]  with  than  without  SUSY  already  before  LEP  was  turned  on,  but  the 
higher  precision  achieved  by  LEP  experiments  has  made  this  argument  much  more 
convincing. 

All  t.bpsft  aremmeni-.s  are  indeoendent  of  the  wav  SUSY  is  broken,  provided  only 
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group,  517(5).  Moreover,  only  the  minimal  necessary  number  of  fields  will  be 
assumed  to  exist  at  the  GUT  scale  as  well  as  at  lower  energies.  As  discussed  in  more 
detail  in  sec.  2,  this  leads  to  a  fairly  predictive  (constrained)  scenario  [17],  although 
some  freedom  in  the  choice  of  parameters,  and  of  the  resulting  phenomenology,  still 
exists.  It  should  be  emphasized  that  it  is  not  at  all  trivial  that  this  simplest  of  all 
SUSY  GUTs  is  still  experimentally  viable  [18]. 

The  remainder  of  this  contribution  is  organized  as  follows.  Sec.  2  contains 
a  description  of  the  model  as  well  as  a  discussion  of  the  constraints  that  have 
been  imposed.  In  sec.  3  the  resulting  (s)particle  spectrum  will  be  discussed  in 
some  detail;  this  section  contains  the  only  material  that  cannot  be  found  in  our 
publication  [19].  In  sec.  4  signals  for  sparticle  production  at  the  LHC  are  treated 
for  a  few  characteristic  spectra;  the  emphasis  will  be  on  methods  to  distinguish 
these  spectra  from  each  other  as  well  as  from  the  kind  of  spectrum  that  has  been 
studied  previously.  Finally,  sec.  5  is  devoted  to  summary  and  conclusions. 

2.  The  model 

As  explained  in  the  Introduction,  I  will  assume  a  simple  form  for  the  sparticle 
and  Higgs  spectrum  at  very  high  energy  scales.  In  fact,  one  ultimately  hopes  to 
describe  all  of  SUSY  breaking  by  a  single  parameter,  the  equivalent  of  the  elec- 
troweak  symmetry  breaking  scale  (\/2GV  )-1/2  =  246  GeV.  At  present  we  are  still 
far  from  this  ambitious  goal,  so  it  is  prudent  to  allow  at  least  a  few  free  parameters 
to  describe  the  spectrum.  Here  I  will  follow  the  standard  assumptions  [14]  and 
introduce  four  independent  SUSY  breaking  parameters:  mg,  which  contributes  to 
the  squared  masses  of  all  scalar  bosons;  a  common  gaugino  mass  mi/2>  ar*d  com- 
mon nonsupersymmetric  trilinear  and  bilinear  scalar  interaction  strengths  A  and 
B,  respectively  [20].  In  addition  one  has  to  introduce  a  supersymmetric  Higgs(ino) 
mass  p.  in  order  to  avoid  the  existence  of  a  (visible)  axion  and  to  give  masses  to 
both  up-  and  down-type  quarks.  Altogether  the  masses  of  the  two  5t/(2)  doublets 
of  Higgs  bosons  needed  in  any  realistic  SUSY  model  are  then 

ml(Mx]  =  ml(Mx)  =  ml  +  f(Mx\  (I) 

One  can  show  quite  easily  that  spontaneous  SU(2)  x  t/(l)y  breaking  is  not 
possible  as  long  as  m2H  —  m^.  Fortunately  this  degeneracy  is  lifted  by  radiative 
corrections.  The  reason  is  that  H  only  has  Yukawa  couplings  to  up-type  quarks, 
while  H  only  couples  to  down-type  quarks  and  leptons.  Of  course,  the  t  quark 
belongs  to  the  former  category,  and  has  by  far  the  largest  Yukawa  coupling  of  all 
SM  fermions  (unless  tan/?  >•  1;  see  below).  This  implies  that  radiative  corrections 
from  Yukawa  interactions  will  be  much  larger  for  m^  than  for  m^.  The  crucial 
point  is  that  these  corrections  reduce  the  (running)  squared  mass  when  going  to 
lower  energy  scales,  eventually  leading  to  nonzero  vevs  for  H  and  H.  As  already 
emphasized  in  the  Introduction,  this  mechanism  will  only  work  if  logM^/A/z  ^  1 
and  the  top  Yukawa  coupling  ht  is  large  [21]. 

The  running  of  the  soft  SUSY  breaking  parameters  as  well  the  of  the  gauge 
and  Yukawa  couplings  is  described  by  a  set  of  renormalization  group  equations 
(RGE)  [22].  As  a  consequence  of  the  assumption  of  minimal  particle  content  the 
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from  the  input  parameters  at  MX-  Of  course,  at  the  weak  scale  certain  equalities 
have  to  be  satisfied.  First  of  all,  we  know  that 

2  +  {^°)2=M|.  (2) 


It  is  often  convenient  to  introduce  the  weak  scale  parameter  tan/?  =  (HQ}/(H°}. 
With  the  help  of  this  parameter,  eq.(2)  can  be  solved  for  ^  at  the  weak  scale 

Qo:[23] 

-M|,  (3) 


, 
tan  p  —  1  2 

where  m2  ,  m2  are  the  nonsupersymmetric  contributions  to  the  squared  Higgs  masses, 
i.e.  m^  =  m\  +  /*2,  m^  =  m2  +  /i2.  For  heavy  top,  m2(Q0)  is  negative,  giving  a 
positive  (and  usually  quite  large)  contribution  to  /A  A  similar  equation  determines 
B  •  /i  at  scale  Q0  in  terms  of  m2,  m2  and  tan/?. 

Having  fixed  MZ,  we  are  left  with  the  parameters  mo,mi/2  and  A  (at  the  GUT 
scale)  as  well  as  tan/?  (at  the  weak  scale).  In  addition  the  mass  mf  of  the  top 
quark  is  an  important  parameter,  since  the  top  Yukawa  coupling  plays  a  vital  role 
in  radiative  gauge  symmetry  breaking.  At  this  point  the  assumption  of  a  minimal 
5f/(5)  GUT  helps  to  further  reduce  the  number  of  free  parameters.  The  reason  is 
that  minimal  5(7(5)  implies  the  equality  of  b  and  r  Yukawa  couplings  at  scale  MX  • 
As  has  been  shown  by  several  groups  [26],  this  can  only  be  brought  into  agreement 
with  the  experimentally  measured  ratio  mb/mr  if  either  ht  is  close  to  its  upper 
bound  or  if  hi  ~  /it,  which  implies  tan/?  ~  mj/mj,  [27].  The  second  choice  not  only 
necessitates  some  finetuning  in  this  minimal  scenario  [28],  it  also  makes  it  more 
difficult  to  satisfy  proton  decay  constraints  (see  below).  I  therefore  only  consider 
the  first  solution  here.  Within  the  precision  of  a  1-loop  calculation  it  can  simply 
be  implemented  by  taking 

ht(Mx)  =  2.  (4) 

This  ensures  that  at  low  energies  ht  is  very  close  to  its  infrared  quasi  fixed  point 
[26],  which  implies 

m((Tnt)  ~  190  GeV  •  sin/?,  (5) 

where  mt(mt)  is  the  running  (MS)  top  mass  at  scale  mt;  the  on-shell  (physical) 
top  mass  is  about  5%  larger.  Eq.(5)  reduces  the  number  of  free  parameters  from 
five  to  four  in  this  scenario.  Assuming  m((pole)<  185  GeV,  as  indicated  by  LEP 
data  [29]  if  the  Higgs  is  light,  then  implies 

tan/?  <  2.5.  (6) 

In  addition  to  the  relations  discussed  so  far  a  number  of  conditions  has  to  be 
satisfied.  These  can  all  be  expressed  in  terms  of  inequalities,  i.e.  they  define  allowed 
ranges  of  parameter  values  rather  than  determining  them  uniquely.  One  important 
constraint  emerges  from  the  requirement  that  nucleons  should  be  sufficiently  long- 
lived.  In  minimal  SUSY  S£/(5)  the  main  contribution  to  nucleon  decay  comes 
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from  the  exchange  of  the  fermionic  superpartners  of  the  517(5)  partners  of  the 
elw.  Higgs  bosons,  i.e.  from  higgsino  triplet  exchange  [30].  The  reason  is  that  the 
corresponding  diagrams  are  only  suppressed  by  one  power  of  a  mass  O(Mx),  as 
compared  to  two  such  powers  for  5/7(5)  gauge  boson  exchange.  However,  while 
higgsino  exchange  suffices  to  violate  both  baryon  and  lepton  number,  it  leads  to 
two  sfermions,  rather  than  two  fermions,  in  the  final  state.  These  sfermions  have 
to  be  transformed  into  a  lepton  and  an  anti-quark  by  gaugino  (mostly  chargino) 
exchange,  i.e.  the  decay  only  occurs  at  1-loop  level.  The  matrix  element  therefore 
contains  a  so-called  dressing  loop  function,  which  scales  like  mi/2/mo  ^or  mo  ^ 
Altogether  one  thus  has  [30] 

(7) 


where  m^3  is  the  mass  of  the  Higgsino  triplet,  and  the  factor  tan/?  appears  since 
Yukawa  couplings  to  d  and  s  quarks  grow  oc  tan/?.  The  experimental  lower  bound 
on  the  proton  lifetime  rp  gives  an  upper  bound  on  the  matrix  element  (7);  assuming 
conservatively  that  m^3  could  be  as  much  as  ten  times  larger  than  .he  scale  MX 
where  the  gauge  couplings  meet,  and  taking  into  account  that  we  are  only  interested 
in  rather  small  values  of  tan/?,  eq.(6),  a  conservative  interpretation  of  the  constraint 
imposed  by  the  bound  on  rp  is8  [30]  [31] 

m0  >  min(300  GeV,  3m1/2).  (8) 

Another  important  constraint  can  be  derived  from  the  requirement  that  relic 
LSPs  produced  in  the  Big  Bang  do  not  overdose  the  universe,  in  which  case  it  would 
never  have  reached  its  present  age  of  at  least  1010  years.  In  minimal  SUGRA  the 
lightest  supersymmetric  particle  (LSP)  is  always  the  lightest  of  the  four  neutralino 
states.  Moreover,  the  large  value  of  ht,  eq.(4),  and  small  tan/?  (6)  imply  that 
|^(<2o)|  is  quite  large,  see  eq.(3).  The  LSP  is  therefore  always  a  gaugino  (mostly 
bino)  with  small  higgsino  component.  The  LSP  relic  density  is  essentially  inversely 
proportional  to  its  annihilation  cross  section,  summed  over  all  accessible  channels. 
Gaugino-like  LSPs  mostly  annihilate  into  //  final  states  [32],  where  /  stands  for 
any  SM  fermion  with  mass  below  that  of  the  LSP.  This  final  state  is  accessible 
via  /  exchange  in  the  t  or  u  channel,  as  well  as  via  the  exchange  of  the  Z  boson 
or  one  of  the  neutral  Higgs  bosons  in  the  s  channel.  However,  the  constraint  (8) 
implies  that  the  /  exchange  contribution  is  strongly  suppressed,  due  to  the  large 
sferrnion  masses.  (Recall  that  all  sfermions  get  a  contribution  -frr^  to  their  squared 
masses  at  scale  MX-)  Moreover,  since  both  mo  and  |/jj  are  large,  most  Higgs  bosons 
are  very  heavy.  Finally,  the  LSP-LSP-Z  coupling  needs  two  factors  of  the  small 
higgsino  component  of  the  LSP,  while  the  LSP-LSP-Higgs  couplings  only  need  one 
such  factor.  Therefore  the  only  potentially  large  contribution  to  LSP  annihilation 
comes  from  the  exchange  of  the  light  neutral  Higgs  boson  k°.  Fortunately  it  has 
recently  been  shown  [33]  that  this  contribution  suffices  to  reduce  the  LSP  relic 
density  to  acceptable  values  for  a  substantial  range  of  LSP  masses,  provided  it  is 
below  mh/2.  The  upper  bound  (6)  on  tan/?  implies  that  m>,  <  110  GeV  even  after 
radiative  corrections  [34,  25]  are  included,  while  Higgs  searches  at  LEP  imply  [35] 
m/,  >  63  GeV.  (The  couplings  of  h°  are  very  similar  to  that  of  the  SM  Higgs  boson 


this  implies 

60  GeV  <  m1/2  <  130  GeV.  (9) 

It  should  be  emphasized  that  such  a  strong  upper  bound  on  mi/2  onty  holds  in 
this  specific  scenario.  If  we  give  up  on  the  unification  of  b  and  r  Yukawa  couplings, 
we  can  allow  smaller  ht  and/or  larger  tan/?,  leading  to  smaller  values  of  |/i($o)|> 
and  hence  a  larger  higgsino  component  of  the  LSP  and  stronger  annihilation  mio 
final  states  containing  Higgs  and /or  gauge  bosons.  If  we  allow  for  a  sufficiently 
non-minimal  GUT  Higgs  sector  the  bound  on  the  proton  lifetime  can  be  satisfied 
even  if  the  constraint  (8)  is  violated,  allowing  for  efficient  LSP  annihilation  through 
/  exchange.  Finally,  if  we  allow  new,  R— parity  violating  interactions,  relic  LSPs 
could  have  decayed  a  long  time  ago,  and  no  constraint  on  LSP  annihilation  could 
be  given. 

Eqs.(4),  (6),  (8)  and  (9)  describe  the  basic  allowed  parameter  space  of  the  model, 
except  for  the  A  parameter.  The  bounds  on  this  parameter  are  intimately  linked 
to  to  the  details  of  the  (s)particle  spectrum,  which  is  the  topic  of  the  next  section. 

3.  The  spectrum 

In  this  section  I  discuss  the  sparticle  and  Higgs  spectrum  of  the  model  defined  in 
the  previous  section,  with  emphasis  on  features  that  are  relevant  for  "new  physics" 
searches  at  colliders;  see  also  refs.[36,  37]  for  recent  discussions  of  the  spectrum  in 
mSUGRA  models  with  top  Yukawa  coupling  close  to  its  fixed  point. 

A  general  feature  of  all  mSUGRA  models  is  that  the  Higgs(ino)  mass  parameter 
p,  is  not  an  independent  variable,  but  can  be  computed  from  the  SUSY  breaking 
parameters,  mt  and  tan/?  (as  well  as  Mz,  of  course),  as  described  by  eq.(3).  In 
general,  the  r.h.s.  of  that  equation  is  a  complicated  function  of  all  input  parameters, 
which  has  to  be  computed  by  solving  the  relevant  RGE  [22]  numerically.  However, 
the  following  analytical  expression  are  often  sufficient  for  practical  purposes: 

mi(Qo)  ^  mo  -f-  0.5mJ/2;  (10a) 

m^Qo)  -  fnl(Qo) r-y— ,    where  (10b) 


-f 


1  _  '  mt 


190  GeV  sin/? 


Eqs.(lO)  reproduce  the  exact  numerical  results  to  10%  or  better  if  Q0  is  around  350 
GeV  (which  corresponds  [25]  to  squark  masses  around  600  GeV),  and  if  hb  <  1, 
which  is  always  true  here  due  to  the  upper  bound  (6)  on  tan/?.  Note  that  mt  in 
eqs.(10)  is  the  running  top  mass  mt(mt).  The  expression  in  square  brackets  in 
eq.(lOc)  will  therefore  always  be  small  if  ht  is  close  to  its  fixed  point,  see  eq.(5). 
The  resulting  very  weak  dependence  of  X2  on  A  has  also  been  observed  in  ref  [361 
[38].  J 
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certainly  true  in  the  given  scenario.  Specifically,  if  ht  is  close  to  the  fixed  point  one 

has 

2/^  x         21  +  0.44  tan2/? 
2 


where  I  have  used  (6)  in  the  second  inequality;  the  lower  bound  (8)  on  mo  then 
implies  that  the  LSP  is  indeed  always  a  very  pure  gaugino,  as  stated  in  the  previous 
section.  It  also  means  that  the  two  heavier  neutralinos  and  the  heavier  chargino, 
whose  masses  are  all  very  close  to  \p(Qo)\,  will  be  very  difficult  to  detect:  Their 
production  cross  section  at  the  LHC  is  much  too  small  to  yield  a  viable  signal,  while 
they  are  too  heavy  to  be  produced  at  all  at  a  linear  e+e~  collider  with  ^/s  ~  500 
GeV,  which  is  likely  to  be  the  next  high-energy  e+e~  collider. 
Another  useful  identity  is  [25] 


sin2/?        ' 
where 

m?  ~  ml  +  0.5m?/2  +  Q.5MJ  cos2/3  (13) 

is  the  squared  sneutrino  mass  and  mp  is  the  mass  of  the  pseudoscalar  Higgs  boson, 
which  is  almost  degenerate  with  the  charged  and  heavy  neutral  scalar  Higgs  bosons 
if  Mp  >>  M|  [39].  Note  that  eq.(12)  is  exact,  unlike  eqs.(10),  up  to  corrections  of 
order  h\.  In  our  fixed  point  scenario  eq.(12)  and  the  bounds  (8)  and  (11)  imply 
that  mp  can  easily  exceed  1  TeV  even  for  quite  modest  values  of  mo;  such  heavy 
SUSY  Higgses  are  very  difficult  to  detect  (or  even  produce)  experimentally.  On  the 
other  hand,  this  also  implies  that  the  couplings  of  the  light  neutral  scalar  Higgs 
boson  /i°  to  SM  fermions  and  gauge  bosons  are  practically  identical  to  those  of  the 
SM  Higgs  [39],  so  that  h°  will  be  produced  copiously  at  the  LHC,  at  future  e+e~ 
linacs,  and  perhaps  even  at  the  second  stage  of  LEP  [40] 

Since  we  have  m^  ^>  mi/2  m  our  model,  all  sleptons  have  very  similar  masses; 
eqs.(13)  and  (8)  then  imply  that  they  are  most  likely  too  heavy  to  be  detectable  at 
the  LHC  [41],  while  they  cannot  be  produced  at  all  at  a  500  GeV  e+e~  collider. 

Clearly  the  best  chance  for  a  decisive  test  of  the  model  is  given  by  the  upper 
bound  (9)  on  m^-  Since  |/^(Qo)|  is  so  large  the  masses  of  the  lighter  chargino 
and  the  next-to-lightest  neutralino  lie  within  a  few  GeV  of  the  low-energy  SU(2) 
gaugino  mass  MI  c±  0.82mj/2-  In  particular,  the  lighter  chargino  always  lies  below 
105  GeV  [17],  offering  a  good  chance  for  its  discovery  at  LEP,  especially  if  the 
machine  energy  can  be  boosted  beyond  the  currently  foreseen  value  of  176  GeV. 

The  situation  is  slightly  more  complicated  for  the  gluino.  The  bound  (9)  implies 
a  rather  low  upper  bound  for  the  running  (MS  or  DR)  gluino  mass,  rn-g(m-g)  ~ 
2.5m1/2  <  330  GeV.  However,  it  has  recently  been  pointed  out  [42]  that  the  on-shell 
gluino  mass  can  be  substantially  larger  than  this,  especially  if  squarks  are  heavy: 


~  m-g(m-g) 


(14) 
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where  I  have  only  included  the  leading  logarithmic  correction  from  squark-quark 
loops.  Requiring  somewhat  arbitrarily 

m0  <  1  TeV  (15) 

in  order  to  avoid  excessive  finetuning  in  eq.(3),  we  see  that  the  "threshold  correc- 
tion" (14)  can  change  the  gluino  mass  by  about  30%  if  m^  is  close  to  its  lower 
bound  and  mo  is  at  its  upper  bound.  This  can  change  both  the  cross  section  and 
the  signal  for  gluino  pair  production  quite  significantly.  Including  the  correction 
(14),  the  bound  (8)  implies  m$(pole)  <  400  GeV  if  (15)  is  imposed. 

The  lower  bound  (8)  on  mo  also  implies  that  the  squarks  of  the  first  two  gen- 
erations are  significantly  heavier  than  the  gluinos;  they  will  thus  dominantly  decay 
into  g  -\-  q,  although  the  left-handed  (577(2)  doublet)  squarks  also  have  (9(10%) 
branching  ratios  into  an  elw.  gaugino  and  a  quark.  The  masses  of  the  first  two 
generations  of  squarks  lie  between  about  330  GeV  and  a  little  above  1  TeV  in  this 
model,  where  the  upper  bound  merely  reflects  the  somewhat  arbitrary  constraint 
(15). 

So  far  the  spectrum  resembles  a  special  case  of  the  type  of  models  whose  sig- 
natures were  discussed  in  the  existing  literature  [1,  13],  with  heavy  squarks  and 
sleptons,  large  \(j,\  and  mp,  but  rather  light  gluino  and  elw.  gauginos.  This  is  not 
surprising,  since  in  these  ealiers  studies  p  and  mp  were  considered  to  be  free  pa- 
rameters; they  could  thus  be  chosen  to  be  large,  whereas  the  present  model  requires 
them  to  be  large.  However,  as  already  remarked  in  the  Introduction,  in  these  pa- 
pers all  squarks  were  also  assumed  to  have  the  same  mass  at  the  weak  scale.  It  is 
here  that  the  present  model  makes  specific  predictions  which  cannot  be  mimicked 
by  these  earlier  treatments,  in  spite  of  the  larger  number  of  free  parameters. 

The  reason  is  that  the  same  kind  of  radiative  corrections  that  reduce  the  Higgs 
mass  parameter  m2  to  negative  values  also  reduce  the  masses  of  the  stop  and  left- 
handed  sbottom  squarks,  as  compared  to  the  masses  of  first  generation  squarks. 
Specifically,  one  finds 

m?    =  m2    -m?--^2     ;  (16a) 

bl-          tL          q      3sin2/?  V 


(16b) 

where 

m|~m2  4- 6*7*1/2  (17) 

is  a  typical  first  generation  squark  mass  (at  scale  Q  =  m^),  and  Xi  has  been  given 
in  eq.(lOc). 

Since  for  the  small  values  of  tan/?  of  interest  here  IL  —  bji  mixing  is  almost 
negligible,  b^  is  to  good  approximation  the  mass  eigenstate  61 ,  with  mass  given 
by  eq.(16a).  The  ratio  m^  /mzL  (including  small  mixing  effects  and  D— terms)  is 
shown  in  fig.  1  as  a  function  of  AQ  =  A/mo,  for  various  values  of  m0  and  mi/2 
and  /j  <  0  [44].  We  observe  that  the  dependence  on  A  is  indeed  quite  weak  here, 
as  claimed  earlier.  The  allowed  range  for  AQ  is  determined  by  various  constraints: 
The  squared  mass  of  the  lightest  stop  eigenstate  (see  below)  must  be  larger  than 
-f(45  GeV)2;  the  scalar  potential  at  the  weak  scale  should  not  have  minima  [45] 


in  the  directions  (TR)  =  (fL)  =  (H°)  or  (fR}  =  (tL)  =  (H°)  that  are  deeper  than 
the  desired  minimum  where  only  (H°)  and  (H°)  are  nonzero;  and  the  potential 
must  be  bounded  from  below  at  the  GUT  scale,  which  requires  ml  +  H2(MX)  > 
1\B(MX}»(MX}\. 


iri|(m|)=  166  GeV,/A<0 

i 

i   i   j   i   i   i    i   j   i   i    i   i   ]  i   i 

mo  =  0.4  TeV,  m|/2  -  1  3O  GeV 

1 

0.80 

h— 

--'''     ~                           ^"^, 

•""" 

-J 

0.78 

— 

m0  =0.3  Te^rn^SO  GeV 

- 

e3 

0.76 

L- 

^^               ^^-\ 

- 

?JQ~ 

E 

0.74 

Lr 

X^ 

_ 

- 

mo  =1  TeV  m|/2=6OGeV 

- 

0.72 

r\  "7f*\ 

- 

I    i    i    1    i    i    i    i    1    i    i    i    i    [    i    i 

- 

U./O 

4 

-202 

A 

Ao 

Figure  1.  The  ratio  rn-bi/m^L  as  a  function  of  the  GUT  scale  parameter  A0  =  A/m0. 

The  ratio  rn^/m^  does  depend  somewhat  on  the  ratio  m^/mo,  however,  be- 
ing maximal  where  m^/mo  is  minimal  (and  vice  versa).  We  see  that  in  our  fixed- 
point  scenario  m^  is  reduced  by  typically  20  to  30%  compared  to  first  generation 
squark  masses.  This  can  be  quite  significant,  since  partial  widths  for  three-body 
decays  of  gluinos  or  elw.  gauginos  that  involve  squark  exchange  scale  approximately 
like  the  inverse  fourth  power  of  the  squark  mass.  A  reduction  of  the  squark  mass 
by  20%  (30%)  therefore  leads  to  an  increase  of  the  corresponding  partial  width  or 
branching  ratio  by  a  factor  of  2.1  (2.9).  This  leads  to  6-rich  final  states,  as  will  be 
discussed  in  more  detail  in  sec.  4. 

Eq.(16b)  implies  that  the  mass  of  the  IR  current  state  is  reduced  even  more 
than  m-bL .  Moreover,  <£,  -  tR  mixing  is  not  negligible;  it  reduces  the  mass  m^  of 
the  lighter  t  mass  eigenstate  even  further.  In  the  convention  of  ref.[43]  the  i  mass 
matrix  is  given  by  [46]: 


2  _       ™1L  +"**+  0.35M|  cos2/? 
J    *~  -mt(At  +  n  cot/?) 


-mt(At  -I-  p  cot/?) 
+m}+  0.16M|  cos2/? 


(18) 


At  scale  MX,  At  =  A,  but  it  is  subject  to  radiative  corrections  due  to  both  gauge 


and  Yukawa  interactions.  For  small  tan/?,  one  has  approximately: 


At(Q0)~A 


1- 


m< 


190  GeV 


where  I  have  again  assumed  QQ  ~  350  GeV.  Right  at  the  fixed  point  of  ht,  where 
eq.(5)  becomes  exact,  the  weak-scale  value  of  At  is  again  independent  of  A  [36]. 
However,  even  for  values  of  ht(Mx}  as  large  as  2,  m^  can  still  show  substantial 
.A-dependence.  The  reason  is  that  in  the  expression  for  m^  strong  cancellations 
occur  between  the  contributions  from  diagonal  and  off-diagonal  entries  of  the  stop 
mass  matrix;  relatively  small  changes  of  these  elements,  of  the  size  shown  in  fig. 
1,  can  therefore  have  sizable  effects  on  mf  .  This  is  especially  true  for  /*  >  0, 
since  there  all  effects  go  in  the  same  direction:  Increasing  A  increases  Xz,  which 
simultaneously  reduces  m?  and  m?  in  the  diagonal  entries,  eq.(16),  and  increases 

*  i»  *  R 

the  off-diagonal  elements  by  increasing  /i,  see  eqs.(3)  and  (10);  note  that  At  and  fj. 
have  the  same  sign  in  this  case. 

m^  also  depends  quite  strongly  on  mt  in  the  fixed-point  scenario.  The  reason 
is  that  a  larger  mt  implies  larger  tan/?,  see  eq.(5),  and  hence  smaller  |/i|,  eq.(3);  in 
addition,  the  off-diagonal  term  in  the  stop  mass  matrix  (18)  is  reduced,  due  to  the 
explicit  cot/?  factor.  Therefore  larger  m<  also  imply  larger  m^here.  (This  is  not 
generally  true  if  ht  is  significantly  below  its  fixed  point  [43].) 

In  all  of  parameter  space  allowed  in  our  model  one  finds  that  mt-t  is  close  to 
or  even  well  below  the  gluino  mass.  This  is  partly  due  to  the  correction  (14)  [47]. 
Not  surprisingly,  this  statement  remains  true  [36,  37]  if  the  constraint  (8)  from  the 
proton  lifetime  is  not  imposed,  i.e.  if  larger  ratios  m^/mo  are  allowed,  as  long 
as  ht  is  close  to  its  fixed  point.  Light  stops  are  hence  a  quite  generic  prediction  of 
SUGRA  models  with  large  ht.  Their  phenomenology  depends  on  whether  or  not 
they  can  be  produced  in  gluino  decays  (i.e.,  whether  nig  >  mt  +  m^-J,  and  also 
whether  the  decay  ^  -*  b  +  W\  is  allowed,  where  W\  is  the  lighter  chargino;  if  (and 
only  if)  this  decay  is  forbidden,  fj  -+  c  +  Z\  via  loop  diagrams  [49]. 

The  ordering  of  m$,  m^,  mt  and  m^  depends  quite  strongly  on  A,  mi/2 
and  mt,  as  well  as  on  the  sign  of  /*.  This  is  demonstrated  by  figs.  2  a-d,  which 
show  regions  in  the  (AQ,ml/2)  plane  for  m0  =  500  GeV;  results  for  other  values 
of  m0  allowed  by  (8)  are  quite  similar.  Figs.  2a,b  are  for  mt(m,)  =  160  GeV, 
corresponding  to  m{(pole)  =  168  GeV,  while  c,d  are  for  mt(mt)  =  175  GeV  or 
mt(pole)  =  183  GeV;  p  has  been  chosen  positive  in  a,c  and  negative  in  b,d.  In  all 
these  figures  the  regions  outside  the  dotted  lines  are  excluded  by  the  constraints 
described  in  the  discussion  of  fig.  1.  Along  the  solid  line  one -has  m-g  =  mfi  +  m£, 
i.e.  on  one  side  of  this  line  g  — >  t\  +t  decays  are  allowed;  since  all  other  squarks  are 
considerably  heavier,  this  is  the  only  possible  two-body  decay  mode  of  the  gluino 
and  will  hence  have  a  branching  ratio  of  nearly  100%  if  allowed  at  all.  Similarly, 
the  long  dashed  lines  separate  regions  where  1 1  — >•  W\  -f  6  is  open,  in  which  case 
it  has  a  branching  ratio  very  close  to  100%,  from  those  where  fi  ->  Z\  +  c.  The 
decay  t±  —>  t  -f  Z\  is  almost  never  allowed  here  (it  opens  up  if  both  mi/2  and 
m0  are  near  their  upper  bounds);  instead,  the  short  dashed  lines  are  contours  of 
constant  m^  =  mt.  In  all  cases  m^  increases  with  increasing  mi/2,  and  it  generally 
also  increases  with  decreasing  |X0|,  although  the  maximum  (for  given  mx /2)  is  not 
exactly  at  A0  =  0. 
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Figure  2.  Regions  in  the  (/40,ma/2)  plane  leading  to  different  gluino  and  stop  decays. 


In  fig.  2a,  g  — »  t\  + 1  decays  are  allowed  everywhere  except  within  the  small 
triangular  region  delineated  by  the  solid  line.  There  are  also  substantial  regions 
where  <i  — >•  W\  -f  b  is  not  possible,  and  m^  <  m<  almost  everywhere  except  for 
the  small  region  enclosed  by  the  the  short  dashed  curve.  Changing  the  sign  of  fi 
(fig.  2b)  or  increasing  m<(mf)  to  175  GeV  (fig.  2c)  changes  the  situation  quite 
drastically,  however.  The  region  where  g  — >•  fi  -f- 1  is  allowed  is  now  confined  to  the 
narrow  strips  between  the  solid  and  dotted  lines,  and  m^  <  m^  4-  mb  only  in  the 
even  narrower  strips  between  the  long  dashed  and  dotted  lines.  In  contrast,  the 
region  where  m^  >  mt  (above  the  short  dashed  lines)  is  now  quite  large.  Finally, 
if  m<  is  close  to  its  upper  bound  and  /i  <  0,  fig.  2d,  the  gluino  can  never  decay  into 
fi  -f  t,  and  £[  always  decays  into  W\  +  b;  moreover,  now  ra^  >  mt  over  most  of  the 
allowd  part  of  the  (Ao,  m^)  plane. 

Since  fi  and  the  gluino  are  by  far  the  lightest  strongly  interacting  sparticles 
in  our  model,  SUSY  signals  at  the  LHC  (or  other  hadron  colliders)  will  obviously 
depend  quite  sensitively  on  which  of  the  regions  depicted  in  figs.  2  is  picked.  This 
is  the  subject  of  the  next  section,  where  these  signals  are  discussed  in  more  detail. 

4.  Signals  at  the  LHC 

In  this  section  I  discuss  the  specific  signatures  produced  by  the  kind  of  spectrum 
described  in  the  previous  section.  The  signatures  at  e+e~  colliders  are  rather 
straightforward:  A  light  chargino  (below  105  GeV),  a  rather  light  neutral  scalar 
Higgs  boson  (below  110  GeV),  and  a  light  ti  (below  ~  300  GeV),  all  of  which  can 
be  detected  in  a  straightforward  way  [50].  Due  to  numerous  backgrounds  the  situa- 
tion at  hadron  colliders  is  much  more  complicated.  Moreover,  at  these  machines  the 
largest  signals  come  from  the  production  of  strongly  interacting  sparticles.  Since 
these  are  usually  heavier  than  many  other  sparticles  they  tend  to  decay  via  lengthy 
cascades  [13].  While  this  makes  signals  more  difficult  to  analyze,  it  also  offers  the 
opportunity  to  determine  various  branching  ratios,  which  greatly  helps  to  distin- 
guish between  different  SUSY  scenarios. 

Here  I  summarize  the  results  of  ref.[19],  where  a  full  Monte  Carlo  study  of  signals 
and  backgrounds  was  performed,  using  the  latest  version  of  ISAJET  which  contains 
the  ISASUSY  program  package  to  compute  sparticle  masses  and  decay  branching 
ratios.  The  program  includes  initial  and  final  state  showering,  fragmentation,  and 
crude  detector  modelling,  where  we  took  present  designs  of  LHC  detectors  as  guide- 
lines; see  ref.[51]  for  more  details  on  this  program  package. 

A  simulation  of  this  type  consumes  a  substantial  amount  of  CPU  time  [52]. 
We  therefore  limited  ourselves  to  the  study  of  six  spectra  that  can  occur  within 
SUGRA  5(7(5),  see  table  1.  We  saw  already  in  the  previous  section  that  over  a 
substantial  region  of  parameter  space  the  gluino  can  decay  into  t  +  t\.  The  two  'A' 
spectra  were  picked  from  that  region,  with  Al  being  an  example  where  t\  -+  W\  +6 
while  in  case  A2  the  light  stop  decays  into  charm  plus  LSP.  In  the  remaining  four 
'B'  cases  the  gluino  has  no  two-body  decay  modes;  here  we  picked  points  where 
m0  and  mi/2  are  minimal  or  maximal:  (m0,  m^)  =  (0.3  TeV,  60  GeV)  (Bl);  (1 
TeV,  70  GeV)  (B2);  (0.4  TeV,  130  GeV)  (B3);  and  (1  TeV,  130  GeV_HB4).  These 
six  spectra  show  all  the  features  discussed  earlier:  Light  Z\>  2T2,  Wi,  h°  and  g; 
quite  heavy  squarks  and  sleptons,  except  for  fi,  with  61  significantly  below  first 


Table  1.  Parameters  and  masses  for  six  SUGRA  cases  Al,  A2  and  B1-B4. 


parameter 


Al       A2 


Bl 


B2  |     B3  j       B4 


mo 

500 

500 

300 

1000 

400 

1000 

"h/2 

120 

130 

60 

70 

130 

130 

A0/m0 

0.6 

3.75 

0.1 

0.0 

0.0 

0.0 

tan/? 

1.49 

2.2 

1.94 

1.32 

2.22 

2.11 

n 

697.1 

580 

-313.3 

-1571.7 

430 

964.7 

mt 

160 

175 

170 

155 

175 

175 

m-g 

346 

371 

185 

231 

364 

400 

mq 

568 

578 

328 

1011 

496 

1039 

mfi 

131 

83.6 

198 

121 

225 

288 

mf3 

521 

500 

304 

781 

470 

799 

mlL 

437 

426 

250 

732 

396 

765 

mi 

508 

501 

305 

1001 

412 

1005 

mz 

43.6 

48.0 

27.1 

28.8 

46.1 

50.7 

m~a 

85.1 

93.5 

63.5 

59.8 

89.3 

99.9 

m~ 

698 

582 

316 

1572 

432 

966 

m~ 

710 

595 

333 

1577 

450 

973 

\v 

84.5 

92.9 

62.2 

59.7 

87.9 

99.6 

w 

707 

593 

331 

1576 

448 

972 

mAo 

85.2 

99.7 

61.6 

86.0 

91.3 

101 

fTTjyo 

1038 

841 

492 

2339 

652 

1543 

Tflp 

1038 

842 

487 

2342 

649 

1542 

m.H± 

1040 

845 

492 

2343 

653 

1544 
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generation  squarks;  heavy  Z3,   £4  and  Wi]  and  very  heavy  Higgs  bosons  P,  H 
and  H±. 

In  order  to  see  how  mSUGRA  signals  differ  from  the  signals  of  "conventional 
SUSY"  models  of  the  type  considered  in  ref.[13],  we  also  studied  four  cases  (la- 
belled BTW1  through  BTW4)  where  all  squarks  and  sleptons  are  assumed  to  be 
degenerate.  In  all  'BTW  cases  we  took  m~g  -  300  GeV,  and  picked  (m?-,  /^)  =  (320 
GeV,  -150  GeV)  (BTW1);  (600  GeV,  -150  GeV)  (BTW2);  (320  GeV,  -500  GeV) 
(BTW3);  and  (600  GeV,  -500  GeV)  (BTW4)  [53]. 

In  table  2  SUSY  event  fractions  as  well  as  important  sparticle  decay  branching 
ratios  are  listed.  In  the  'A'  cases  with  very  light  fi,  more  than  half  of  all  SUSY 
events  are  <!?*  pairs,  while  in  the  four  'B'  cases,  gluino  pairs  constitute  the  most 
copiously  produced  supersymmetric  final  state.  Pairs  of  electroweak  gauginos  are 
produced  only  in  a  few  percent  or  even  few  permille  of  all  SUSY  events;  their 
detection  therefore  necessitates  a  dedicated  search  [54],  as  opposed  to  the  "generic 
SUSY  search"  presented  here. 

Many  of  the  branching  ratios  shown  in  table  2  can  be  understood  directly  from 
the  sparticle  masses  listed  in  table  1,  keeping  in  mind  that  two-body  final  states 
will  always  overwhelm  three-body  final  states  if  both  are  accessible  at  tree  level. 
A  few  features  are  worth  pointing  out,  however.  For  example,  in  cases  Bl,  B2  the 
Zibbt,  coupling  is  "accidentally"  suppressed.  The  branching  ratio  for  g  — »•  Zibb  is 
therefore  dominated  by  IR  exchange,  whose  mass  is  not  reduced  compared  to  first 
generation  squark  masses;  therefore  g  — »•  Zibb  is  not  enhanced  significantly  over 
g  —*_Z\dd  in  these  cases.  Nevertheless  Zi  — »•  Z^bb  is  enhanced  considerably  over 
Z2dd.  The  reason  _is  that  virtual  h°  exchange  diagrams,  which  contribute  much 
more  strongly  to  bb  final  states,  are  not  negligible  here.  This  can  be  understood 
from  the  observation  that  h°  exchange  is  only  suppressed  [39]  by  one  power  of  the 
smalUiiggsino  component  of  Z2  or  Zi,  while  Z  exchange  needs  [1]  two  such  powers, 
and  /  exchange  is  suppressed  by  the  large  sfermion  masses. 

/i°  exchange  contributes  negligibly  to  Z2  —»  Z\e+e~  decays,  so  it  reduces  the 
corresponding  branching  ratio  [55].  The  leptonic  branching  ratio  of  Zi  can  never- 
theless exceed  considerably  that  of  the  Z  boson,  even  if  (most)  squarks  and  sleptons 
have  almost  the  same  mass,  as  is  the  case  here;  this  is  largely  due  to  interference 
between  Z  and  sfermion  exchange  diagrams,  which  can  however  also  result  in  very 
small  leptonic  branching  ratios  for  ,Z2,  see  cases  B3  and  B4.  In  contrast,  the 
branching  ratios  of  the  light  chargino  very  closely  track  that  of  the  W  bosons,  since 
the  W\Z\W  couplings  get  contributions  from  both  the  higgsino  and  517(2)  gaug- 
ino Components  of  W\  and  Z\,  and  are  therefore  much  less  suppressed  than  the 
Z^,Z\Z  coupling.  The  only  exception  occurs  in  case  A2,  where^the  two-body  decay 
Wi  — *•  fi  -f  b  is  allowed  and  hence  completely  dominates  all  W\  decays. 

It  is  by  now  quite  well  known  that,  in  addition  to  the  "classical"  missing  Ey+jets 
signature,  sparticle  production  at  hadron  colliders  can  also  give  rise  to  final  states 
containing  hard  leptons  [13,  56].  In  the  present  study  a  hard  lepton  is  defined  as  an 
electron  or  muon  with  pr  >  20  GeV,  pseudorapidity  \rj\  <  2.5,  and  visible  (hadronic 
or  electromagnetic)  activity  in  a  cone  AjR  =  0.3  around  the  lepton  less  than  5  GeV. 
Hadronic  clusters  with  ET  >  50  GeV  in  a  cone  AJ?  =  0.7  are  labelled  as  jets.  We 
can  then  define  various  mutually  exclusive  event  classes: 

•  "Missing  ET"  events  have  no  hard  leptons,  rij  >  4  jets,  missing  ET  >  150 
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Table  2.  (a)  Fractions  of  SUSY  particle  pairs  produced  in  pp  collisions  at  the  LHC;  and 
(b)  branching  fractions  of  selected  decay  modes,  for  six  SUGRA  cases  Al,  A2  and 
B1-B4,  where  q  stands  for  all  squarks  except  stops,  and  XX  stands  for  all  possible 
chargino  and  neutralino  pairs. 


SUSY  particles\ 

Case 

Al 

A2 

Bl 

B2 

B3 

B4 

(a)  Sparticle 

Pairs  Produced 

99 

0.30 

0.093 

0.72 

0.74 

0.44 

0.73 

fifi* 

0.51 

0.84 

0.011 

0.21 

0.10 

0.083 

gq 

0.13 

0.050 

0.23 

0.013 

0.33 

0.081 

|f 

0.018 

0.007 

0.029 

3xlO~4 

0.067 

0.005 

wfz2 

0.018 

0.006 

0.004 

0.019 

0.027 

0.066 

XX 

0.026 

0.009 

0.007 

0.027 

0.042 

0.088 

(b)  Important 

Decay  Modes 

g  —  >  t^t,  tit 

1.0 

1.0 

- 

- 

- 

- 

g  -»•  W^tb,  Wfbt 

1.6xlO-4 

1.4xlO-4 

- 

0.091 

0.12 

0.25 

g-+W^ud,Wfdu 

4.6xlO-4 

2.9xlO~4 

0.21 

0.10 

0.19 

0.16 

g  —  >•  Zidd 

S.lxlO-5 

1.9xlO~5 

0.012 

0.005 

0.014 

0.01 

g  —  >•  Zibb 

6.8xlO'5 

5.4xlO~5 

0.012 

0.006 

0.035 

0.018 

g  —  »•  Z%bb 

4.3xlO-4 

3.9xlO-4 

0.21 

0.10 

0.18 

0.15 

fj  ->  Wfb 

1.0 

- 

0.95 

1.0 

0.93 

0.29 

ti  —)•  Zi* 

- 

- 

0.05 

- 

0.07 

0.64 

^1  —  *•  -^2^ 

- 

- 

- 

- 

- 

0.07 

tj  —  >•  J^iC 

- 

1.0 

- 

- 

- 

- 

^2  _^  Zi  dd 

0.11 

0.18 

0.024 

0.028 

0.21 

0.17 

Z2  _>  Zi66 

0.37 

0.39 

0.057 

0.22 

0.38 

0.42 

£2  _^  Zie+  e~ 

0.047 

0.018 

0.14 

0.12 

0.007 

0.012 

W+^Zie+I/e 

0.11 

6.9xlO-5 

0.11 

0.11 

0.11 

0.11 

GeV.  If  n  =  1,  we  in  addition  required  the  scalar  ET  to  exceed  700  GeV,  and 
demanded  that  the  transverse  mass  computed  from  the  missing  pr  and  the 
leptonic  pr  does  not  fall  in  the  interval  between  60  and  100  GeV,  where  the 
background  from  real  W  decay  has  a  Jacobian  peak.  For  n  =  2  we  distinguish 
opposite  sign  (OS)  and  same  sign  (SS)  events,  and  required  total  ET  >  700 
GeV  in  the  OS  sample  in  order  to  suppress  it  backgrounds. 

Cross  sections  after  cuts  for  these  final  states  are  listed  in  table  3,  for  the 
six  mSUGRA  cases,  4  'BTW  cases  and  leading  sources  of  background,  i.e.  it, 
W+jets  and  Z-fjets  events.  (We  checked  that  backgrounds  from  W+W~ ,  cc  and 
bb  production  are  always  very  small.)  The  table  extends  out  to  n  =  4  leptond,  but 
the  results  for  n  =  4  already  suffer  from  substantial  statistical  errors  (we  generated 
50,000  events  for  each  SUSY  spectrum,  and  several  hundred  thousand  background 
events).  We  observe  that  the  missing  ET,  SS  and  (except  in  case  A2)  3/  signals  are. 
all  well  above  background;  the  II  and  OS  signals  are  also  always  larger  than,  or  at 
least  similar  to,  backgrounds  [57]. 

The  size  of  the  missing  ET  cross  section  is  mostly  determined  by  the  gluino 
mass;  direct  tit*  production  contributes  little  to  the  signal  after  cuts  even  in  case 
A.2,  mostly  because  we  demand  at  least  4  jets  here.  However,  due  to  the  poten- 
tially sizable  contribution  from  gq  production  shown  in  table  2,  the  masses  of  first 
generation  squarks  also  play  a  role.  Moreover,  even  this  relatively  robust  signal 
can  change  by  a  factor  of  more  than  2  depending  on  gluino  branching  ratios:  Even 
though  case  A2  has  a  slightly  heavier  gluino  than  case  Al,  and  hence  an  almost 
50%  smaller  gluino  pair  cross  section,  it  produces  a  two  times  stronger  missing 
ET  signal  than  case  Al  jlpes.  The  reason  is  that  t\  — »•  Z\  +  c  decays  give  much 
harder  LSPs  than  fi  — *  W\  -f  6  -+  Z\  +  qq1  -f  6  decays  do.  Similarly,  the  missing 
ET  signal  in  case  B2  is  almost  as  strong  as  in  case  Bl,  inspite  of  the  more  than 
three  times  smaller  sum  of  gg  and  gq  cross  sections.  This  is  mostly  due  to  a  large 
(22%)  branching  ratio  for  g  — *  Z\  -f  g  loop  decays.  (</  — »•  Z2  +  g  decays  also  have 
a  branching  ratio  of  about  22%,  and  about  20%  of  all  Z2  decay  invisbly  into  Zivv 
in  this  case.)  These  Z\  produced  in  g  two-body  decays  are  very  energetic,  much 
more  so  than  the  LSPs  produced  at  the  end  of  a  cascade.  The  branching  ratios  for 
these  loop  decays  of  the  gluino  are  enhanced  in  case  B2  because  ordinary  squarks 
are  very  heavy,  while  fi  (which  contributes  to  loops)  is  quite  light. 

The  size  of  the  II  signal  is  generally  roughly  comparable  to  that  of  the  missing 
ET  signal,  but  backgrounds  are  about  three  times  larger  here.  Real  top  quarks  are 
very  efficient  in  producing  hard  leptons  (and  missing  ET,  needed  to  pass  the  cut 
$T  >  100  GeV);  in  case  Al  the  II  signal  is  therefore  actually  larger  than  the  missing 
ET  signal.  Notice  also  that  the  effect  of  increasing  m0  to  1  TeV  is  now  about  the 
same  for  cases  B1/B2  and  B3/B4,  reducing  the  signal  by  approximately  a  factor 
of  two.  Many  "IP  events  in  cases  Bl,  B2  actually  come  from  Z2  -*•  Zi/+/~  decays 
where  one  of  the  leptons  failed  to  pass  the  cuts.  The  large  leptonic  branching  ratio 
of  Z-i  in  these  cases  also  leads  to  quite  large  OS  11  signals,  large  OS/SS  ratios,  and 
sizable  3/  signals.  Case  A I  also  has  substantial  OS  and  3/  signals,  but  the  OS/SS 
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Table  3.  Cross  sections  in  pb  for  various  event  topologies  after  cuts  described  in  the  text, 
for  pp  collisions  at  ^/s  =  14  TeV.  The  various  SUGRA  cases  are  listed  in  the  first 
column.  The  OS/SS  ratio  is  computed  with  the  OS  dilepton  sample  before  the  scalar 

ET  cut. 


case 


OS       55      OS/SS 


Al 

24.6 

36.2 

5.4 

3.7 

2.0 

1.2 

0.017 

A2 

48.0 

31.4 

1.5 

2.1 

1.2 

<0.02 

<0.02 

Bl 

79.1 

76.8 

11.9 

3.4 

6.9 

1.7 

0.17 

B2 

67.7 

37.5 

9.0 

2.4 

7.7 

0.8 

0.1 

B3 

51.8 

21.2 

1.4 

0.6 

3.3 

0.09 

<0.01 

B4 

20.1 

10.1 

1.1 

0.4 

3.5 

0.1 

<  0.004 

BTW1 

105 

39.8 

2.8 

1.4 

3.1 

0.21 

0.03 

BTW2 

57.3 

22.5 

2.2 

0.85 

3.6 

0.14 

<0.02 

BTW3 

96 

58 

10.9 

2.9 

6.7 

1.5 

0.06 

BTW4 

52.3 

23.9 

2.3 

0.9 

3.7 

0.2 

<0.02 

tt~(160) 

2.9 

8.0 

1.1 

0.01 

640 

<  0.004 

<  0.004 

W  +  jet 

0.6 

3.8 

0.29 

- 

- 

- 

Z  +  jet 

0.6 

0.2 

0.02 

- 

- 

- 

total  BG 

4.3 

12.02 

1.411 

0.01 

<  0.004 

<  0.004 

ratio  is  much  smaller  here,  since  gg  events  produce  it  and  it  final  states  with  equal 
abundance  as  it  final  states.  Case  A2  has  very  few  n  >  3  lepton  events,  since 
basically  all  gluinos  decay  into  t  +  c  +  Z\  (or  the  charge  conjugate  thereof)  here, 
giving  at  most  one  hard  lepton  per  gluino.  Cases  B3  and  B4  give  smaller  leptonic 
signals,  due  to  the  larger  gluino  mass;  they  differ  from  each  other  by  a  factor  of 
two  in  the  missing  ET  and  (marginal)  II  signals,  but  become  more  similar  to  each 
other  as  more  leptons  are  required,  mostly  due  to  the  sizable  g  — +  W\tb  branching 
ratio  in  case  B4. 

Finally,  the  "conventional"  BTW  cases  can  produce  even  larger  missing  ET 
signals  than  our  mSUGRA  cases,  since  we  allowed  squarks  to  lie  just  above  gluinos 
here,  leading  to  large  gq  production  rates.  Case  BTW3  also  has  a  large  leptonic 
branching  ratio  of  Zi  (12.5%  per  generation),  and  hence  large  cross  sections  for  the 
n  lepton  final  states;  for  the  other  BTW  cases  the  leptonic  branching  ratio  of  Z?  is 
close  to  that  of  the  Z  boson. 

This  discussion  shows  that  the  counting  rates  in  the  six  or  seven  independent 
signal  channels  listed  in  table  3  already  go  a  long  way  towards  distinguishing  the 
various  mSUGRA  cases  from  each  other  as  well  as  from  the  *BTW  cases.  Some 
ambiguities  remain,  however;  for  example,  cases  Bl  and  BTW3  are  are  very  similar 
at  this  level  (except  perhaps  in  the  4/  signal,  but  statistics  is  poor  here).  The 
remaining  ambiguity  can  be  resolved  by  looking  at  the  events  within  one  signal 
class  in  more  detail.  An  example  is  shown  in  table  4,  where  event  fractions  for 
the  missing  ET  sample  are  shown,  together  with  the  average  total  scalar  ET  and 
average  missing  'Er  per  event. 

The  most  useful  discriminators  appear  to  be  the  fractions  of  events  with  at  least 
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Table  4.  Cross  sections  and  event  fractions  for  missing  energy  plus  jets  events  for  various 
SUSY  cases  at  the  LHC. 


case 


<r(P6) 


-  4-5     6-7      >  8      nb>l 


Al 

24.6 

0.54 

0.35 

0.10 

0.65 

0.28 

1160 

212 

A2 

48.0 

0.55 

0.35 

0.10 

0.47 

0.09 

1112 

221 

Bl 

79.1 

6.73 

0.25 

0.02 

0.21 

0.04 

964 

195 

E2 

67.7 

0.77 

0.20 

0.03 

0.23 

0.05 

999 

211 

B3 

51.8 

0.57 

0.35 

0.08 

0.36 

0.12 

1118 

215 

B4 

20.1 

0.54 

0.36 

0.10 

0.44 

0.15 

1204 

217 

BTW1 

105 

0.69 

0.26 

0.04 

0.17 

0.04 

1006 

214 

BTW2 

57.3 

0.61 

0.33 

0.05 

0.18 

0.04 

1091 

211 

BTW3 

96 

0.69 

0.27 

0.04 

0.15 

0.03 

1011 

217 

BTW4 

52.3 

0.60 

0.33 

0.06 

0.18 

0.04 

1109 

208 

«(160) 

2.9 

0.81 

0.17 

0.01 

0.56 

0.12 

895 

201 

Z  +  jets 

0.63 

0.89 

0.11 

0.0 

0.11 

0.02 

905 

281 

one  or  two  tagged  6  quarks.  Here  we  have  assumed  a  b  tagging  efficiency  of  40% 
if  the  b— flavoured  hadron  has  p?  >  20  GeV  and  pseudorapidity  \r)\  <  2,  and  zero 
efficiency  otherwise;  we  have  ignored  the  possibility  of  false  tags.  The  spectra  where 
9  — *  fijr-  i  is  allowed  clearly  lead  to  the  largest  b  content;  the  difference  between 
fi  — »•  Wi  -f  b  (Al)  and  ?i  — »•  Z\  +  c  (A2)  decays  is  also  evident,  especially  in  the 
fraction  of  events  with  at  least  two  tagged  6's.  Cases  B3  and  B4  still  have  fairly 
large  6  content,  partly  due  to  g  —*•  Witb  decays  which  can  produce  up  to  four  b 
quarks  in  a  gluino  pair  event;  enhanced  g  — »•  Z2  +  66  and  Z2  — *•  Z\  -f  66  branching 
ratios  also  play  a  role,  see  table  2.  Finally,  in  the  light  gluino  scenarios  Bl,  B2  the 
6— fraction  is  considerably  smaller  than  in  the  cases  where  gluinos  can  decay  into 
top  quarks;  the  enhanced  branching  ratios  into  final  states  containing  66  still  lead 
to  6— fractions  that  exceed  those  of  the  BTW  cases,  however. 

The  presence  of  t  quarks  in  gluino  decays  also  leads  to  large  average  jet  mul- 
tiplicities: 10%  of  all  events  in  cases  Al,  A 2  and  B4  have  at  least  8  reconstructed 
jets  in  them  (recall  that  we  require  each  jet  of  have  pr  >  50  GeV).  Moreover,  the 
comparison  of  BTW2,4  with  BTW  1,3  shows  that  the  presence  of  first  generation 
squarks  significantly,  but  not  infinitely,  heavier  than  the  gluino  increases  the  aver- 
age number  of  jets  and  the  average  scalar  ET  per  event;  this  is  due  to  gq  production, 
of  course.  This  effect  is  less  pronounced  when  comparing  B4  to  B3,  or  B2  to  Bl, 
since  in  cases  B4  and  B2  first  generation  squarks  are  so  heavy  that  they  contribute 
little  to  the  total  SUSY  signal;  see  table  2.  The  average  missing  ET  seems  to  be 
a  less  useful  discriminator,  clustering  around  210  to  220  GeV  in  almost  all  cases. 
The  only  exception  is  case  Bl,  which  has  a  very  light  gluino;  in  case  B2,  where  the 
gluino  is  also  light,  the  missing  ET  is  enhanced  by  the  large  branching  ratios  for 
loop-induced  g  — •»•  Z\ti  -f  g  decays  discussed  earlier. 

Similar  tables  can  be  shown  for  the  other  samples  of  signal  events  [19];  for 
reasons  of  space  I  here  merely  summarize  the  most  important  points.  The  results 


jet  multiplicities  are  very  similar  for  all  samples,  except  that  a  larger  number  of 
tons  implies  an  overall  reduction  of  the  number  of  jets  [58].  The  b— content  of 
nts  in  the  I/  sample  are  similar  to  those  in  the  missing  ET  sample,  with  a  slight 
lancement  in  case  Bl  and  slight  reductions  in  the  BTW  cases  which  increases 
differences  between  these  scenarios. 

In  the  mSUGRA  cases  B1-B4  the  6  fraction  is  considerably  larger  in  the  OS 
nple  than  in  the  missing  ET  or^  II  samples.  This  effect  is  especially  pronounced 
;ases  B3  and  B4,  where  g  — *  W\tb  decays  are  good  sources  of  both  hard  leptons 
1  b  quarks;  indeed,  in  these  two  cases  the  6  fraction  exceeds  that  in  case  A2  in  the 
dilepton  sample.  The  correlation  between  the  number  of  6's  and  the  number  of 
tons  in  cases  Bl  and  B2  is  due  to  the  fact  that  here  most  leptons  come  from  Z-% 
ays,  and  g  — *•  Zi  decays  frequently  lead  to  a  bb  final  state;  the  enhancernent^of 
inal  states  is  much  weaker  in  g  — >•  Z\  decays,  as  shown  in  table  1,  while  g  — »•  W\ 
:ays  cannot  contain  6  quarks  in  these  cases. 

The  contribution  from  Z2  — +•  Z]_l+l~  decays  to  the  OS  signal  can  also  be  tested 
looking  at  the  flavour  of  the  produced  leptons:  Zi  decays  always  produce  e*e~  or 
fjL~  pairs,  while  OS  events  that  originate  from  (semi-)leptonic  decays  of  charginos 
t  quarks  are  equally  likely  to  contain  an  e/z  pair.  Indeed,  one  finds  a  strong 
ponderance  of  1  ike-flavour  pairs  in  cases  Bl  and  B2  as  well  as  in  all  BTW  cases, 
ile  all  combinations  of  lepton  flavours  occur  with  equal  frequency  in  cases  Al  and 
where  Zi  is  hardly  ever  produced;  in  cases  B3  and  B4  there  is  a  smaller  but  still 
nificant  preference  for  like-flavour  lepton  pairs.  The  invariant  mass  distribution 
the  two  charged  leptons  in  like-flavour  opposite-charge  dilepton  events  should 
>w  to  determine  the  Zi  —  Z\  mass  difference,  which  in  our  mSUGRA  scenario  is 
>roximately  given  by  0.4m!/2.  Unfortunately,  other  (combinations  of)  sparticle 
sses  are  much  more  difficult  to  determine  at  hadron  colliders. 

Summary  and  conclusions 

this  contribution  I  have  described  the  phenomenology  of  the  minimal  SUGRA 
'(5)  model.  The  underlying  theme  of  this  model  is  unification.  First  of  all,  the 
y  existence  of  a  GUT  sector  implies  that  the  Higgs  sector  of  the  SM  is  ill-behaved 
the  quantum  level  unless  SUSY  exists  at  an  energy  scale  not  much  above  the 
ak  scale.  Moreover,  we  now  know  that  the  particle  content  of  the  SM  by  itself 
js  not  allow  for  unification  of  all  gauge  interactions;  new  degrees  of  freedom  are 
;ded,  and  minimal  SUSY  just  fits  the  bill. 

Secondly,  in  this  model  one  can  also  successfully  unify  the  b  and  r  Yukawa 
iplings,  provided  that  the  top  Yukawa  coupling  is  large,  i.e.  close  to  its  IR 
lasi)  fixed  point.  The  idea  of  unification  can  even  be  extended  into  the  SUSY 
:aking  sector  by  assuming  a  single  gaugino  mass  as  well  as  a  single  scalar  mass, 
is  leads  to  the  very  elegant  mechanism  of  radiative  breaking  of  the  electroweak 
ige  symmetry,  which  hints  towards  explanations  for  the  large  mass  of  the  top 
ark  and  the  large  hierarchy  between  MX  and  MZ- 

In  this  scenario  constraints  from  proton  decay,  and  from  the  relic  density  of 
Ps  produced  in  the  Big  Bang,  imply  that  gauginos  must  be  quite  light,  while 
>st  sfermions  and  Higgs  bosons  are  heavy.  Light  charginos,  and  the  light  scalar 
ggs  boson  predicted  by  this  model,  are  quite  easily  detectable  at  e+e~  colliders 


below  ~  105  GeV  or  no  gluino  with  mass  below  ~  400  GeV  is  found  mSUGRA 
577(5)  has  to  be  discarded,  but  their  discovery  can  hardly  be  considered  proof  of 
this  model.  However,  it  also  predicts  that  left-handed  6  squarks  and,  much  more 
dramatically,  the  lighter  t  eigenstate  lie  well  below  the  other  squarks.  Indeed,  t\ 
might  even  be  produced  in  g  decays,  giving  events  with  several  b— quarks.  Even  if 
this  decay  is  not^  possible  the  reduced  masses  of  t\  and  &£,  give  enhanced  branching 
ratios  for  g  — >  W\tb  and  g  — +  Zijbb  three-body  decays  and  hence  again  an  enhaced 
6-quark  content  of  SUSY  events  at  the  LHC;  in  this  case  the  enhancement  is  usually 
less,  and  increases  with  the  number  of  hard  leptons  in  the  event.  Notice  that  the 
event  rates  are  so  large  that  a  "low"  luminosity  of  a  few  times  1032  cm~2sec-1 
is  quite  sufficient,  which  should  allow  for  6-tagging  at  least  in  principle.  Further 
clues  to  the  nature  of  the  spectrum  can  come,  e.g.,  from  the  average  jet  multiplicity 
per  event  and  from  the  flavour  composition  of  opposite-sign  dilepton  events. 

It  should  be  emphasized  that  in  this  model  a  SUSY  signal  should  be  seen  at 
the  LHC  in  at  least  three,  and  possibly  as  many  as  seven  independent  channels, 
where  only  the  number  and  charge  of  hard  leptons  has  been  used  to  classify  events. 
This  clearly  offers  great  opportunities  for  the  LHC.  Nevertheless  the  LHC  by  itself 
will  not  suffice  to  find  all  the  new  particles  predicted  by  this  model:  The  heavy 
Higgs  bosons,  higgsinos  and  sleptons  are  in  my  opinion  impossible  to  detect  at  the 
LHC.  Finding  first  generation  squarks  on  top  of  the  "background"  of  light  gluinos 
will  also  be  difficult,  especially  if  they  are  close  to  the  TeV  scale  in  mass.  More 
surprisingly,  even  the  direct  pair  production  of  light  t\  squarks  seems  difficult  to 
detect.  If  we  are  lucky  light  stops  might  be  detected  at  the  tevatron  [59].  However, 
if  this  model  is  correct  the  completion  of  the  sparticle  and  Higgs  spectrum  will 
most  likely  have  to  await  the  construction  of  a  TeV  scale  e+e~  collider.  Finally, 
one  would  eventually  want  to  see  direct  evidence  for  the  existence  of  GUT  particles, 
the  best  hope  probably  being  proton  decay  experiments.  The  model  presented  here 
would  therefore  keep  particle  physicists  busy  for  some  decades  to  come. 
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of  jets;  this  also  reduces  the  average  jet  multiplicity  compared  to  the  missing  ET 
sample,  of  course. 
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[59]     H.A.  Baer,  J.  Sender  and  X.R.  Tata,  preprint  FSU-HEP-940430,  hep-ph/9404342. 
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Abstract.  A  model  in  which  CP  is  spontaneously  broken  is  described;  it  solves  the 
strong  CP  problem  and  provides  an  additional  mechanism  for  weak  CP  violation.  This  is 
the  aspon  model.  Application  to  B  decays  leads  to  predictions  for  CP  asymmetries  much 
smaller  than  expected  from  the  KM  mechanism  of  explicit  CP  breaking. 


In  this  presentation  I  shall  cover  the  three  topics  all  of  which  pertain  to  the 
aspon  model: 

l)The  motivation,  arising  from  the  strong  CP  problem. 

2)The  phenomenology  of  CP  violation,  giving  upper  bounds  on  the  masses. 

3) Estimation  of  CP  asymmetries  in  B  meson  decays. 

Topic  (3),  done  in  collaboration  with  Ackley,  Kayser  and  Leung,  is  the  most 
recent  work  in  the  model  so  I  shall  emphasize  it. 

(1)  Solution  of  Strong  CP  Problem. 

In  solving  the  strong  CP  problem  there  are  three  alternatives: 

1)  Axion 

2)  mu  =  0 

3)  Spontaneous  (  or  soft  )  CP  violation 

(1)  ia  looking  less  and  less  plausible  experimentally  since  the  allowed  window  is 
10-5eV  <ma<  lQ~3eV. 

(2)  is  at  odds  with  pseudoscalar  meson  masses  and  spontaneous  chiral  symmetry 
breaking. 

The  aspon  model  is  an  economical  (  minimal  )  version  of  (3). 

The  strong  CP  problem  is  due  to  the  term  QqcoG^G^  in  the  QCD  lagrangian. 
The  physical  quantity  O  =  (&QCD+&QFD)  <  2x  10~10  for  dn  <  10~25e.cm.  where 
©QFD  =  —argdetM,  M  being  the  quark  mass  matrix. 

With  a  CP  invariant  lagrangian  SQCD  =  0  so  we  must  arrange  that  QQFD  < 
2  x  10-10. 

To  do  this  we  introduce  U(l}new  which  will  be  local  (gauged).  Thus  the  model 
has  gauge  group  SU($)C  *  5(7(2)^  x  I7(l)y  x  U(l)new .  All  the  particles  in  the  three 
family  standard  model  are  neutral  under  this  U(\}new.  In  the  simplest  version  of  the 
aspon  model  we  add  one  doublet  of  vector-like  quarks  which  carry  the  new  charge 


them. 

The  new  quarks  have  gauge-invariant  (heavy)  Dirac  masses  and  Yukawa  cou- 
plings to  the  light  quarks: 


Here  a  =  1,2  and  i  =  1,2,3.  The  superscript  C  implies  charge  conjugate.  There 
are  no  Yukawa  couplings  to  UL  or  to  DL  because  of  the  U(l)new  gauge  invariance. 
The  gauge  invariance  under  U(l)new  forces  the  4  x  4  quark  mass  matrices  (both 
up  and  down  sectors)  to  have  a  texture  such  that  the  determinant  is  real.  The 
diagonal  blocks  are  real  because  of  CP  conservation;  let  the  heavy  quarks  have 
mass  M.  The  off-diagonal  terms  are  of  the  form: 


and  arise  from  the  off-diagonal  Yukawa  couplings  listed  earlier.  Only  one  of  the  two 
off-diagonal  blocks  has  this  form,  the  other  being  zero;  this  is  how  the  determinant 
is  real. 

(2)  The  Phenomenology  of  CP  Violation,  Giving  Upper  Bounds  on 
the  Masses 

The  Yukawa  couplings  h^  are  restricted  to  be  «  1  by  the  limit  on  6  which 
is  generated  by  one-loop  corrections.  In  diagonalization  of  the  quark  mass  matrix 
the  relevant  expansion  parameter  is  a:,-  =  Fi/M  (i  =  1,2,3  for  the  three  families). 
In  the  aspon  model  these  satisfy  Xj  «  1. 

Since  U(l)new  is  clearly  anomaly  free,  it  can  be  gauged  and  its  gauge  boson,  the 
aspon  become  massive  by  the  Higgs  mechanism.  Combining  the  upper  limit  on  6 
and  the  values  of  e  and  e  from  K°  decay  it  is  possible  to  show  that  both  the  aspon 
and  the  vector-like  quarks  have  masses  <  GOOGeV.  The  analysis  also  bounds  the 
values  of  |zj|  to  be  <  10~2,  10~3,  1(T4  for  i  =  1,2,3  respectively. 

I  shall  not  go  through  the  details  here.  In  the  treatment  of  CP  violation,  there 
is  a  parallel  to  the  recent  paper  of  Deshpande  and  He  (OITS-529  12/93)  which  uses 
a  Sa  x  Zz  family  symmetry  introdued  in  1991  by  Ernest  Ma.  Their  case  (a)  has 
hard  explicit  CP  violation,  but  their  case  (b)  has  spontaneous  violation  and  has 
too  small  a  KM  phase  to  account  for  the  effect  in  the  K  system  which  is  instead 
explained  by  charged  Higgs  exchange  [the  analog  of  aspon  exchange]. 

In  the  MSSM  new  CP  phases  occur  and  must  be  suppressed  to  avoid  too  large 
a  value  of  dn.  Babu  and  Barr  (1993)  suggest  spontaneous  CP  violation  at  a  high 
scale  to  solve  this  by  providing  phases  only  in  gaugino  mass  terms. 

These  are  two  examples  of  other  recent  work  suggesting  spontaneous  CP  viola- 
tion. They  both  differ  from  the  aspon  model  as  can  be  seen  by  the  fact  that  they 
both  require  an  axion  to  solve  strong  CP. 


110  Pramana-  J.  Phys.,  Supplement  Issue,  1995 


(3)  estimation  ot  UJr  Asymmetries  in  B  JJecays 

In  B  decays,  using  the  KM  mechanism  it  has  been  estimated  that  with  108 
samples  one  could  see  CP  asymmetries  and  begin  to  check  the  unitarity  relations 
required  for  the  (complex)  CKM  matrix  elements. 

As  shown  in  [1],  the  details  are  different  in  the  aspon  model  because  aspon 
exchange  can  compete  with  the  usual  box  diagrams  for  the  decay  amplitudes.  In 
the  K  system,  the  mixing  is  dominated  by  aspon  exchange  and  that  led  to  the  upper 
limits  on  the  masses  mentioned  above.  The  large  phase  there  compensated  for  the 
small  phase  in  the  K  — *  ?nr  amplitude  from  W  exchange. 

For  the  B  system,  the  mixing  acquires  a  sizeable  contribution  from  the  W*W~ 
box  graph  and  the  compensation  is  incomplete  for  the  small  phase  occurring  in  the 
decay  amplitude.  The  reason  that  the  aspon  exchange  is  relatively  less  important 
in  the  mixing  can  be  traced  to  two  large  suppression  factors:  (1)  (mc/mt)2  ~  10~4 
and  (2)  |x3/x2|2  ~  HT2. 

As  a  result,  in  a  sample  of  108  B  decays  one  would  not  see  the  CP  asymmetry 
if  the  aspon  model  were  correct.  This  is  good  in  that  it  will  refute  the  standard 
model  if  one  defines  the  KM  mechanism  as  a  part  thereof  (this  may  be  unfair  to  the 
standard  model).  It  is  bad  because  there  would  still  be  no  example  of  CP  violation 
seen  outside  of  the  neutral  K  system. 

To  confirm  the  aspon  model,  the  most  convincing  would  be  the  production  of 
the  aspon  particle  in  a  collider. 

In  conclusion,  if  CP  asymmetries  are  undetectably  tiny  in  B  decay,  spontaneous 
CP  violation  is  an  alternative  to  the  KM  mechanism. 

This  work  was  supported  in  part  by  the  US  Department  of  Energy  under  Grant 
No.  DE-FG05-85ER-40219. 
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Abstract.  In  this  presentation,  I  discuss  an  extension  of  the  standard  model  which 
is  motivated  by  the  desire  to  allow  non-trivial  accommodation  of  three  families  and  is 
testable  by  the  production  of  new  particles  (especially  dileptons). 


The  present  talk  includes  (1)  a  description  of  the  model  (331  model);  (2)  the  cu- 
bic see-saw  for  neutrino  masses;  (3)  brief  comments  on  phenomenology  of  the  331 
model  for  hadronic  and  leptonic  colliders. 

(1)  Desciption  of  the  Model 

Family  symmetry  is  usually  taken  to  mean  a  horizontal  symmetry,  either  global 
or  gauged,  under  which  the  three  families  transform  under  some  nontrivial  rep- 
resentation. The  family  symmetry  is  broken  in  order  to  avoid  unobserved  mass 
degeneracies.  In  this  meaning  of  family  symmetry,  there  is  no  explanation  of  why 
there  are  three  families  which  is  an  input.  Rather  the  hope  is  that  the  postulated 
family  symmetry  may  explain  the  observed  hierarchies  of  the  fermion  masses  and 
the  quark  flavor  mixing  angles. 

In  the  present  model,  the  aim  is  indeed  to  attempt  to  address  such  unexplained 
hierarchies  and  to  explain  why  there  are  three  families.  This  may  be  a  necessary 
first  step  to  understanding  the  hierarchies  themselves. 

To  introduce  the  331  model  the  following  are  motivating  factors: 

(1)  Consistency  (  unitarity,  renormalizability)  of  a  gauge  theory  requires  that 
chiral  anomalies  be  cancelled.    This  requirement  alone  is  able  to  fix  all  electric 
charges  in  the  standard  model,  and  to  explain  charge  quantization,  e.g.  the  electric 
neutrality  of  the  hydrogen  atom,  without  the  necessity  for  grand  unification. 

(2)  This  does  not  explain  why  there  are  three  families  but  is  sufficiently  im- 
pressive to  suggest  that  Ng  =  3  may  be  explicable  by  anomaly  cancellation  in  an 
extension  of  the  standard  model.  This  requires  both  that  each  extended  family  have 
a  non-vanishing  anomaly  and  that  the  three  families  are  not  all  treated  similarly. 

(3)  A  striking  feature  of  the  mass  spectrum  of  the  standard  model  is  the  top 
quark  mass  suggesting  that  the  third  family  be  treated  differently  from  the  first  two 
and  that  the  anomaly  cancellation  be  proportional  to  the  combination  -f  1  + 1  —  2  =  0 
for  the  three  families. 

(4\  There  is  a.  factor  -2  fsnffffested  bv  (3\  above"}  alreadv  occurrine  in  the  theorv. 


bosons,  ine  adjoint  ot  bu(6)  breaks  down  to  3  4-  (2  -f  2)  -f  1.  The  1  is  a  Z  ana 
the  two  doublets  are  readily  identified  as  dileptonic  gauge  bosons  of  the  type  first 
encountered  in  577(15)  grand  unification. 

Several  interesting  contributions  to  517(15)  have  been  made  since  1991  (the 
WHEPP2  meeting  in  Calcutta)  by  B.  Brahmachari  of  the  Physical  Research  Labo- 
ratory in  Ahmedabad.  These  include  (with  Mann,  Steele  and  Sarker)  the  influence 
of  Higgs  on  unification  for  different  patterns  of  symmetry  breaking  -  published  in 
Phys.  Rev.  D  in  1991;  (with  Sarkar)  the  increase  in  the  GUT  scake  for  super- 
symmetrized  SU(15)  -  published  in  Phys.  Lett.  B  in  1992;  and  the  extension  to 
SU(16),  including  a  right-handed  neutrino,  and  the  symmetry  breaking  patterns 
and  phenomenology  -  published  in  Phys.  Rev.  D  in  1993.  There  have  also  been 
significant  contributions  to  577(15)  by  Deshpande,  Lavowa,  and  others. 

The  two  principal  shortcomings  of  517(15)  grand  unification  are  (1)  The  group  is 
so  large  that  the  ratio  of  unknown  particles  to  known  ones  is  unacceptably  high;  and 
(2)  the  anomaly  cancellation  is  inelegant,  being  accomplished  by  mirror  fermions. 

One  feature  of  5£7(15)  which  survives  in  the  331  model  is  the  presence  of  an 
5Z7(3)  electroweak  gauge  group.  But  it  is  not  a  subgroup  of  517(15)  as  can  be  seen 
by  the  fact  that  the  dileptonic  gauge  bosons  do  not  couple  to  the  quarks  in  517(15) 
wheras  they  do  in  the  331  model. 

Now  we  are  ready  to  describe  the  structure  of  the  model.  The  gauge  group 
is  extended  to  5f7(3)c  x  SU(3)L  x  U(l)X-  The  triplets  of  5(7(3)£  have  electric 
charges  (X-1,X,X+1)  while  antitriplets  have  charges  (X+1,X,X-1).  In  either  case 
the  X  value  is  the  electric  charge  of  the  middle  component. 

For  the  leptons  we  assign  (e~ ,  j/e,  e+)  to  an  antitriplet  and  similarly  for  the  //  and 
r  families.  The  left  handed  quarks  are  assigned  to  triplets  for  the  first  two  families 
and  to  an  antitriplet  for  the  third  family.  The  right-handed  quarks  are  taken  to  be 
singlets  as  in  the  standard  model.  No  new  leptons  have  been  introduced  but  there 
is  a  new  quark  per  family:  D  ,  S  (Q=-4/3)  in  the  first  and  second,  and  T  (Q=+5/3) 
in  the  third. 

Anomaly  cancellation  occurs  in  a  nontrivial  manner  between  the  families.  One 
can  check  in  turn  all  the  relevant  gauge  anomalies:  (3C)3,  (3z,)3,  (3c)2^,  (3/:)2X, 
X  ,  ^(T),,,)2.  Each  extended  family  has  non-vanishing  anomaly  for  the  2nd ,4th 
and  5th  of  these,  while  the  three  families  taken  together  cancel  with  anomalies 
proportional  to+l  +  l-2as  expected.  The  1st,  3rd  and  6th  anomalies  do  cancel 
family  by  family. 

In  the  Higgs  sector  there  is  a  triplet  $  with  X  =  +1  whose  VEV  breaks  the  331 
gauge  group  to  the  321  gauge  group  of  the  standard  model.  This  also  allows  D,S,T 
quarks  as  well  as  the  Z2  and  (Y~- ,  Y~)t  (Y++ ,Y+)  gauge  bosons  to  acquire  mass. 
The  electroweak  breaking  involves  three  5f7(2)  doublets  embedded  in  two  triplets 
<f>(X  =  0)  and  <j>  (X  =  -1)  and  a  sextet  Ha0(X  =  0).  The  sextet  is  necessary 
to  give  a  symmetric  (in  family  space)  contribution  to  the  charged  fermion  masses. 
Note  that  <j>  gives  masses  to  d,s,t  (and  an  antisymmetric  contribution  to  the  charged 
lepton  masses)  while  <f>  gives  masses  to  u,c  and  b.  Let  the  scale  of  breaking  of  331 
to  321,  the  VEV  of  $,  be  denoted  by  U.  What  can  the  scale  U  be?  There  is  a  lower 
bound  coming  from  precision  electroweak  data:  what  makes  the  331  model  more 
interesting  is  that  there  is  also  an  upper  bound. 
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The  lower  bound  comes  from  the  limits  on  Z  —  Z  mixing  and  from  the  absence 
of  flavor-changing  neutral  currents.  The  latter  requires  that  it  is  the  third  family 
which  is  treated  asymmetrically  rather  than  the  first  or  second.  A  lower  bound 
on  the  Y  mass  comes  from  data  on  polarized  muon  decay.  The  Y  would  give  a 
component  (V-f-A)  compared  to  the  W  which  is  (V-A).  The  strong  limits  on  the 
parameter  f  defined  as  the  coefficient  of  (-A),  namely  1  >  £  >  0.997  imply  that  the 
mass  of  the  dilepton  must  exceed  300  GeV.  More  accurate  measurements  of  muon 
decay,  and  of  muonium-antimuonium  conversion  will  put  better  limit  on  the  dilep- 
ton mass.  Incidentally,  it  is  fascinating  that  these  low  energy  experiments  rather 
than,  say,  LEP  data  place  the  most  stringent  limits  on  this  new  physics. 

The  upper  limit  on  U  arises  because  the  electroweak  mixing  angle  must  be  less 
than  0.25  in  order  that  the  321  standard  model  can  be  embedded  in  331.  It  is  easy 
to  see  that  if  SU(2)  x  U(l)  is  embedded  just  in  SU(3)  then  the  value  of  sin^O  is 
exactly  1/4,  and  if  this  value  is  exceeded  then  the  X  coupling  constant  becomes 
imaginary.  Since  the  value  is  already  close  to  one  quarter  at  the  Z  mass  and  it 
increases  with  energy  the  consequence  is  that  the  scale  U  cannot  be  more  than  a 
few  TeV.  Requiring  that  ax  not  exceed  2?r  then  enforces  that  M(Zi]  <  2.2  TeV. 

(2)  Cubic  See-  Saw  for  Neutrino  Masses 

In  the  minimal  331  model  neutrinos  are  massless  since  AZ/  =  0  and  there  are 
no  right-handed  neutrino  fields.  We  have  examined  (with  P.  Krastev  and  J.T.Liu) 
the  effects  of  soft  breaking  of  L  on  the  neutrino  masses. 

Spontaneous  breaking  of  L  would  lead  to  a  massless  Majoron.  Therefore  we 
employ  explicit  breaking  by  terms: 


Ha*»H**P'caia,a»cwah  +  m2(Haffafp  +  h.c.)  (1) 

The  Yukawa  couplings  to  the  leptons  are  given  by: 
HCliqH""  +  tijfoLfae"*  -I-  /i.e.)  (2) 

The  couplings  /ii,/*2  are  symmetric,  antisymmetric  respectively  in  family  space. 

At  one  loop  level  there  are  diagrams  where  the  soft  breaking  of  L  through  the 
parameters  mi,  m^  gives  rise  to  neutrino  masses.  What  is  unique  to  the  331  model 
is  the  presence  of  contributions  of  the  general  form: 

Af  (i/O  -  CM(liflM$,  (3) 

This  is  the  cubic  see-saw.  M(/i)  is  the  mass  ofthe  corresponding  charged  lep- 
ton.lt  arises  because  all  the  Yukawa  couplings  are  related  by  5t/(3). 

If  we  take,  merely  as  an  example,  Dennis  Sciama's  favorite  value  for  the  tau 
neutrino  mass  of  29.3  eV,  then  the  mu  neutrino  mass  is  6.2meV  (that  is,  0.062 
eV),  and  the  electron  neutrino  mass  is  690  peV.  This  last  tiny  mass  gives  rise  to  a 
contribution  to  neutrinoless  double  beta  decay  but  it  is  many  orders  of  magnitude 
below  present  experimental  limits. 

As  is  usual  with  this  type  of  formula,  one  can  fit  any  two  out  of  Cold  Dark 


there  is  no  right-handed  neutrino  involved. 

(3)  Phenomenology  of  the  331  Model  for  Hadronic  and  Leptonic  Col- 
liders 

In  hadronic  colliders,  for  example  pp  at  10  TeV  center  of  mass,  dileptons  can  be 
both  pair  produced  and  associatedly  produced  in  conjunction  with  new  quarks 
(which  themselves  carry  lepton  number).  Explicit  calculations  show  that  such 
events  have  a  healthy  detectable  rate  for  any  luminosity  sufficiently  high  to  be 
otherwise  interesting  in  looking  for  Higgs  bosons  or  low-energy  supersymmetry. 

In  an  e~e~  collider  at  300-1000  GeV  center  of  mass  energy  the  Y —  could  ap- 
pear as  a  striking  narrow  direct  channel  resonance.  One  would  best  look  at  the 
channel  e~e~  — >•  n~ /j~  for  which  the  standard  model  background  is  zero. 

This  work  was  supported  in  part  by  the  US  Department  of  Energy  under  Grant 
No.  DE-FG05-85ER-40219. 
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1.  Introduction 

Large  Electron  Positron  collider  (LEP)  at  CERN,  Geneva,  is  in  operation  since 
September  1989.  There  are  four  mammoth  detectors  -  ALEPH,  DELPHI,  L3  and 
OPAL-  which  are  collecting  data  at  LEP.  The  purpose  of  LEP  is  to  study  the  Z° 
during  LEP100  Phase  (  1989-1995  )  at  ^/a  ~  90GeV.  By  the  end  1993  each  of  the 
four  detectors  collected  a  total  number  of  Z°  events  ~  2  x  106  .  Some  of  the  results 
from  these  studies  are  [1]: 

(a)  Mass  of  Z°    is  measured  to  a  precision  of  ^-  —  10~4. 

(b)  Number  of  light  v   is  measured  as  Nv  =  3.00  ±  0.03. 

(c)  Effective  value  of  the  electroweak  mixing  angle  is  measured 
to  be  sin20eff  =  0.2321  ±  0.0006. 

(d)  Running  of  <*„   is  established  at  LEP. 

(e)  Top  mass  is  estimated  to  be  166t}g  ±  20   GeV 

(f)  Lower  limit  on  Higgs  boson  mass  is  63.5  GeV 

(g)  Standard  Model  of  EW  interactions  is  tested  at  LEP  to  a 
precision  better  than  0.5  %. 

LEP200:  The  second  phase  of  LEP,  sometimes  called  LEP200  phase,  is  to  achieve 
centre  of  mass  energy  for  e+e~  collision  above  the  W  production  threshold  so  as 
to  observe  and  study  for  the  first  time  the  reaction 

e+e-     -+     W+W  (1) 

It  may  be  recalled  that  one  of  the  main  reasons  for  the  need  of  Z°  is  to  regularise 
the  production  cross  section  of  W+W~;  without  the  Z°  the  energy  dependence  of 
the  cross  section  violates  unitarity  bound,  fig  1. 

Beam  Energy:  The  upgradation  of  the  LEP  beam  energy  from  ~  45  GeV  to  above 
the  W  mass  threshold  will  be  achieved  by  adding  192  superconducting  RF  cavities 
in  the  LEP  ring.  This  will  lead  to  Ebeam  =  88  GeV  yielding  centre  of  mass  energy 
for  collision  as  -^/s  =  176  GeV. 

Luminosity:  The  expected  luminosity  is  4  x  1031cm~2sec~1.    With  the  running 


period  of  ~  6  months  a  year  for  LEP,  the  integrated  luminosity  in  a  year  will  be 
~  240pb~1.  Total  approved  luminosity  at  LEP200  is  expected  to  be  500  pb~!.  With 
cr  (e+e~  — >•  W+W~)  ~  16pb,  the  number  of  expected  W  pairs  per  experiment 
will  be  ~  8000. 


v  exchange 


GSW 


160  200  240 

GeV  (e+  e~) 


Figure  1.  Production  cross  section  of  e+e    — »•  W+W~  as  a  function  of  >/s.  Dotted  line 
is  for  only  v   exchange  while  the  solid  line  is  from  the  Standard  Model. 


118 


Pramana-  J.  Phys.,  Supplement  Issue,  1995 


Aims  ol  LLr  2UU:  I  he  physics  capabilities  of  LEP200  have  been  extensively  studied 
[2,  3].  There  are  however  three  main  topics:  (a)  Precision  measurement  of  MW  and 
IV;  (b)  Determination  of  Triple  Gauge  Boson  couplings,  and  (c)  Search  of  New 
Particles.  In  this  talk  we  will  restrict  ourselves  to  the  first  topic. 


2.  Prsent  knowledge  of  MW  and  FW 

Present  knowledge  of  MW:  Mass  of  W  has  been  measured  in  two  ways: 

(i)  Direct  Measurements 

Experiments  at  pp  colliders,  UA2  and  CDF,  made  direct  measurements   [4]  of  the 
W  mass  from  W— *  Iv   events.  Their  results  are  summarised  below. 

MW/MZ    =    0.8813  ±  0.0036  ±  0.0019  GeV  (UA2) 
Mw    =    79.91  ±0.39  GeV    (CDF) 


The  combined  result  is: 


Mw  =  80.22  ±0.26  GeV 


(ii)  Indirect  Estimation 

The  LEP  data  consisting  of  energy  dependence  of  various  cross  sections  (  e+e~  — +  Z° 
and  Z°  — +  hadrons,  e+e~,  p+/i~  and  T+T~)  and  asymmetries  when  fitted  in  the 
framework  of  the  Standard  Model  leads  to  the  mass  of  W  and  this  value  is  [1]: 

Mw     =    80.25  ±0.10  ±0.03  GeV    (LEP) 
[  Second  error  due  to  variation  of  Higgs  between  60-1000  GeV] 

Present  knowledge  of  Fw:  The  W  width  as  quoted  by  UA2,  UA1  and  CDF 
experiments  [4]  are  not  direct  measurements;  the  width  is  'extracted'  from 
the  measurement  of  R 

R    =     <r(W-+ei/)/cr(Z-+e+e-)  (2) 

(3) 
<r(pp  ->  W  +  X)      F(W  -+  ei/)       Fz 


<r(pp  ->  Z  +  X)     F(Z  ->  e+e- )     Fw 


(4) 

The  quantities  tr(pp  -»•  W  +  X)/<r(pp  -»  Z  +  X)    and  Y(W  -+  ei/)/F(Z  -^  e+e~) 
are  calculated  from  theory,  and  FZ    is  taken  from  LEP  measurements. 

Values  from  the  three  collider  experiments  are  summarised  below: 

Tw     =    2.10  ±0.14  ±0.09  GeV  (UA2)  (5) 

Fw     =    2. 18  ±0.26  ±0.04  GeV  (UA1)  (6) 

Fw     =    2.12  ±0.20  GeV  (CDF)  (7) 


The  combined  result  is: 


Tw  =  2. 12  ±0.11  GeV 


T» 


W  is  a  vector  gauge  boson  and  therefore  the  knowledge  of  its  mass  is  very  important. 
Some  of  the  other  reasons  are: 

[A]  It  may  be  recalled  that  the  top  quark  and  the  Higgs  boson  are  the  two  unknowns 
in  the  Standard  Model.  Precision  measurement  of  MW  will  give  further  constraints 
on  the  mass  of  the  Higgs  if  the  top  quark  mass  is  known,  fig.  2,  [5]. 


80.4  - 
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Figure  2.  Mw  vs  Mtop  curves  in  the  framework  of  the  Standard  Model  for  three  different 
values  of  Higgs  mass. 


For  example  if  Top  mass  is  known  from  hadron  collider  experiments  at  Fermilab 
(a)  with  a  precision  of  AMt0p  =  10  GeV,  then  50  MeV  error  on  W  mass  mea- 
surement will  lead  to  AMHigg*  =l?tS  GeV,  and  (b)  precision  in  AMtop  =  2  GeV 
and  50  MeV  in  Mw  will  yield  AMmggs  =t\H  GeV. 
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[B]  Testing  radiative  corrections 


Radiative  corrections  come  from  self  energies  ,  formfactors,  box  diagrams  etc.  The 
mass  of  the  W  and  Z  are  related  by, 


M2W     = 


trot 


v/2GF(l  -  MW/M|)    1-Ar 


(8) 


The  error  in  the  radiative  correction  Ar  arises  mainly  from  the  error  in  MW  as 
the  value  of  MZ  is  known  to  a  very  high  precision  (AMz  =  6  MeV).  The  error  of 
50  MeV  in  MW  will  lead  to  error  on  Ar  as  0.003. 


It  may  be  noted  that  the  radiative  correction  Ar  depends  on  Mtop  andMniggs  and 
hence  can  be  determined  independent  of  MW-  The  L3  experiment  [6]  at  LEP  has 
determined  the  value  of  Ar  =  0.045  ±  0.013. 


Two  ways  of  determining  radiative  corrections  as  mentioned  above  can  thus  be 
compared. 


4.   W  production  mechanism 


Leading  order  diagrams  are  due  to  Z,  7   and  v  exchanges  and  their  interferences. 


Initial  state  helicities  are  LR  or  RL  for  the  Z  and  7  exchanges,  while  only  the  LR 
diagram  is  allowed  for  the  v  exchange  (since  v  is  left  handed).  It  may  be  noted 
that  the  v  diagram  is  the  dominant  one  and  as  a  result  W+  and  W~  are  peaked 
along  the  pe+  and  pe-  directions  respectively,  fig.3. 
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0.5  0.73  1. 
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Figure  3.   Angular  distribution  of  W+     and  W~    with  respect  to  the  direction  of    e"4" 
beam. 


Determination  of  mats  and  width  of  W  at  LEP200 

For  the  forward-backward  production  the  longitudinal  component  of  the  W  helicity 
(A  =  0)  is  equally  important  as  the  transverse  states  (A  =  +1  or  —  1). 


w  — 

=  0or  1 


•w 


=  — lorO 


5.  W  decay  modes 


Final  states  from  e+e~  —  +  W+W~  are  classified  into  three  classes  depending  on  W 
decay  modes 

Pure  Leptonic   :       WW  -+  Ivlv   :         7.5  % 


Semi  Leptonic   :        WW 


40  % 


Pure  Quark       :       WW  ->  qqqq    :          52.5  % 

A  quark  from  W  decay  will  undergo  fragmentation/hadronization  leading  to  a  for- 
mation of  jet  in  the  final  state. 

Topology  of  WW  events 

(i)  Since  the  W"*"   and  W~    are  peaked  along  pe-f    and  pe-    respectively  and  hence 
the  charged  leptons  from  W  decay  will  reflect  the  parents  angular  distributions. 

(ii)  Pure  Leptonic  Mode:  Two  visible  leptons  or  a  thin  jet  from  the  hadronic  decay 
of  T  will  be  seen  in  the  final  state. 


(iii)  Semi-leptonic  decay  :  Two  jets  and  one  isolated  lepton  or,  two  jets  and  one 
thin  jet  in  the  final  state. 


(iv)  Pure  Quark  decay  :  Four  jets  or  more  in  the  final  state. 


Production  cross-section  of  W+W~    process  is  ~  16pb.  In  this  section  we  discuss 
briefly  several  sources  of  some  of  the  background  and  they  are: 

(0  e+e~  -vZZ  -+ff  : 


Total  yield  of  this  reaction  is  only  a  few   %  of  the  WW  signal, 
(ii)  e+e-  -*  f  f : 

The  cross-section  is  10  pb  for  f  =  lepton  and  leads  to  nearly  back  to  back  leptons 
which  can  be  removed  by  an  appropriate  cut.  The  cross-section  is  26  pb  for  f  = 
quark,  but  then  this  reaction  is  dominated  by  2  and  3  jets,  while  WW  signal  leads 
to  4  or  more  jets. 


(iii)  e 


+~ 


f  f  7 


This  reaction  is  due  to  on-mass  shell  Z°  exchange  with  monochromatic  7  of  ~ 
70  GeV.  Cross  section  for  this  reaction  is  80  pb  for  f  =  quark  &  20  pb  for  f  =  lepton; 
Major  fraction  of  these  events  can  be  rejected  from  large  missing  momentum. 
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This  reaction  will  have  two  leading  electrons  which  are  lost  in  the  beam  pipe. 
Missing  PT   and  acoplanarity  will  remove  most  of  this  background. 


7.  Experimental  methods  to  measure  M\v 

Three  methods  have  been  discussed  in  the  literature  and  they  are: 

[A]  Direct  Reconstruction: 

W's  are  reconstructed  using  either  hadronic  events,  where  both  W  decay  into  jets, 
or  hadronic-leptonic  events,  where  one  W  decays  hadronically  while  the  other  into 
leptonic  mode. 

Note  that  the  masses  of  the  two  W's  produced  will  have  different  masses 
due  to  finite  width  of  W.  Some  of  the  groups  have  assumed  equal  masses  for 
reconstructing  W's  [5].  In  L3  we  have  assumed  unequal  masses  for  reconstructing 

W's. 

[B]  Shape  of  the  Excitation  Curve: 

From  the  measured  shape  of  the  production  cross  section  one  deduces  the  value  of 


[C]  Lepton  end  point  energy: 

The  value  of  MW    is  obtained  from  the  lepton  spectrum  of  leptonic  decays  of  W's 
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8.  W  pair  generators 


To  carry  out  simulation  studies  we  need  Monte  Carlo  programs  that  generate 
e+e~  — *•  W+W~  events.  There  are  several  generators  available  and  they  are 
summarised  in  Table  1 .  Some  of  the  important  ingredients  used  in  a  generator  like 
inclusion  of  W  width,  proper  implementation  of  CKM  matrix  elements,  incorpo- 
ration of  the  initial  state  radiation,  flexibility  in  triple  gauge  boson  coupling,  and 
hadronization  scheme  are  also  indicated  in  Table  1. 


Table  1. 


Generator 

Herwig 

Pythia 

LEPWW 

GENTLE 

WWgamma 

rw 

Y 

Y 

Y 

Y 

Y 

VCKM 

Y 

Y 

N 

Y 

Y 

ISR 

N 

Y 

Y 

Y 

hard  photon 

General 

ZWW,  7\VW 

N 

N 

Y 

N 

N 

Hadronization 

Cluster 

LUND 

LUND 

LUND 

LUND 

Author 

Webber 

Sjostrand 

Kleiss 

Bardin 

KEK-LAPP 

REMARKS:   GENTLE    [7]  makes  a  detailed  implementation  of  ISR  taking  into 
account  t-channel  exchange  diagrams  properly. 


9.  Jet  reconstruction  algorithms 


When  both  the  Ws  decay  by  hadronic  mode,  one  recombines  the  hadrons  to  make 
four  jets;  each  jet  representing  a  parent  quark.  Since  the  W's  are  slow  (ft  ~  0.5), 
the  hadrons  of  a  given  jet  are  not  well  separated  from  those  of  other  jets.  Therefore 
a  reconstructed  jet  may  have  particles  of  another  jet.  When  one  W  decays  hadron- 
ically  and  the  other  via  leptonic  mode  then  the  number  of  jets  is  reduced  and  the 
mismatch  is  much  less. 

There  are  several  different  algorithms  for  the  reconstruction  of  jets.  Below  we 
discuss  six  different  algorithms. 

(i)  Generator  History: 

In  this  procedure  one  recombines  the  hadrons,  belonging  to  a  given  quark,  to  form 
a  jet.  Therefore  no  mismatch  of  particles  between  different  jets.  This  is  possible 
because  we  are  dealing  with  generated  events. 


Determination  of  mass  and  width  of  W  at  LEPSOO 

(ii)  Jade  algorithm   [8]    : 

_     2EiEj(l-coeflu) 

y*j   —          ^2  iy; 

^vis 

The  quantity  yy  is  based  on  invariant  mass  squared  of  'jets'  i  and  j  (to  start  with 
one  uses  small  resolvable  cluster  as  a  jet).  E,-  and  Ej  are  energies  of  jets  with  0,-j 
and  Evis  as  opening  angle  and  total  visible  energy  respectively.  The  pair  of  jets 
with  the  smallest  yy  are  combined  into  a  new  jet  and  all  y[jS  are  recalculated; 
this  procedure  is  repeated  iteratively  until  we  are  left  with  four  jets.  The  exact 
definition  of  yy  is  different  in  (iii),  (iv)  and  (v)  below. 

(iii)  Durham  algorithm   [9]: 

2Mm(Ef,E?)(l-cos0y) 

y*j   ~  p2 

vis 

(iv)  Geneva  algorithm   [10]: 

_       SEjEjQ-COBfly) 

Vij     ~         9(Ei-fEj)2 
(v)  Lund  algorithm   [11]: 

__     2|piiipj|8in| 


(vi)  Generalised  thrust  approach  [12]  :  Thia  approach  has  been  used  at  PETRA 
to  study  4  jet  events.  The  aim  is  to  find  four  jet  axes  as  the  unit  vectors  rij  so  that 
a  generalised  form  of  the  standard  thrust 


is  maximised. 

10.  MW  from  direct  measurement 

There  are  three  types  of  events: 

(i)  Pure    Quark  or  'HadronicDecay  :      W+  W~  — >•  qi  q2  qa  q4 

(ii)  Semi    Leptonic  Decay  :        (a)  W+  W  -+  qi  q2  e  i/e    ,       (b)  W+  W~  -* 

(c)  W+  W~  -*  qi  q2  r  vr 

(iii)  Pure    Leptonic  Decay  :       W"1"  W~  — *  I  v\  I'  v\» 
Energy  and  momentum  conservation  of  the  reacilon  can  be  made  if  we  are  able 


00    IU   10    U\J\i    VJ 


,  , 

three,  and  (b)  energy  of  r   as  its  decay  involves  v   corresponding  to  a  decrease  of 
one  constraint. 

Keeping  in  mind  the  above  points  we  will  not  presently  use  (iic)  and  (iii)  for  our 
kinematic  fits. 

10.1  Details  of  Kinematic  Fitting  : 

As  mentioned  earlier  one  carries  out  kinematic  fits  to  impose  energy  momentum 
conservation.  Constraint  equations  are  described  in  detail  below. 

(j)  WW-»GiJ2)aaJ4) 
Constraint  equations  are  => 

(1)  Ejl  + 

(2)  Pfi  +  P 

(3)  PJi  +  PJi+Ps 


Besides  the  above  4  constraints,  two  more  constraints  can  be  used  if  one  assumes 
equal  masses  for  the  two  W's 

(5)  (Ej!  +  EJ2)2  -  (pV  +  pj2)2  -  M2W  =  0 

(6)  (EJ3  +  EJ4)2  -  (pj3  -f  pj4)2  -  M^  =  0 

With  unknown  as  Mw   one  ends  up  with  5C  fit. 
(jj)  WW 


Repeating  the  procedure  as  described  in  (i)  yields  2C  fit  for  this  reaction. 

Remark:  Allowing  the  two  W  masses  to  be  unequal  one  ends  up  with  4C  and  1C 
fits  respectively 

10.2  Effect  of  initial  state  radiation 

Initial  state  radiation  (ISR)  is  a  common  feature  in  e+e"  interactions  i.e.,  the  e~ 
emits  a  photon  in  the  presence  of  the  Coulomb  field  of  the  e+  or  vice  versa.  The 
distribution  in  the  photon  energy  from  ISR  is  shown  in  fig.4  from  the  GENTLE 
generator  and  its  mean  value  is:  <  Erad  >=  1.211GeV  at  ^=  176  GeV. 

The  effect  of  ISR  in  the  cross  section  for  e+e~  -»  W+  W~  is  incorporated  as  follows: 
Without  ISR  the  cross-section  is  given  by  [13]: 


ro(s;mi,m2)  (14) 
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Figure  4.  Energy  distribution  of  ISR  photons. 
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p(m)    = 


Mw  and  Tw  are  the  mass  and  width  of  W;  mi  and  m2  are  the  two  produced 
W  masses  and  p(m)  is  the  Breit-Wigner  (BW)  distribution.  <ro(«;mi,m2)  is  the 
Born  cross  section  [13]  at  the  centre  of  mass  energy  of  -  ^ 


With  ISR  the  cross-section  is  given  by   [7]: 


r(s)    =     /" 
Jo 

1>S',  =  / 

Jo 


dm 


• 


y.(V8-mi)3 

)  I  drn^  •  p(m2)  - 

Jo 

dx  •  <T0(s(l  -  x))  •  H(x,  s) 


H(x,s)    = 


Of  -TC 


(16) 

(17) 
(18) 
(19) 
(20) 
(21) 

7T 

H(x,s)  is  the  radiator  function  [14]  and  x  is  a  fractional  momentum  carried  by  the 
photon. 

From  the  two  distributions,  eqs.(14)  and  (16),  we  calculated  the  average  value  of 
mass,  <  (mi  +  m2)/2  >  and  the  mass  difference  |mi  —  m2|  and  they  are  listed 
below. 


Parameters 

Qo 
Without  ISR 
(GeV) 

Qi 
With  ISR 
(GeV) 

Qo-Qi 

(MeV) 

Mass 

80.046 
(3.19) 

79.972 
(3.20) 

74.0 

<  (ni!  +  m2)/2  > 

79.835 
(2.85) 

79.732 
(2.93) 

103.0 

|nu  -  m2| 

3.292 
(4.61) 

3.341 
(4.77) 

-49.0 

The  effect  of  ISR   [15]:  Finally  we  may  conclude  the  following  effect  due  to  ISR 
on  our  distributions: 

(a)  To  lower  the  mean  values  of  mass  and  <  (mi  +  m2)/2  >  by  74  and  103 

"Kf\T  i«l  V-4'*//  «/ 

MeV  respectively, 

(b)  To  increase  the  dispersion  by  nearly  0.2%, 

(c)  To  decrease  the  total  cross  section  by  ~  2  pb. 
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j.ney  nave  generated  «s  oouu  events  01  vv  •  vv  — »•  4jei  ana  w  louu  events  01 
(2jet  -f  It/)  corresponding  to  w  400pb~1.  Events  were  simulated  through  DELPHI 
detector.  Jet  reconstructions  were  carried  out  using  generalised  thrust  algorithm. 

Kinematic  fit  was  carried  out  with  the  constraint  that  the  two  W  masses  are  equal 
[5C  &  2C  fit  for  4-jet  &  2-jet+ei/e  respectively].  To  extract  MW  from  the  data  : 
fit  was  made  with  a  BW  of  fixed  width  folded  with  a  Gaussian  (detector  resolution 
function)  plus  a  constant  background. 

RESULTS:  Fitted  mass  distributions  of  W  are  shown  in  figs.5(a)-5(c)  for  three 
types  of  events:  4  jets,  2  jets  -f  pv  and  2  jets  -f  ei/.  They  find  a  systematic  positive 
offset  of  the  fitted  value  of  the  mass  with  respect  to  true  W  mass  used  in  the 
generator.  This  is  summarised  below: 


(Fitted  Mw  -  M§pe)  =  422  ±  79  MeV,     (4-jet) 


(Fitted  Mw  -  Mwue)  =  819  ±  160  MeV,     (2  jet  +  e) 
(Fitted  Mw  -  Mwue)  =  432  ±  116  MeV,     (2  jet  -f  p) 

Results  from  L3  study    [15] 

L3  collaboration  has  also  carried  out  similar  study  but  without  the  detector  sim- 
ulation. They  have  used  events  generated  by  GENTLE  and  quarks  were  allowed 
to  undergo  hadronization.  Jets  were  then  reconstructed  ignoring  ISR  photons. 
Kinematic  fits  were  carried  out  by  ignoring  the  ISR  photons. 

Figs.6(a)  and  (b)  show  the  distributions  of  (Fitted  -  Generated)  mass  for  two  cases: 
(i)  for  all  events  and  (ii)  for  events  with  ISR  photon  energy  E7  <   20  MeV. 
Mean  values  of  the  two  distributions  are: 


(Fitted  Mw  -  Mwue)  =  477  ±  16  MeV,     (all  events) 
(Fitted  Mw  -  Mwue)  =  -11  ±  7  MeV,     (events  with  E7  <   20  MeV.) 

From  the  above  results  it  is  clear  that  (i)  when  there  is  no  ISR  photons  (which  is 
same  as  selecting  events  with  E7  <  20MeV)  the  value  of  (Fitted  MW  —  M^6)  is 
consistent  with  zero,  and  (ii)  when  all  events  are  used  (and  not  using  ISR  photons 
in  kinematic  fits)  one  sees  the  positive  offset  as  seen  by  DELPHI. 

Understanding  the  positive  offset  in  MW 

The  emmision  of  ISR  actually  reduces  the  effective  centre  of  mass  energy  to 
which  is  lower  than  the  nominal  energy  ^/s  =  2  x  Ebeam-  If  x  is  the  fractional  beam 
energy  carried  by  the  initial  state  photon  then  x/s7  =  s(l  —  x). 

But  kinematic  fits  as  discussed  earlier  assume  nominal  cm  energy  >/3-  This  results 
in  higher  Fitted  Mass  and  the  positive  offset  can  be  written  down  as:  Am  =  +  < 

Pramana-  J.  Phys.,  Supplement  Issue,  1995  131 


&,=0.422±0.079(GeV/cJ) 
a,-  1.442:1:0.1  26  (CoV/c1) 


Aw=0.432±0.1  1  6  (GeV/ca) 
,  =  2.006±0.136  (GeV/c2) 


^=O.B19±0.l60(GeV/c3) 
,  =  2.162±0.202  (GeV/cl) 


82  84  86 

MF  (GeV/c2) 

Figure  5    Fitted  mas8  distributions  of  W  for  three  types  of  events  (a)  4  jets,  (b)  2  jets 
-I-  \iv  and  (c)  2  jets  +  e»/.  v  '     J 
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(Fitted  -  Generated)  mass  in  GeV  for  W  (All  events) 
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Figure  6a.  (Fitted  -  generated)  mass  distribution  for  all  events 
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(Fitted  -  Generated)  mass  in  GeV  for  W  (ET  <  20  MeV) 


7000 


6000 


5000 


4000 


3000 


2000 


1000 


-   L3 


Entries 

Mean 

RMS 

UDFLW 

OVFLW 

ALLCHAN 


9874 

-0.1070E-01 

0.6697 

29.00 

8.000 

9837. 


(b) 


I   i 


-20         -15          -10  -5  0  5  10  15  2( 

Jets  reconstructed  ignoring  ISR  photon 


Figure  6b.   (Fitted  -  generated)  mass  distribution  for  events  with  ISR  photon  en 
E-,  <  20  MeV. 
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trad  >  "Jf  • 

For  v/s  =  176  GeV  and  the  value  of  <  E^d  >=  1.2  GeV  yield  Am  =  500  MeV. 
Shift  in  MW  due  to  Reconstruction  algorithm 

DELPHI  collaboration  repeated  their  fit  analysis  without  ISR  in  the  Monte  Carlo 
and  obtained  (Fitted  Mw  -  M^e)  =  -130  MeV.  This  is  due  to  reconstruction 
algorithm  and  this  needs  to  be  done  by  other  groups. 

Expected  precision  in    MW 

Projecting  the  Fit  Analysis  to  SOOpb"1  DELPHI  estimated  the  expected  precision 
in  MW  to  be  55  MeV.  Similar  value  for  the  precision  has  also  been  estimated  by 
L3  [16],ALEPHfcOPAL. 


II.  Simultaneous  determination  of  MW  and  FW  (L3)  [17] 

The  analysis  mentioned  above  was  based  on  fitting  equal  mass  to  both  the  W,  which 
is  not  technically  right  and  also  there  is  no  straight  way  to  determine  the  FW- 

L3  collaboration  has  proposed  a  strategy  to  determine  simultaneously  the  mass  and 
width  of  the  W.  They  have  used  proper  expression  for  the  cross  section  involving 
convolution  of  two  Breit-Wigner  distributions  weighted  by  tree  level  cross-section: 

/•s  /•(•A-mi)2 

a(s)     =      /    dmi-p(mi)  /  dm^  •  p(m2)  •  <r0(s;mi,m2)  (22) 

Jo  Jo 

(23) 


Distributions  of  mass,  mass  difference  and  mean  mass  are  then  given  by 

/.(v/S-mi) 

dff/dmi     =    4mip(mi)-  /  dm2[m2p(m2)«To(s;mi,m2)]  (24) 

Jo 

/0.5>/8 
d  <  m  >  [mim2p(mi)p(m2)cr(s,  s')  (25) 

_.5Am 

/2<m>  _ 

dAm[mim2p(mi)/>(m2)cr(8,s')  (26) 

. 

Where  Am  =  |mi  -  m2|,  and  <  m  >=  (mi  +  m2)/2 

It  is  proposed  to  use  either  the  variables  mi  and  m2  or  Am  and  <  m  >.  ISR  has 
been  incorporated  in  these  expressions,  see  sect.  10.2. 

RESULTS:  Results  are  presented  here  at  the  generator  level  i.e.,  without  simu- 
lating through  the  detector.   Total  event  sample  used  was  10,000  events  without 


m  ________        T 
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using  maximum  likelihood  method  [15j,  and  (11)  Am  &  <  m  >  using  x  minimi- 
sation method.  Fits  from  (ii)  are  shown  in  fig.7.  Values  of  our  fits  are  shown  in  the 
Table  2. 


Table  2. 


Fitted  quantities 

MW 

(GeV) 

rw 

(GeV) 

Am,  <  m  > 

80.231  ±0.014 

2.038  ±0.022 

mi,  m2 

80.194  ±0.012 

2.041  ±0.027 

At  the  generator  level  the  errors  on  mass  and  width  from  10,000  events  are  14  and 
27  MeV  respectively.  Estimated  errors  on  mass  and  width  with  detector  resolution 
are  ~  50  MeV  an-d  ~  100  MeV  respectively. 


12.  MW  from  the  excitation  curve 

This  approach  utilises  the  energy  dependence  of  the  production  cross  section  of 
(W+W~).  But  it  is  difficult  to  disentangle  the  effect  due  to  Mtop,  Mniggs,  possible 
new  physics  etc.  on  the  cross  section.  Sensitivity  of  cross  section  on  MW  is  shown 
[5]  in  the  fig. 8.  Fractional  change  in  cross  section  for  change  in  mass  by  100  and 
200  Mev  are  shown.  It  is  seen  that  the  fractional  change  is  less  than  1%  even  for 
change  in  mass  by  200  MeV  for  >/s  >  172  GeV.  This  method  as  it  stands  at 
present  does  not  seem  to  be  very  useful. 


13.  MW  from  the  lepton  end-point  energy 

The  lepton  energy,  Ef  ,  from  the  decay  of  W  is  limited  by  MW 


E,  <   E+  =  iEb-f- 

Here  Eb  is  the  beam  energy.  The  error  on  MW  due  to  error  AE+  on  the  end  point 
is  given  by, 


AMw  = 

Several  studies,  see  e.g.    [5],  have  been  made  from  the  distribution  of  the  generated 
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Figure  8.   Fractional  change  in  cross  section  as  a  function  of  ^/s.    (a)  Exact  one  loop 
result,  and  (b)  Born  approximation. 
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is  expected  to  be, 
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Figure  9.  Lepton  energy  distribution  from  W— »•  Iv. 

AMW  »  270MeVy^?£^ 
i.e.,  the  error  expected  is  more  than  200  MeV. 

14.  Summary 

•  After  a  very  useful  run  of  physics  at  the  Z°    peak  the  reaction  e+  e~"  — »•  W+W" 
will  be  studied  at  LEP200  starting  from  1996. 

•  Longitudinal  polarization  of  W  will  become  available  besides  the  transverse  one 

•  The  approved  integrated  luminosity  is  500  pb"1  corresponding  to  ~  8000  events 

•  Aims  of  LEP200 

(a)  Precision  measurement  of  MW    and  TW 

(b)  Determination  of  Triple  Gauge  Boson  couplings 

(c)  Search  of  New  Particles 

•  Three  methods  have  been  studied  regarding  the  measurement  of  MW    and  TW 
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[B]  Shape  of  the  Excitation  Curve 

[C]  Lepton  end  point  energy 

Last  two  methods  are  not  very  useful. 

•  Precision  expected  for  MW   is  c±  50  MeV. 

•  Precision  expected  for  F\v   is  ~  100  MeV. 
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MAARTEN  F.  L.  GOLTERMAN 

Department  of  Physics,  Washington  University,  St.  Louis,  MO  63130,  USA 

Abstract.  The  quenched  approximation  for  QCD  is,  at  present  and  in  the  foreseeable 
future,  unavoidable  in  lattice  calculations  with  realistic  choices  of  the  lattice  spacing,  vol- 
ume and  quark  masses.  In  this  talk,  I  review  an  analytic  study  of  the  effects  of  quenching 
based  on  chiral  perturbation  theory.  Quenched  chiral  perturbation  theory  leads  to  quan- 
titative insight  on  the  difference  between  quenched  and  unquenched  QCD,  and  reveals 
clearly  some  of  the  diseases  which  are  expected  to  plague  quenched  QCD. 


1.  Introduction 

The  lattice  formulation  of  QCD  has  proven  to  be  a  powerful  tool  for  computing 
QCD  quantities  of  direct  phenomenological  interest,  such  as  hadron  masses,  decay 
constants,  weak  matrix  elements,  the  strong  coupling  constant,  etc.  (For  reviews 
see  for  instance  refs.  [1,  2],  or  the  proceedings  of  Lattice  93  [3].) 

In  order  to  perform  such  computations  numerically,  one  obviously  needs  to 
consider  a  system  with  a  finite  number  of  degrees  of  freedom,  which  is  accomplished 
by  putting  lattice  QCD  in  a  finite  box.  This  box  is  then  hopefully  large  enough  to 
accomodate  the  physics  one  is  interested  in  without  serious  finite  size  effects.  This 
leads  to  the  requirement  that  the  Compton  wavelength  of  the  particles  of  interest 
is  sufficiently  smaller  than  the  linear  dimension  of  the  box,  i.e.  the  mass  has  to  be 
large  enough  for  the  particles  to  fit  in  the  box. 

In  order  to  have  a  small  enough  lattice  spacing,  small  enough  masses  (in  partic- 
ular for  the  pion)  and  a  large  enough  box  size,  one  needs  a  large  number  of  degrees 
of  freedom  in  a  numerical  computation.  It  turns  out  that  for  QCD  with  realistic 
choices  of  the  lattice  spacing  a,  volume  V  and  the  quark  masses  (in  particular  the 
light  quark  masses),  the  presently  available  computational  power  is  not  adequate. 
The  most  severe  problem  comes  from  the  fermion  determinant,  the  logarithm  of 
which  is  a  very  nonlocal  part  of  the  gluon  effective  action  (specially  for  light  quark 
masses).  This  nonlocality  slows  down  the  Monte  Carlo  algorithms  dramatically. 

In  order  to  circumvent  this  problem,  most  numerical  computations  in  lattice 
QCD  have  been  done  in  the  quenched  approximation,  in  which  one  simply  sets  the 
fermion  determinant  equal  to  one  [4].  This  amounts  to  ignoring  all  fermion  loops 
which  occur  in  QCD  correlation  functions  (except  those  put  in  by  hand  through 
the  choice  of  operators  on  the  external  lines).  While  some  hand  waving  arguments 
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roughly  inversely  proportional  to  its  mass,  this  error  is  expected  to  be  particularly 
severe  for  quantities  involving  light  quarks.  It  appears  therefore  that  chiral  pertur- 
bation theory  ChPT  maybe  a  useful  tool  for  investigating  the  difference  between 
quenched  and  unquenched  ("full")  QCD. 

In  this  talk,  I  will  review  a  systematic  approach  to  the  study  of  the  quenched 
approximation  through  ChPT  [5,  6,  7,  8].  There  are  two  reasons  why  ChPT  is 
useful  in  this  context: 

*  It  turns  out  that  ChPT  can  be  systematically  adapted  to  describe  the  low 
energy  sector  of  quenched  QCD  [6].  It  will  therefore  give  us  nontrivial,  quantitative 
information  on  the  difference  between  quenched  and  full  QCD. 

*  ChPT  describes  the  approach  to  the  chiral  limit,  and  can  be  used  for  extrapo- 
lation of  numerical  results  to  small  masses  and  large  volumes.  If  these  results  come 
from  quenched  computations,  one  will  of  course  need  a  quenched  version  of  ChPT. 
(For  finite  volume  ChPT,  see  refs.  [9].  For  quenched  finite  volume  results,  see  refs. 
[6,  7].) 

In  this  review,  I  will  concentrate  on  the  first  point.  I  will  first  show  how  ChPT 
is  developed  for  the  quenched  approximation,  and  then  use  it  for  a  quantitative 
comparison  between  full  and  quenched  QCD.  The  quantities  that  I  will  discuss  are 
/K  /f*  [6,  8]  and  the  octet  baryon  masses  [10]. 

I  will  then  address  a  number  of  theoretical  problems  that  arise  as  a  consequence 
of  quenching.  That  such  problems  arise  is  no  surprise,  as  quenching  QCD  mutilates 
the  theory  quite  severely.  It  is  however  quite  instructive  to  see  what  the  detailed 
consequences  are. 

2.  Systematic  ChPT  for  quenched  QCD 

In  this  section  I  will  outline  the  construction  of  a  chiral  effective  action  for  the 
Goldstone  boson  sector  of  quenched  QCD  [6].  I  will  first  introduce  the  formalism, 
and  then  show  how  it  works  in  some  examples.  For  early  ideas  on  quenched  ChPT, 
see  ref.  [11,  5]. 

We  will  start  from  a  lagrangian  definition  of  euclidean  quenched  QCD.  (We 
will  restrict  ourselves  entirely  to  the  euclidean  theory  which  can  be  defined  by  a 
pathintegral.  Hamiltonian  quenched  QCD  presumably  does  not  exist.)  To  the 
usual  QCD  lagrangian  with  three  flavors  of  quarks  qa,  a  —  u,d,s,  we  add  three 
ghost  quarks  qa  with  exactly  the  same  quantum  numbers  and  masses  ma,  but  with 
opposite,  bosonic,  statistics  [11]: 


where  f>  is  the  covariant  derivative  coupling  the  quark  and  ghost  quarks  to  the 
gluon  field.  The  gluon  effective  action  produced  by  integrating  over  the  quark- 
and  ghost  quarkfields  vanishes,  since  the  fermion  determinant  of  the  quark  sector 
is  exactly  cancelled  by  that  of  the  ghost  sector.  Note  that  the  ghost  quarks  violate 
the  spin-statistics  theorem. 

We  will  now  assume  that  mesons  are  formed  as  (ghost)  quark  -  (ghost)  antiquark 


breaking.  The  Goldstone  particle  spectrum  of  quenched  QCD  will  then  contain  not 
only  qq,  but  also  qq,  qq  and  qq  bound  states.  We  will  denote  this  36-plet  by 


Note  that  the  fields  x  and  x*  describe  Goldstone  fermions. 

The  quenched  QCD  lagrangian  (1)  for  vanishing  quark  masses  has  a  much  larger 
symmetry  group  than  the  usual  U($)L  x  U($)R  flavor  group;  it  is  invariant  under 
the  graded  group  £/(3|3)L  x  *7(3|3)fi  [6],  where  17(3|3)  is  a  graded  version  of  17(6) 
since  it  mixes  the  fermion  and  boson  fields  q  and  q.  Writing  an  element  U  of  {7(3  13) 
in  block  form  as 


the  3x3  matrices  A  and  B  consist  of  commuting  numbers,  while  the  3x3  matrices 
C  and  D  consist  of  anticommuting  numbers. 

We  can  now  construct  a  low  energy  effective  action  for  the  Goldstone  modes 
along  the  usual  lines.  We  introduce  the  unitary  field 

E  =  czp(2t*//),  (4) 

which  transforms  as  £  —  »•  UL%UR  with  UL  and  UR  elements  of  £/(3|3).  Because  we 
are  dealing  here  with  a  graded  group,  in  order  to  build  invariants,  we  need  to  use 
the  supertrace  sir  and  the  superdeterminant  sdet  instead  of  the  normal  trace  and 
determinant,  with  [12] 


=tr(A)-tr(B), 
sdet(U)     =  exp(str\og(U})  =  det(A  -  CB-lD)/det(B).     (strsdet)  (5) 

To  lowest  order  in  the  derivative  expansion,  and  to  lowest  order  in  the  quark  masses, 
the  chiral  effective  lagrangian  consistent  with  our  graded  symmetry  group  is 


£0  =  i-strtfpVdn)  -  v  str(MZ  +  A4E),  (6) 

o 

where  .M  is  the  quark  mass  matrix 

mu 

/  and  t;  are  bare  coupling  constants  which  are  not  yet  determined  at  this  stage. 

The  symmetry  group  is  broken  by  the  anomaly  to  the  smaller  group 
[5(7(3|3)r,  x  5i7(3|3)R]®C/(l)  (the  semidirect  product  arises  as  a  consequence  of 
the  graded  nature  of  the  groups  involved;  the  details  are  irrelevant  for  this  talk). 
S!7(3|3)  consists  of  all  elements  U  G  Z7(3|3)  with  sdet(U)  =  1.  The  anomalous  field 
is  $o  =  (*?'  —  *?')/  v^,  where  the  relative  minus  sign  comes  from  the  fact  that  in  order 
to  get  a  nonvanishing  triangle  diagram,  one  needs  to  choose  opposite  explicit  signs 
for  the  quark  and  ghost  quark  loops,  due  to  the  different  statistics  of  these  fields. 
T]'  is  the  field  describing  the  normal  if  particle,  while  if  is  the  ghost  rf  consisting  of 
ghost  quarks  and  ghost  antiquarks.  We  will  call  the  field  $o  the  super-?/  field.  The 


and  we  should  include  arbitrary  functions  of  this  field  in  our  effective  lagrangian. 
Following  ref.  [13],  the  correct  chiral  effective  lagrangian  is 


where  the  function  multiplying  i  str(M^—  Mtf)  can  be  chosen  equal  to  zero  after 
a  field  redefinition.  This  lagrangian  describes  quenched  ChPT  systematically,  as 
we  will  show  now  with  a  few  examples. 

For  our  first  example,  let  us  isolate  just  the  quadratic  terms  for  the  fields  rf  and 
if,  choosing  degenerate  quark  masses  for  simplicity.  We  expand 


Vo($0)     =  constant  +  /*2$§  -f  .  .  .  ,  (9) 

Vs($o)  =  «+..., 

and  obtain 


\f(if  -  r?')2  +  .  .  .  ,  (10) 

where  m2  =  8mv//2.  The  relative  minus  signs  between  the  77'  and  77'  terms  in  (10) 
come  from  the  supertraces  in  (8),  and  are  related  to  the  graded  nature  of  the  chiral 
symmetry  group  of  quenched  QCD. 

The  inverse  propagator  in  momentum  space, 


clearly  cannot  be  diagonalized  in  a  p  independent  way,  which  is  quite  different  from 
what  one  would  expect  from  a  normal  field  theory!  Treating  the  /*2  +  ap2  term  as  a 
twopoint  vertex,  one  can  easily  show  that  the  repetition  of  this  vertex  on  one  meson 

line  vanishes,  due  to  the  fact  that  the  propagator  matrix  (  n  )  multiplied 

\°     ~1/ 
/  i      _i\ 

on  both  sides  by  the  vertex  matrix  (      .  1  gives  zero.  This  result  coincides 

exactly  with  what  one  would  expect  from  the  quark  flow  picture  for  rj'  propagation, 
as  depicted  in  fig.  1.  The  straight-  through  and  double  hairpin  contributions  do 
not  contain  any  virtual  quark  loops,  and  are  therefore  present  in  the  quenched 
approximation.  All  other  contributions  should  vanish  because  they  do  contain 
virtual  quark  loops,  and  this  is  exactly  what  happens  as  a  consequence  of  the 
(admittedly  strange)  Feynman  rules  for  the  propagator  in  the  rf-rf  sector!  This 
propagator  is  given  by  the  inverse  of  (11)  and  reads 


1        (I      0}        S  +  QP*    (1     1\ 
p>  +  ml\0     -l)      (j?  +  mW\l     l)' 

in  which  the  two  terms  correspond  to  the  two  first  diagrams  in  fig.  1. 
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Figure  1.  Quark  flow  diagrams  for  the  if'  propagator  in  full  QCD 

From  (12)  we  learn  several  things.  First,  because  /*2,  which  in  full  ChPT  would 
correspond  to  the  singlet  part  of  the  i/  mass,  appears  in  the  numerator,  we  need 
to  keep  the  if  (and  its  ghost  partner)  in  quenched  ChPT:  it  cannot  be  decoupled 
by  taking  p2  large.  Second,  this  "propagator"  is  definitely  sick,  due  to  the  double 
pole  term.  It  should  be  stressed  here  that  this  double  pole  term  is  an  unescapable 
consequence  of  quenched  QCD,  and  does  not  result  from  our  way  of  setting  up 
chirai  perturbation  theory.  In  the  case  of  nondegenerate  quark  masses,  this  double 
pole  also  shows  up  in  the  T°  and  rj  propagators,  due  to  mixing  with  the  rf  '.  I  will 
return  to  these  strange  properties  of  quenched  QCD  in  section  4. 


N 

\ 
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Figure  2.  One  loop  pion  selfenergy  in  quenched  ChPT. 

As  a  second  example,  we  will  consider  the  (charged)  pion  selfenergy  at  one  loop, 
again  with  degenerate  quark  masses.  I  will  set  a  —  0  for  simplicity.  At  one  loop, 
the  pion  selfenergy  only  contains  tadpoles,  with  either  <f>  or  x  lmes  (c/  (2))  on  the 
loop.  Also,  on  the  <f>  loop,  one  can  have  an  arbitrary  number  of  insertions  of  the 
vertex  fi?  if  the  internal  (f>  line  is  an  517(3)  singlet.  These  various  contributions  are 
drawn  in  fig.  2,  where  a  solid  line  denotes  a  <f>  line,  a  dashed  line  denotes  a  x  lme> 
and  a  cross  denotes  a  p2  vertex.  One  finds  that  the  diagrams  with  the  (f>  and  x 
lines  on  the  loop  without  any  crosses  cancel,  and  then,  of  course,  that  the  diagrams 
with  more  than  one  cross  vanish,  using  our  earlier  result  for  the  rf-ff  propagator. 
We  are  left  with  only  one  term,  and  the  result  is 


[ 
J 


d4k 


The  pion  selfenergy  is  logarithmically  divergent,  but  the  origin  of  this  divergence 
is  completely  different  from  those  that  arise  in  the  unquenched  theory,  as  it  is 
proportional  to  /*2.  One  can  easily  convince  oneself  that  the  diagrams  in  fig.  2  which 


•^j^v-w  VL  ui.^  1.1111.01  <u.*.jjfcuioi<_>u  in  4u«;u<-ueu  v^ur  x 

an  expansion  in  the  pion  mass  (see  e.g.  ref.  [14]).  However,  as  we  have  argued 
above,  in  quenched  ChPT  there  is  unavoidably  another  mass  scale,  namely  the 
singlet  part  of  the  rf  mass,  p2.  For  our  expansion  to  be  systematic  as  an  expansion 
in  the  pion  mass,  we  would  have  to  sum  up  all  orders  in  ji2,  at  a  fixed  order  in  the 
pion  mass.  This  is  clearly  a  formidable  task.  In  order  to  avoid  this  complication  in  a 
systematic  way,  we  can  think  of  ^2/3  (which  turns  out  to  be  the  natural  parameter 
as  it  appears  in  the  chiral  expansion)  as  an  independent  small  parameter.  To  check 
whether  this  makes  any  sense,  one  may  note  that  from  the  experimental  value  of  the 
T)'  mass  one  obtains  a  value  j*2/3  «  (500  MeV)2,  which  is  roughly  equal  to  the  kaon 
mass  squared,  m^.  Of  course,  for  quenched  QCD  the  parameter  p2  need  not  have 
the  same  value,  after  all  quenched  QCD  is  a  different  theory.  A  lattice  computation 
of  this  parameter  [15]  gives  /^uenched/^?ull  w  °-75-  (<*  can  be  estimated  from  rf-rf 
mixing,  and  is  very  small.)  Finally,  one  may  also  note  that  both  p*  and  a  are  of 
order  l/Ne,  where  Nc  is  the  number  of  colors  [16].  I  will  return  to  this  point  in 
section  4. 

3.  Quantitative  comparison  of  quenched  and  full  ChPT 

Let  us  first  consider  the  quenched  result  for  the  ratio  of  the  kaon  and  pion  decay 
constants  fK  and  /,  [6,  8].  I  will  set  a  =  0  and  take  mu  =  md  =  m: 

V~10°P         ,  .    /*2/3     [         mj-         .      /2mJ-      A      ,1     ,  w  ,1/o 

-  I  =  1+   *:  '  K     ..log    —  f  -1     -1  +(m--m)L.(14) 

/  quenched  165T2/?  L2(™K  ~  ™2*}        \™*         )        J 

L  is  a  certain  combination  of  "low  energy  constants"  [13].  Since  this  constant  is 
a  bare  parameter  of  the  quenched  chiral  lagrangian,  the  result  (14)  is  not  directly 
comparable  to  the  equivalent  result  for  the  full  theory.  In  other  words,  in  order  to 
compare  quenched  and  full  QCD,  we  have  to  consider  quantities  which  are  indepen- 
dent of  the  bare  parameters  of  the  effective  action.  (Alternatively,  we  would  need 
to  extract  the  values  of  the  bare  parameters  from  some  independent  measurement 
or  lattice  computation,  in  this  case,  we  would  need  independent  determinations 
of  L  in  both  quenched  and  full  QCD.)  In  the  full  theory,  (fr,fi/3)/f%3  is  such  a 
quantity  [13],  but  in  the  quenched  theory  this  quantity  is  not  well  defined,  due  to 
the  double  poles  which  occur  in  the  propagators  of  neutral  mesons. 

We  will  therefore  choose  to  consider  a  slightly  different  theory,  in  which  suffi- 
ciently many  decay  constants  referring  only  to  charged  (i.e.  off-diagonal)  mesons 
are  present  [8].  This  theory  is  a  theory  with  two  light  quarks  mu  =  m^  =  m  and 
two  heavy  quarks  m,  =  mt>  •=.  m'  .  This  theory  contains  a  ud  pion  TT,  an  s'J  pion 
TT'  and  a  tis  kaon  K,  with  mass  relation 
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How  good  ia  the  quenched  approximation  of  QCD? 

One  can  show  that  the  ratio  ffc/Vfxf*1  w  independent  of  the  low  energy  constants 
L.  For  the  quenched  theory  we  find 


K,  - 

whereas  in  the  full  theory 


Note  again  that  the  logarithms  in  the  quenched  and  unquenched  expressions  are 
completely  different  in  origin. 

We  may  now  compare  these  two  expressions  using  "real  world"  data,  where 
we'll  determine  the  value  of  the  TT'  mass  from  the  mass  relation  (15).  With  mT  = 
140  MeV,  mK  =  494  MeV  and  /^2/3  =  0.75  x  (500  MeV)2  we  find 

,          xl-loop 

-^— )  =1.049, 

/  quenched 
1  — loop 

=  1.023,  (18) 


a  difference  of  3%.  If  we  choose  /*2/3  =  (500  MeV)2,  we  find  a  difference  of  about 
4%.  This  difference  is  small.  Note  however,  that  this  is  due  to  the  fact  that  for  this 
particular  ratio,  ChPT  seems  to  work  very  well,  both  for  the  full  and  the  quenched 
theories.  If  one  only  considers  the  size  of  the  one  loop  corrections  (the  numbers 
behind  the  decimal  point),  the  quenched  and  full  results  are  very  different.  It  is 
also  possible,  and  in  fact  not  unlikely,  that  part  of  the  difference  between  the  full 
and  quenched  theory  gets  "washed  out"  by  the  fact  that  we  are  considering  a  "ratio 
of  ratios"  .  It  follows  that  the  relative  difference  is  a  lower  bound  on  the  difference 
between  the  quenched  and  full  values  of  the  decay  constants.  For  another  quantity 
for  which  the  difference  between  quenched  and  full  ChPT  has  been  calculated,  see 
ref.  [8]. 

Next,  I  will  review  some  very  recent  work  on  baryons  in  quenched  ChPT  by 
Labrenz  and  Sharpe  [10].  They  calculated  the  one  loop  corrections  to  the  octet 
baryon  mass  coming  from  the  cloud  of  Goldstone  mesons.  They  employed  an  ef- 
fective lagrangian  for  quenched  heavy  baryon  ChPT,  constructed  using  the  same 
techniques  as  described  in  section  2.  In  the  case  of  degenerate  quark  masses,  the 
result  for  the  nucleon  mass  is 

mN  =      m  -  f  (D  -  3F)2^mT  +  1(bD  -  36F)mJ 

37)  +  §«(£>-  3F)2]          .  (19) 


In  this  equation,  m,  D,  F,  60,  &F  and  7  are  bare  parameters  which  occur  in  the 
baryon  effective  action,  m  is  the  bare  "average"  octet  mass,  D  and  F  are  the  well 
known  baryon-meson  couplings,  &#  and  bp  are  low  energy  constant  which  arise 


a  cross  on  the  <f>  internal  line,  »'.  t.  an  insertion  of  the  /w2  twopoint  vertex.  Note  that 
in  this  case  there  are  also  one  loop  corrections  not  involving  /*2  which  survive  the 
quenched  approximation,  in  contrast  to  the  pion  selfenergy,  (13),  or  /*•//*,  (14). 
The  authors  of  ref.  [10]  then  calculated  the  coefficients  using  full  QCD  values  for 
the  various  parameters  (from  ref.  [19]).  With  a  =  0  and  7  =  0  (7  =  0  is  consistent 
with  available  information,  which  however  is  limited  [17]),  they  obtained 

mjv  =  0.97  -  0.5~mT  +  3.4m2  -  1.5mJ,  (20) 

\}.Z 

with  6  =  /i2/(24*2/J)  and  6  «  0.2  for  the  full  theory. 

In  ref.  [10],  (19)  was  also  compared  to  recent  numerical  results  from  ref.  [20]. 
These  data  are  presented  in  fig.  3,  where  the  scale  a"1  =  1.63  GeV  is  set  by  fr 
[10].  If  one  calculates  the  coefficients  in  (19)  by  "fitting"  the  four  data  points,  one 
finds 

mN  =  0.96  -1.0mT  +  3.6m2,  -2.0mJ.  (21) 

This  is  only  four  data  points  for  four  parameters,  and  the  "fit"  is  quite  sensitive  to 
for  instance  an  additional  m£  term.  From  the  agreement  between  (20)  and  (21)  it 
appears  that  it  is  reasonable  to  apply  ChPT  to  the  results  of  ref.  [20].  Note  that 
the  individual  terms  in  (20)  are  quite  large  for  the  two  higher  pion  masses  in  fig.  3 
(this  is  not  unlike  the  case  of  unquenched  ChPT).  From  fig.  3  it  is  also  clear  that 
(™//r)quenched  7^  (*"//*•  )fuii  because  of  the  term  linear  in  (19),  which  is  absent  in 
full  ChPT. 

Labrenz  and  Sharpe  then  went  on  to  consider  octet  mass  splittings.  In  order  to 
remove  effects  which  can  be  accomodated  by  a  change  of  scale,  they  calculated  the 
ratios 

^  =  ^1'     M'  =  tf,A,E,S  (22) 

mj 

in  quenched  ChPT,  and  compared  these  with  similar  ratios  obtained  from  ref.  [19]. 
They  assumed  that  all  bare  parameters  in  the  equations  for  the  octet  masses  (for 
explicit  expressions,  see  their  paper)  are  equal  in  the  full  and  quenched  theory,  and 
then  calculated  the  ratios 

oquenched 

•  (23) 


With  the  assumption  that  the  bare  parameters  of  the  quenched  and  full  theories 
are  equal,  ID  and  bp  drop  out  of  the  ratios,  and  with  7  =  0,  a  =  0  and  D  and  F 
equal  to  their  full  QCD  values,  they  obtain 


=  1  +  0.19(6/0.2)  +  0.13  =  1.31(1.27]  for  6  =  0.2(0.15], 

=  1  -  0.46(6/0.2)  +  0.43  =  0.97(1.09]  for  6  =  0.2(0.15],  (24) 

=  1  -  0.39(6/0.2)  +  0.26  =  0.87(0.97]  for  6  =  0.2(0.15]. 

(The  choice  8  —  0.15  corresponds  to  the  value  reported  in  ref.  [15].) 


0.0 


Figure  3.   The  nucleon  mass  from  the  lattice  [20]  (copied  from  ref.   [10]).   The  curve  is 


from  a  fit  to  the  form 


=  m  +  am*  +  bm%.  -f 


From  this,  one  would  conclude  that  one  can  expect  errors  from  quenching  of  at 
least  20%  in  the  octet  splittings.  These  differences  between  the  quenched  and  full 
theories  cannot  be  compensated  for  by  a  change  in  scale  between  quenched  and  full 
QCD. 

At  this  point  I  would  like  to  comment  on  the  above  mentioned  assumption  that 
was  used  in  order  to  obtain  (24).  Let  us  consider  in  particular  the  parameters  6/> 
ancLk/r^Jdiey  correspond  to  higher  derivative  terms  in  the  baryon-meson  effective 
action,  and  are  needed  in  order  to  absorb  the  UV  divergences  which  arise  at  one 
loop  in  ChPT.  Since  the  size  of  these  divergences  is  in  principle  different  between 
the  full  and  quenched  theories,  one  expects  that  &£>iquenched  and  ^.quenched  can  be 
different  from  bj},fu\\  and  &F,fuii-  If  we  want  to  proceed  without  assuming  that  the 
quenched  and  full  6's  are  equal,  we  have  to  consider  ratios  of  quantities  independent 
of  the  parameters  &£>  and  bp.  The  situation  is  essentially  the  same  as  in  the  case 
of  /*•//*••  From  the  available  results  [10],  only  one  ratio  independent  of  bjj  and  bp 
can  be  formed: 

(25) 


deviation  from  the  Gell-Mann-Okubo  formula  (cf.  ref.  [19]  for  the  full  theory). 
Setting  mr  —  0  keeping  only  mj(  as  in  ref.  [10],  one  finds 


(26) 

(The  parameter  7  drops  out  of  this  particular  combination  and  we  again  take  Of  = 
0.)  These  quantities  still  depend  on  the  other  bare  parameters,  D,  F  and  m. 
Again,  they  could  be  different  in  the  quenched  and  full  theories,  and  I  will  leave 
the  quenched  values  as  free  parameters.  Substituting  mK  —  495  MeV,  /*•  =  132 
MeV,  Dfuii  =  0.75,  Ffun  =  0.5  and  mfull  =  1  GeV  [19]  finally  gives 

)2  _  3  p2\ 

=  1-0.0214+    0.293--  -0.306 


0.2       '      J    mquenched/l  GeV 

For  any  reasonable  values  of  m  and  6,  and  for  (D2  —  3F2)  uenched  n°t- 
from  its  full  theory  value  of  —0.1875,  the  difference  between  the  quenched  and  full 
theories  as  measured  by  the  ratio  ^quenched /^Vuii  is  not  more  than  a  few  percent.  Of 
course  the  same  comment  that  applied  in  the  case  of  /K/f*  applies  here,  that  part  of 
the  difference  may  have  been  washed  out  by  taking  "ratios  of  ratios".  Summarizing, 
the  conclusion  of  this  analysis  seems  to  be  that  the  error  from  quenching  for  octet 
baryon  masses  is  at  least  a  few  percent,  and  could  be  as  much  as  20%. 

4.  A  sickness  of  quenched  QCD 

Let  us  again  consider  the  quenched  result  for  /#//*•,  (14),  as  a  function  of  the 
quark  masses  (using  treelevel  relations  between  meson  masses  and  quark  masses), 

(JK  \                   ,   .     p  /3     I    mu  +  ms  mt        1      ,  .. 

~T  J  =  1  +  1^2  ri  \v~, —rlog----  1    +(L-terms). 

/»  /  quenched  loir*/,    LAm»  ~  "M          ™«          J 

From  this  expression  it  is  clear  that  we  cannot  take  mu  -*  0  keeping  m,  fixed,  or, 
to  put  it  differently,  that  if  we  take  both  mu  and  m,  to  zero  keeping  the  ratio  fixed, 
the  limit  depends  on  this  ratio,  and  is  not  equal  to  one!  This  is  quite  unlike  the  case 
of  full  ChPT,  where  one  can  take  any  quark  mass  to  zero  uniformly,  and  deviations 
from  S?7(3)  symmetry  due  to  this  quark  mass  vanish  in  this  limit.  Technically, 
the  reason  for  this  strange  behavior  is  that  there  is  another  mass  ^,  which,  as  we 
argued  before,  cannot  be  avoided  in  quenched  ChPT.  This  mass  is  related  to  the 
singlet  part  of  the  17'  mass,  and  is  not  a  free  parameter  of  the  theory.  Even  if  we 
do  not  consider  any  Green's  functions  with  rf  external  lines,  this  mass  shows  up 
through  the  double  pole  term  in  12  on  internal  lines.  Because  of  the  double  pole, 
such  contributions  can  lead  to  new  infrared  divergences  in  the  mT  — »•  0  limit.  This 
problem  with  the  chiraHimit  of  quenched  ChPT  shows  up  in  other  quantities,  such 
as  meson  masses  and  {V>V>}  [6,  7,  21], 

A  question  one  might  ask  is  whether  this  problem  is  an  artifact  of  one  loop 
quenched  ChPT  [8].    For  instance,  if  we  would  sum  all  contributions  to  the  rf 
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improving  the  infrared  behavior  of  diagrams  in  which  the  double  pole  terms  appear. 
Let  us  address  this  question  in  the  chiral  limit,  roa  =  0,  where  the  problem  is  most 
severe.  In  the  full  theory,  we  can  write  the  fully  dressed  rf  propagator  as 

Z(p) 


and  define  ^(p)  =  S(p),  which  onshell  is  the  17'  mass  in  the  chiral  limit.  Likewise, 
in  the  quenched  theory  we  can  write  the  dressed  rf  ,  fj'  propagator  as 


0]' 


which  defines  /ig(p).  To  leading  order  in  1/NC,  these  two  definitions  of  /*2(p)  should 
lead  to  the  same  result: 

+o(-^)y  (30) 

Vvc// 

We  also  believe  that  f*2F(p  ~  0)  is  not  equal  to  zero,  since  we  expect  the  if  to  remain 
a  well-behaved,  massive  particle  in  the  chiral  limit.  This  implies,  in  sofar  as  we  can 
rely  on  the  large  Nc  expansion,  that  ^g(O)  ^  0,  and  that  the  double  pole  in  (12)  is 
a  true  feature  of  the  theory. 

While  this  argument  is  not  very  rigorous,  I  believe  that  the  foregoing  discussion 
implies  that  the  chiral  limit  of  quenched  QCD  really  does  not  exist.  This  believe  is 
futhermore  supported  by  the  following  remarks: 

«  Sharpe  [7]  has  summed  a  class  of  diagrams  in  the  case  of  degenerate  quark 
masses  for  a  very  simple  quantity  (the  pion  mass),  and  found  a  result  that  is  actually 
more  divergent  than  the  one  loop  result. 

«  With  nondegenerate  quark  masses  there  are  many  more  diagrams  that  are 
infrared  divergent  in  the  chiral  limit,  and  it  is  even  less  probable  that  resummation 
will  improve  the  situation. 

®  Any  mechanism  improving  the  infrared  behavior  would  have  to  work  for  each 
divergent  quantity.  One  expects  that  such  a  mechanism  would  be  related  to  the 
double  pole  term  in  the  rf  propagator,  which  created  the  problem  in  the  first  place. 
But  this  seems  unlikely  in  view  of  the  arguments  given  above. 

®  The  bare  quark  mass  parameter  appearing  in  the  chiral  effective  action  is 
not  the  same  as  that  appearing  in  the  (unrenormalized)  QCD  lagrangian.  But  one 
can  argue  that  the  two  bare  quark  masses  should  be  analytically  related,  and  the 
infrared  problem  is  not  just  a  problem  of  quenched  ChPT,  but  of  quenched  QCD. 

5.  Conclusion 

The  quenched  approximation  leads  to  an  unknown  systematic  error  in  all  lattice 
calculations  that  employ  this  approximation.  It  would  of  course  be  very  nice  to  have 
a  parameter  that  interpolates  between  full  and  quenched  QCD,  and  in  principle 
the  quark  masses  could  play  such  a  role,  since  one  expects  that  quenched  QCD 
corresponds  to  full  QCD  with  very  heavy  quarks.  One  would  have  to  distinguish 
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Quenched  QvJL)  can  be  denned  irom  a  euclidean  patnmtegral  as  rigorously  as 
full  QCD.  In  this  talk  I  have  explained  that  euclidean  quenched  ChPT  can  be  used 
as  a  tool  for  a  systematic  investigation  of  quenched  QCD.  Quenched  ChPT  does 
not  quite  accomplish  a  task  equivalent  to  that  of  an  interpolating  parameter.  Since 
the  bare  parameters  appearing  in  the  quenched  and  full  chiral  effective  actions  are 
not  the  same  (as  explained  in  section  3)  one  cannot  directly  compare  quantities 
calculated  in  full  and  in  quenched  ChPT.  However,  one  can  calculate  combinations 
of  physical  quantities  which  do  not  depend  on  the  bare  parameters,  and  in  that 
case  a  direct  comparison  between  quenched  and  full  QCD  is  possible,  as  we  demon- 
strated with  an  example  involving  meson  decay  constants.  This  makes  it  possible 
to  estimate  lower  bounds  on  the  differences  which  come  from  quenching;  these  es- 
timates are  dependent  on  the  values  of  the  meson  masses,  which  can  be  taken  to 
be  the  (known)  independent  parameters  of  the  theory.  For  realistic  values  of  these 
masses,  such  differences  turn  out  to  be  of  the  order  of  a  few  percent  for  ratios  of 
decay  constants  and  for  baryon  octet  splittings. 

The  disadvantage  of  this  more  conservative  approach  is  that  part  of  the  dif- 
ference maybe  hidden,  because  these  specific  combinations  of  physical  quantities 
maybe  less  sensitive  to  the  effects  of  quenching  than  other  quantities  of  interest. 
This  is  particularly  clear  in  the  example  of  baryon  octet  masses.  In  this  case, 
a  comparison  based  on  the  assumption  that  the  bare  parameters  of  the  full  and 
quenched  effective  theories  are  the  same,  lead  to  differences  of  up  to  20%  and  more. 
Of  course,  it  is  not  known  to  what  extend  this  assumption  is  valid. 

The  differences  between  the  quenched  and  full  theories  become  markedly  larger 
for  decreasing  quark  masses.  This  is  due  to  the  fact  that  new  infrared  divergences 
occur  in  quenched  QCD,  which  do  not  have  a  counterpart  in  full  QCD.  These 
divergences  lead  to  the  nonexistence  of  the  chiral  limit  for  quenched  ChPT  (as 
discussed  in  section  4).  The  origin  of  this  phenomenon  can  be  traced  to  the  special 
role  of  the  rf  in  the  quenched  approximation.  In  the  quenched  approximation, 
the  if  is  a  Goldstone  boson  (it  develops  massless  poles  in  the  chiral  limit),  but  an 
additional  double  pole  term  arises  in  its  propagator,  rendering  it  a  "sick"  particle. 
For  nondegenerate  quark  masses  this  problem  is  also  inherited  by  the  T°  and  the 
77.  In  section  4  I  argued  that  the  nonexistence  of  the  chiral  limit  is  a  fundamental 
feature  of  quenched  QCD. 

In  principle  therefore,  the  chiral  expansion  breaks  down  for  quenched  QCD. 
For  very  small  quark  masses,  at  fixed  values  of  the  singlet  part  of  the  rf  mass  /i2, 
the  expansion  becomes  unreliable.  In  order  to  make  progress,  one  may  take  the 
expansion  to  be  an  expansion  in  ^2/3  (which  was  shown  to  be  roughly  equal  to 
nix  phenomenologically),  with  coefficients  which  are  functions  of  the  quark  mass. 
These  functions  then  can  be  expanded  in  terms  of  the  quark  masses,  sometimes 
leading  to  divergent  behavior  of  the  leading  term  (e.g.  the  one  loop  correction  to 
frc/fr}-  If  such  divergences  occur,  the  expansion  is  only  valid  for  a  range  of  quark 
masses  which  are  neither  too  small,  nor  too  large.  It  would  be  interesting  to  see 
whether  this  point  of  view  can  be  made  solid. 

It  is  in  principle  interesting  to  study  any  quantity  which  is  being  computed  in 
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(meson  masses,  decay  constants,  condensates  and  the  kaon  B  parameter  have  been 
calculated  [6,  5,  8]).  As  discussed,  this  includes  not  only  Goldstone  meson  physics 
per  se,  but  also  chiral  corrections  to  baryon  masses  [10],  and  for  the  same  reason, 
to  mesons  containing  heavy  quarks. 

Recently,  also  attempts  have  been  made  to  compute  pion  and  micleon  scattering 
lengths  [23,  24]  from  quenched  lattice  QCD.  If  one  tries  to  calculate  the  /  =  0  pion 
scattering  amplitude  in  quenched  ChPT,  one  actually  finds  that  the  imaginary  part 
is  divergent  at  threshold,  even  for  non vanishing  pion  mass  [25]!  Again,  this  can  be 
related  to  double  pole  terms  in  the  t]'  propagator.  An  additional  reason  is  that  ap- 
parently euclidean  quenched  correlation  functions  in  general  cannot  be  analytically 
continued  to  Minkowski  space-time.  (The  euclidean  four  pion  correlation  functions 
are  well  defined.)  Further  work  is  needed  on  pion  scattering  lengths. 
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Polarised  structure  functions 

D.  INDUMATHI 

Physical  Research  Laboratory,  Navrangpura,  Ahmedabad  380  009. 

Abstract.  This  talk  discusses  the  current  understanding  in  some  aspects  of  polarised 
lepton-nucleon  deep  inelastic  scattering.  The  first  section  focuses  on  the  non-singlet  part 
of  available  spin  dependent  structure  function  data.  The  results,  though  consistent  with 
theoretical  predictions  in  this  sector,  are  extremely  sensitive  to  extrapolation  of  the  data 
to  the  x  — »•  1  region  and  seem  to  indicate  that  the  quarks  do  carry  a  very  small  fraction 
of  the  nucleon  spin.  We  then  review  the  formalism  of  DIS  processes  with  emphasis  on  the 
gluon  sector. 


In  this  talk  we  focus  on  some  aspects  of  polarisation  phenomena  in  deep  inelastic 
scattering  (DIS)  of  high  energy  leptons  (e,  /*)  off  a  nucleon  target  through  vir- 
tual photon  exchange.  The  energy  involved  in  the  process  is  such  that  the  target 
completely  breaks  up.  The  debris  remains  unobserved;  only  the  lepton  is  detected 
after  the  collision  (its  energy  and  angle  of  scattering).  If  both  the  initial  particles 
are  polarised  (here,  longitudinally,  along  the  beam  axis),  then  the  lepton  probes 
the  internal  spin  structure  of  the  nucleon.  Hence  such  experiments  can  shed  light 
on  the  distribution  of  the  nucleon  spin  in  terms  of  that  of  its  component  partons 
(quarks  and  gluons). 

Section  1  discusses  the  current  data  and  the  phenomenology  of  polarised  DIS 
with  emphasis  on  the  nonsinglet  sector.  While  the  data  turn  out  to  be  consistent 
with  theoretical  expectations  in  this  sector,  the  total  contribution  of  quarks  to  the 
spin  of  the  proton  is  small.  There  is  thus  a  possibly  significant  gluon  contribution 
to  the  proton  spin.  Section  2  therefore  addresses  the  question  of  the  gluonic  con- 
tribution to  the  polarised  DIS  cross  section.  This  section  is  basically  a  pedagogical 
review  of  the  formalism  of  such  processes.  We  establish  that  the  gluon  contribution 
to  the  spin  of  the  proton  is  well  defined,  though  it  cannot  be  extracted  from  data 
in  polarised  DIS. 

1.  Polarised  Deep  Inelastic  Scattering 


1.1  The  Formalism 

The  European  Muon  Collaboration  (EMC)  measured  the  asymmetry  with  respect 
to  the  polarised  pp  Deep  Inelastic  Scattering  (DIS)  cross  sections  when  the  muon 
and  proton  spins  were  parallel  and  antiparallel  respectively.  Theoretically,  this 
asymmetry  is  given  in  term&  of  the  spin  dependent  and  spin  independent  proton 


airuciure  luncuions 


Here,  x,  as  usual,  is  the  Bjorken  scaling  variable,  x  =  Q2/(2p  •  g),  where  Q2  =  —9  , 
p  and  q  are  the  4-momenta  of  the  virtual  photon  and  the  proton  in  the  DIS  process 
and  0  <  x  <  1.  The  SMC  and  E142  measured  the  asymmetries, 


where  the  quantities  are  correspondingly  defined  for  the  deuteron  and  neutron. 
The  spin  dependent  structure  functions  are  then  extracted  as  the  denominators  are 
known.  These  structure  functions  are  a  measure  of  the  distribution  of  the  nucleon 
spin  in  terms  of  that  of  the  partons. 

There  are  other  structure  functions  occurring  not  only  in  polarised  DIS  exper- 
iments but  also  in  other  processes  such  as  Drell  Yan  in  polarised  pp  collisions. 
These  have  recently  been  the  subject  of  intense  study.  For  want  of  adequate  time, 
however,  we  do  not  discuss  them  here  [1], 

The  structure  functions  of  interest  can  be  expressed  in  terms  of  parton  densities 
(to  leading  order)  as 


where  the  summation  runs  over  both  quark  as  well  as  antiquark  flavours  and  q  and 
q  are  the  spin  summed  and  spin  dependent  quark  densities,  x  is  then  interpreted  as 
the  fraction  of  nucleon  momentum  carried  by  that  parton.  The  Bjorken  sum  rule 
relates  the  moments  of  the  proton  and  neutron  spin  dependent  structure  functions. 
To  <9(a,),  we  have 


Note  that  higher  order  (through  O(a,)3)  corrections  have  been  calculated  and  are 
less  than  2%  at  the  scale  of  interest,  Q2  =  5  GeV2. 

1.2  The  data 

The  five  year  old  data  on  the  spin  dependent  proton  structure  function  [2]  has 
recently  been  augmented  by  data  from  the  Spin  Muon  Collaboration  (SMC)  [3] 
and  the  E142  Collaboration  [4]  on  spin  dependent  deuteron  and  neutron  structure 
functions  respectively. 

The  new  information  gained  from  these  experiments  is  in  the  non-singlet  sector 
since  these  experiments  probe  three  different  combinations  of  the  two  (u  and  d) 
valence  densities  in  a  nucleon.  In  fact,  the  problem  is  over  specified.  Hence  we  can 
get  stringent  bounds  not  only  on  the  Bjorken  Sum  Rule  [5]  (which  involves  moments 
of  non-smglet  densities)  but  also  on  the  z-dependence  of  these  non-singlet  densities. 


156 


Pramana-  J.  Phys.,  Supplement  Issue.  loan 


equations,  i.e.,  the  x-dependent  unpolarised  structure  functions,  F\. 

Recent  data  in  the  unpolarised  sector  from  both  NMC  [6]  and  HERA  [7]  have 
radically  changed  our  understanding  of  the  small-x  behaviour  of  spin  independent 
structure  functions  [8].  This  will  affect  the  extraction  of  the  spin  dependent  func- 
tions from  the  asymmetry  data.  The  EMC  used  the  old  EMC  F-%  data  and  obtained 
F\  by  setting  the  ratio,  R  (of  longitudinal  to  transverse  photoproduction  cross  sec- 
tions), to  zero  while  the  E142  collaboration  used  their  older  FI  and  R  data.  (Specif- 
ically, FI  =  F2/(2x(l  +  R)}.)  The  NMC  have  used  their  own  current  F2  data  as 
well  as  corrected  for  non-zero  R.  Furthermore,  all  the  experiments  are  at  different 
average  Q2  values.  Table  1  shows  the  results  as  announced  by  the  various  groups, 
in  comparison  with  theoretical  predictions. 


Table  1.  The  results  of  various  polarised  DIS  experiments:  the  moment  of  g\  vs  the  Ellis 
Jaffe  [9]  prediction  and  the  total  quark  contribution  (2AE  =  A«  -f  Ad  +  As)  to 
the  proton  spin  are  listed.  Also  shown  are  the  Bjorken  Sum  Rule  [5]  computations 
using  two  data  sets  at  a  time. 


Collab.  fc 
Target(Q2) 

fgidx 
measured/EJSR 

2  AE 

EMC 
p(10GeV2) 

0.126  ±  0.01  ±0.015/ 
0.189  ±0.005 

0.12  ±  0.09  ± 

0.13 

SMC 
D(5GeV2) 

0.023  ±  0.02  ±0.015/ 
0.094  ±0.005 

0.06  ±  0.2  ±0 

.15 

E142 
n(2GeV2) 

-0.022  ±0.011/ 
-0.021  ±0.018 

0.57  ±0.11 

B.S.R.  (Measured/Predicted) 

EMC  +  SMC 
PD(5GeV2) 

0.20  ±  0.05  ±  0.04/0.189  ±  0.005 

SMC  -f  E142 
Dn(2GeV2) 

0.146  ±0.021/0.183  ±0.007 

While  the  combined  EMC/SMC  data  are  essentially  in  agreement  with  the  Bjorken 
sum  rule,  the  EMC/E142  data  were  found  to  differ  by  nearly  two  standard  devia- 
tions. Various  reanalyzes  have  been  done,  taking  into  consideration  the  fact  that 
these  data  are  at  different  Q2  values  and  cannot  be  combined  per  se  and  also  by 
including  higher  twist  effects.  Recently,  the  SMC  [10]  has  done  a  combined  analysis 
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of  all  these  data.  They  claim  that  there  is  no  conflict  between  the  data;  further- 
mote,  that  the  Bjorken  Sum  Rule  is  confirmed  to  within  17%  uncertainty  using 
these  data.  Their  procedure  involved  a  reanalysis  of  the  neutron  spin  dependent 
structure  function  data,  using  the  EMC/SMC  data,  which  have  smaller  errors,  to 
determine  the  neutron  spin  dependent  structure  function,  0" (x),  at  small-z  values. 
Hence,  there  exist  widely  differing  viewpoints  [11],  [12],  [13]  on  whether  the  Bjorken 
sum  rule  is  violated  or  not. 

1.3  The  phenomenology 

In  what  follows,  we  shall  study,  in  depth,  the  behaviour  of  both  the  asymmetries 
and  the  structure  functions  at  small-  and  large-z  respectively  and  the  impact  on  the 
moments  and  Bjorken  sum  rule  of  extrapolation  to  these  experimentally  inaccessible 
kinematic  regimes. 

Before  we  begin  the  actual  analysis,  the  following  comments  are  in  order: 

1.  The  spin  independent  as  well  as  spin  dependent  densities  can  be  constructed 
from  the  helicity  densities,  ql(x)  and  gj(x),  of /-flavour  quarks  with  momen- 
tum fraction  «  and  spins  parallel  and  opposed  to  that  of  the  nucleon.  Specifi- 
cally, q  =  gt— q^  and  q  —  q^+q*-  .  For  a  large-z  behaviour,  q^'^  ~  (1  — z)0fT'or4  , 
both  q  and  q  have  a  behaviour  given  by  q,  q  ~  (l  —  x)a  ,  where  or  is  the  smaller 
of  (ar|,a!j)  .  Hence,  they  vanish  at  large- a:  at  the  same  rate. 

2.  gt,  q^-  are  positive  definite  densities;  hence,  \q\  <  \q\  for  all  x.    Hence  the 
partonic  asymmetries,  Aq  =  q/q,  are  bounded  by  ±1  .  In  fact,  unless  «f  =  c*| 
(which  is  unlikely),  Aq  — +  ±1  as  £  — »•  1  .  Furthermore,  it  is  experimentally 
established  that  the  unpolarised  u-quark  density  dominates  over  the  d  and 
s  at  large- a: ;  hence,  the  proton,  neutron  and  deuteron  asymmetries  must 
approach  each  other  as  x  — »•  1. 

Our  observations  now  follow  automatically.  We  use  the  parametrisation  of  F$'  as 
given  by  the  NMC  [6]  (with  F2n  =  2Ff  -  F$)  and  use  R  -  <TL/<TT  as  given  by 
the  SLAG  group  [14].  This  way,  there  is  no  need  to  specify  the  individual  parton 
densities.  We  assume  all  the  A±  to  be  Q2  independent,  as  observed,  and  analyse 
all  data  at  Q2  =  5  GeV2  . 

1.4  The  unconstrained  fits 

The  corresponding  gi  from  the  three  experiments  are  shown  in  fig.  1;  the  solid  curve 
is  the  best  fit  to  the  data  for  a  parametrisation,  gi(x)  =  Nxa(l  +  bx)(l  —  x)&  , 
with  b  =  0  for  the  proton  case  and  a  large-je  dependence,  (1  —  z2)'3,  specially  for 
the  neutron  case  due  to  the  very  fast  fall-off  of  g"  at  large-a:.  (The  errors  in  the 
deuteron  data  preclude  us  from  specifying  the  exact  crossing  point,  XQ  (=  —1/6), 
where  the  structure  function  (or  asymmetry)  changes  sign;  the  two  curves  shown 
indicate  the  maximum  extent  of  variation  of  b).  These  fits  to  the  structure  functions 
result  in  fits  to  the  asymmetry  shown  by  the  dotted  lines  in  fig.  2.  It  is  observed 
that  the  intermediate-x  asymmetry  is  distinctly  different  from  zero  only  for  the 
proton. 

The  corresponding  moments  are  shown  in  table  2.  We  assume  our  errors  to  be 
the  same  as  the  experimentally  quoted  ones  as  the  errors  due  to  the  fit  are  negligible 
in  all  but  the  deuteron  case.  The  range  of  variation  in  the  deuteron  case  due  to 


tne  anowea  range,  u.ui  s  #o  S  "-"0,  may  De  considered  as  an  error  in  the  moment 
coming  from  the  fit.  The  moments  are  not  all  in  agreement  (to  within  Iff}  with 
the  predictions  of  the  Bjorken  sum  rule.  Of  course,  the  proton  and  deuteron  data 
are  also  not  in  accordance  with  the  Ellis  Jaffe  prediction  [9],  although  we  do  not 
discuss  this  further. 


Table  2.  The  nonsinglet  moments,  f(tf-g?)dx,  /2(0f  -  gf)dx  and  f  2(gf  -  g?)dx  are 
shown  for  Q2  =  5  GeV2. 


Target 

/  9i  d*  (Q*) 
(expt) 

(fit  to  0i) 

J^dx(5GeV2) 
from  fit  to  A\. 

P 

0.126  ±0.01  ±0.015(10) 

0.125(10) 
0.124(5) 

0.131 

n 

-0.022  ±0.011  (2) 

-0.030(2) 
-0.036(5) 

-0.035 

d 
d' 

0.023  ±0.02  ±0.015(5) 

M 

0.0176(5) 
0.0135(5) 

0.052 
0.042 

We  now  move  on  to  the  x  dependence  of  these  structure  functions,  which  is  really 
the  focus  of  the  talk.  We  first  express  every  quark  density  as  a  sum  of  valence  and 
sea  parts,  i.e., 

«'(*)  =  «£(*)  +  «/(*);       9* '(*)  =  «{(*)• 


We  ignore  (small)  corrections  due  to  deviations  from  the  Gottfried  sum  rule,  i.e., 
we  assume  «  =  d.  The  three  measurements  that  we  have  in  the  spin  dependent 
sector  are  not  sufficient  to  extract  any  of  the  unknown  densities  individually;  we 
can  only  extract  the  non-singlet  combination  («(x)  —  d(x}).  This  is  because  all 
the  experiments  measure  quantities  whose  singlet  parts  are  identical.  Then  the 
combinations,  (<%  —  0"),  2(^  —  gf)  and  2(grf  —  gf)  all  give  information  on  the 
non-singlet  quantity,  (u  —  d).  We  use  the  parametrised  fits  to  the  various  structure 
functions  at  Q2  =  5  GeV2  to  plot  these  differences  which  we  denote  as  pn,  pd  and 
dn,  as  functions  of  x  in  fig.  3.  We  have  chosen  the  deuteron  parametrisation,  d, 
corresponding  to  XQ  =  0.06.  Similar  results  are  obtained  on  using  the  d':  i.e.,  the 
XQ  =  0.04  fit.  The  non-singlet  moments  of  all  combinations  are  shown  in  table  3. 
The  dn  combination  is  seen  to  deviate  substantially  from  the  Bjorken  sum  rule 
prediction. 


-.xx     tone 


1 

0.8 
0.6 

1    °'4 
0.2 

0 
-0.2 


-0.2 


Ct-H 

00 


-0.4 


0.5 


x 
^, 

lad 


-1 

-1.5 


Q2=5  GeV*- 


Q2=5  GeV2 


0.01 


0.1 
x 


Figure  1.  The  structure  function  data  for  the  proton,  neutron  and  deuteron  are  sht 
as  a  function  of  i.  The  solid  curves  are  our  fits  to  the  data. 
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Polarised  structure  functions 
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Figure  2.  The  proton,  neutron  and  deuteron  asymmetry  data  are  shown  as  a  function 
of  x.  The  solid  curves  are  our  constrained  fits  to  the  data  (vli(l)  =  1),  while  the  dotted 
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Figure  3.  The  non-singlet  combinations,  (g{  -  g")  (dotted  curves),  2(pf  -  g* )  (dash* 
curves)  and  1(g*  -  g")  (long  dashed  curves)  are  plotted  as  a  function  of  x.  The  solid  lii 
corresponds  to  a  prediction  in  agreement  with  the  Bjorken  sum  rule. 


Table  3.  The  nonsinglet  moments,  f(g?-g?)dx,  fl(g*-gf)dx  and  f2(gf-g")dx  a 
shown  for  Q2  =  5  GeV2. 


Set 

Moments  of  non-single 
9i(x) 

;  structure  functions  from  fits  to 
A,(x] 

p-n 
2(p-d) 
2(d-n) 

0.161  ±0.015 
(0.215-0.223)  ±0.032 
(0.099-0.107)  ±0.033 

0.166  ±0.015 
(0.160-0.179)  ±0.032 
(0.154-0.173)  ±0.033 

Average  value: 
Bjorken  Sum  Rule  prediction: 

0.166  ±0.013 
0.189  ±0.005 
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Polarised  structure  functions 

To  proceed  further,  we  need  some  more  input.  For  this,  we  parametrise  the  spin 
dependent  valence  densities  [15]  using  the  Carlitz  Kaur  model  [16].  Specifically,  we 
have 


dy(x)    =     — 

where  cos2^(x)  =  1/(1  +  7\/l  —  x/x0-1)  is  a  one-parameter  spin  dilution  function. 
We  use  the  GRV  parametrisations  [17]  for  the  spin  independent  densities,  though 
results  are  not  sensitive  to  choice  of  parametrisation.  Since  uy  and  dy  are  known, 
7  fixes  uy  and  dy  individually. 

The  input  that  we  use  is  the  Bjorken  sum  rule  prediction  itself!  We  choose 
7  so  that  the  combination,  (uy(x)  —  dy(x})/6  integrates  to  g^,  so  the  Bjorken 
sum  rule  is  satisfied.  The  resulting  O(as)  corrected  non-singlet  combination,  [18] 
(uy(x)  —  dy(x))(l  —  at/ir)/Q  ,  is  plotted  as  the  solid  curve  in  fig.  3,  for  comparison 
with  the  data  differences.  We  see  that  the  pd  and  pn  combinations  are  quite  close 
to  the  Carlitz-Kaur  fit  while  dn  deviates  severely  from  it,  especially  at  small-x. 
Furthermore,  at  large-  a:  (x  >  0.2),  the  combination  dn  underestimates  the  fit  while 
pd  overestimates  it.  We  have  also  plotted  the  actual  data  points  for  the  pd  combi- 
nation (made  possible  as  EMC  and  SMC  have  data  in  the  same  x-bins)  to  give  an 
idea  of  the  errors  involved. 

Therefore,  deviations  from  the  expected  behaviour  (where  by  'expected'  we 
mean  an  x-dependence  of  the  non-singlet  density  compatible  with  the  Bjorken  sum 
rule)  occur  at  both  large  as  well  as  small  values  of  x.  Hence  deviations  in  the  mo- 
ments of  these  differences  also  arise  from  both  these  kinematically  distinct  regions. 
Is  there  any  way  to  explain  these  deviations? 

1.5  The  constrained  fits 

Note  that  the  errors  on  the  data,  especially  at  x  >  0.2,  are  fairly  large.  Also  recall 
the  phenomenological  bias  that  all  the  asymmetries  should  approach  each  other  at 
large-x.  Furthermore,  the  parton  model  expects  this  value  to  be  ±1  at  x  =  1.  We 
now  reparametrise  the  data  with  this  bias  or  constraint.  Our  motivation  for  such 
a  procedure  is  that  the  proton  asymmetry  is  clearly  distinct  from  zero  at  large-x 
values.  Hence,  the  deuteron  and  neutron  asymmetries  (which  are  not  so  clear,  due  to 
large  errors  at  large-x,  as  can  be  seen  seen  from  fig.  2)  must  also  follow  this  trend.  So 
we  fit  the  asymmetries  to  the  form,  Apl'd>n(x]  =  Nxa  {I-  exp  [a  (x  -  XQ)]}  ,  where 
the  constraint  A\(\]  =  1  determines  N  from  N  {1  -  exp  [a(l  -  x0)]}  =  1  ,  and 
XQ  =  0  for  the  proton.  These  fits  are  shown  as  solid  lines  in  fig.  2.  The  differences 
between  the  two  fits  are  strongly  marked  at  larger  x  values  for  the  neutron  and 
deuteron.  Surprisingly,  the  x2  values  are  not  very  different  for  these  new  fits  as 
compared  to  the  case  when  the  data  were  fitted  without  any  theoretical  bias  (see 
table  4).  This  is  obvious  for  the  proton  case  as  the  data  are  clearly  compatible 
with  the  assiimntion  that  -4?  —  »•  1  as  z  —  >•  1.  For  the  deuteron  and  neutron  data, 


Table  4.  The  xa  values  of  the  fits  (at  Q2  -  5  GeV2)  to  the  data  in  figs.  1  and  2  are 
shown,  d  (d')  is  the  denteron  fit  corresponding  to  xo  =  0.06  (0.04). 


Target 

No.  of 
data  points 

No.  of  para- 
meters in  fit 

X2/d-o.f 
9i(x) 

from  fits  to 

^i(«) 

P 
n 
d 
d7 

10 
8 
11 
11 

3 

4 
4 
4 

4/7 
4/4 
5/7 
5/7 

4/8 
5/5 
6/8 
8/8 

These  new  fits  to  the  asymmetry  yield  corresponding  spin  dependent  structure 
functions  as  shown  in  fig.  4. 

While  the  small-a:  fits  are  good,  the  large-r  data  are  systematically  overestimated 
(while  remaining  within  Iff)  for  the  neutron  and  deuteron.  The  next  logical  step 
is  to  find  the  moments  of  these  fits;  these  are  shown  in  the  last  column  of  table  2. 
The  moments  of  the  proton  and  neutron  do  not  change  much,  as  expected;  however, 
the  deuteron  moment  goes  up  remarkably.  The  deuteron  result  is  very  sensitive  to 
the  value  of  a?0.  Earlier,  the  (negative)  integral  in  the  small-x  region  was  being 
more  or  less  cancelled  by  the  positive  contribution  from  the  remaining  interval.  By 
increasing  the  large-ac  contribution,  the  moment  is  now  clearly  positive. 

We  show  the  ^-dependence  of  these  new  (theoretically  constrained)  non-singlet 
combinations,  pn,  pd  and  <fn,  as  defined  earlier,  in  fig.  5.  The  small-a:  fits  do  not 
change  much;  what  is  dramatic,  however,  is  the  behaviour  of  all  the  combinations 
at  large-z,  which  now  precisely  satisfy  the  theoretical  requirements.  The  agreement 
of  all  the  three  combinations  with  the  predicted  non-singlet  density  is  visible  from 
x  =  0.2  onwards.  (Recall  that  the  earlier  parametrisations  had  yielded  fits  which 
either  overestimated  or  underestimated  the  non-singlet  density  in  this  region).  Fur- 
thermore, the  corresponding  moments  of  all  these  non-singlet  distributions  are  now 
very  much  in  accordance  with  the  prediction  of  the  Bjorken  sum  rule,  as  seen  from 
table  3.  The  errors  are  those  from  the  data;  errors  in  the  fit  are  only  accounted  for 
in  the  deuteron  case,  leading  to  corresponding  range  of  values  for  the  pd  and  dn  mo- 
ments. We  emphasise  that  these  fits,  though  tailored  to  theoretical  considerations, 
are  definitely  consistent  with  the  data. 

To  sum  up,  while  the  results  are  sensitive  to  large-ar  extrapolation,  the  non- 
singlet  data  from  all  the  experiments  are  compatible,  within  errors  (10'),  with  the- 
oretical expectations  in  the  large-ar  region  and  the  Bjorken  sum  rule.  The  small-x 
non-singlet  data  still  deviate  considerably  from  the  nai've  parton  model  picture. 
This  sector  is  already  controversial  and  poorly  understood.  There  has  been  a  great 
deal  of  literature  on  the  subject;  we  do  not  comment  on  it  further. 

Since  we  do  not  have  three  independent  measurements,  we  cannot  specify  the 
individual  quark  contributions  (A«,  Arf  and  A*)  without  a  further  assumption 
We  choose  this  to  be  the  constraint  coming  from  the  non-singlet  SU(3)  symmetric 
combination: 

Aw  -f  Ad!  —  2A«  =  08  , 
where  g&  =  0.584  ±0.018  based  on  nucleon  and  E~  decay  data. 
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Figure  4.  The  same  as  fig.  1,  but  with  the  solid  curves  corresponding  to  constrained  fits 
to  the  asymmetry  data. 
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Figure  5.  The  same  as  fig.  3,  with  the  structure  functions  evaluated  from  constrained 
fits  to  the  asymmetry  data. 
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we  men  use  tne  moment  ot  tne  proton  gi  structure  function  and  (A«  —  Ad)  = 
1.097  ±  0.083  obtained  from  tables  2  and  3  respectively  to  get 

A«  =  0.701  ±0.037,      Ad  =-0.396  ±0.091,      As  =  -0.140  ±0.050  , 

so  that 

£  =  -  (Au  -f  Ad  +  As)  -  0.083  ±  0.055  . 

This  is  still  small,  just  as  the  nearly  zero  result  first  obtained  by  the  EMC  (though 
with  large  errors).  In  some  sense  then,  the  new  data  not  only  indicate  that  the 
Bjorken  sum  rule  is  valid,  but  also  that  the  original  controversy  on  the  nature  of 
the  nucleon  spin  and  the  smallness  of  the  quark  contribution  remains. 

We  emphasise  the  need  for  more  data,  particularly  in  the  intermediate  and 
large-  z  region.  Only  when  the  data  improve  to  a  point  where  the  large-x  p,  d 
and  n  asymmetries  clearly  differ  from  each  other  can  we  say  that  the  old  proton 
"singlet  spin  puzzle"  has  been  joined  by  a  new  "non-singlet  spin  puzzle!"  In  short, 
we  believe  that  current  data  are  not  in  conflict  with  the  Bjorken  sum  rule. 

We  briefly  comment  on  possible  higher  twist  effects.  These  affect  the  large- 
x  data.  The  aymmetries  are  seen  to  be  largely  Q2  independent.  So  all  the  Q2 
dependence  of  <%'  >n  must  come  from  that  of  F%  and  R.  This  has  already  been 
accounted  for.  In  fact,  the  NMC  parametrisation  of  F?  includes  both  logarithmic 
and  higher  twist  corrections.  Hence,  it  is  unlikely  that  higher  twist  terms  play  any 
major  role  in  the  process  of  lining  up  the  data  with  the  Bjorken  sum  rule.  Certainly 
we  have  shown  that  agreement  can  be  brought  about  without  the  use  of  such  extra 
terms. 

Finally,  it  seems  that  non-singlet  densities  are  interesting  and  deserve  to  be 
studied  in  their  own  right.  In  this  context,  we  recall  a  proposal  [19]  for  studying  spin 
dependent  valence  densities  through  semi-inclusive  hadroproduction  in  polarised 
DIS,  which  can  be  immediately  performed  without  difficulty.  In  particular,  the 
asymmetry  for  K  meson  production,  (defined  to  be  the  usual  spin  asymmetry,  but 
for  the  difference  in  production  of  K*  and  K~  mesons)  is 


i.e.,  it  measures  «v.  The  asymmetry  for  TT  production  involves  both  uv  and  dy 
although  it  uses  the  currently  controversial  [20]  assumption  of  isospin  invariance 
(the  former  does  not).  Certainly  these  two  asymmetries  should  well  be  able  to 
supplement  results  from  inclusive  DIS  as  both  are  expected  to  be  large  and  positive 
over  all  x. 


2.  The  spin  dependent  gluon  densities 

Finally,  we  briefly  address  the  question  of  extraction  of  the  spin  dependent  gluon 
densities.  Since  the  nucleon  spin  is  not  saturated  by  the  quark  contribution,  the 
gluons  are  likely  to  contribute  substantially,  although  they  occur  at  O(as)  in  the 
expression  for  the  spin  dependent  structure  function.  This  has  been  the  subject  of  a 

1  R7 


great  deal  of  controversy;  in  fact,  it  was  even  argued  that  the  magnitude  of  the  glu- 
onic  contribution  to  the  spin  of  the  p?oton  may  well  turn  out  to  be  renormalisation 
scheme  dependent!  We  outline  why  this  is  not  the  case,  and  pinpoint  the  source 
of  ambiguity  in  this  sector.  To  do  this,  we  go  back  to  the  parton  interpretation  of 
gi(x).  This  section  is  mainly  a  review  of  already  existent  formalism  [21,  22,  23,  24] 
addressing  the  question,  "Is  there  a  sizable  singlet  quark/gluon  contribution  to  the 
first  moment  of  <7i?" 

Look  at  the  leading  Q2  dependence  of  the  moments  of  the  parton  densities  (or 
spin  contributions  to  that  of  the  nucleon): 


where  t  =  ln(Q2/^2)  and  n  is  the  renormalisation  scale. 

Since  or,  vanishes  logarithmically  as  Q2  —*  oo,  A<j  grows  with  Q2  at  the  same 
rate.  Hence  Ag  may  be  large  even  at  finite  Q2.  Since  quarks  and  gluons  (as  also 
their  relative  orbital  angular  momentum)  contribute  to  the  nucleon  spin,  there  is 
apriori  no  reason  to  leave  this  contribution  out  of  the  g\  calculation.  This  means 
we  need  to  go  beyond  the  leading  order.  (Contrast  this  with  the  spin  independent 
case  where  the  momentum,  xg(x)  —  *•  0  as  at  —  »•  0). 

2.1  The  parton  model  calculation 

Parton  densities  are  not  uniquely  defined  beyond  the  leading  order.  Furthermore, 
the  gluon  contribution  to  g\  suffers  from  both  infrared  and  collinear  divergences.  In 
other  words,  the  result  depends  on  the  mode  of  regularisation  of  the  contributing 
box  diagram  (Fig  6). 


Figure  6.  The  gluon  contribution  to  the  g\  structure  function  in  the  parton  model. 


2.2  The  factorisation  approach 

Here,  there  is  a  clean  separation  of  the  'hard'  and  'soft'  components  of  the  total 


cross  section.  The  structure  function  symbolically  evaluates  to 

9i       =9i®  f\a  +  A 


(1) 


The  two  terms  represent  the  quark  and  gluon  contributions  to  the  structure  func- 
tion, respectively.  fp+/N(x)  (/p_/jv(«))  is  the  probability  of  locating  a  parton 
(p  —  0»  ¥>0)  in  a  nucleon,  JV,  with  spin  parallel  (antiparallel)  to  that  of  the  parent 
nucleon  and  /AP/JV  is  the  difference  of  these  two  probabilities  and  was  denoted  as 
p  in  the  previous  section  for  simplicity.  The  symbol  <g>  stands  for  the  convolution, 


f)®Kx)=   /'% 

Jx    y 


Each  term  is  therefore  a  convolution  of  the  soft  (nonperturbative,  and  therefore 
incomputable)  parton  densities  and  the  hard  (perturbative  and  computable)  scat- 
tering coefficients.  These  hard  scattering  coefficients,  g\'s  are  target  independent. 
The  simplest  way  to  calculate  them  is  to  replace  the  nucleon  target  N  by  a  known 
(point)  target,  say,  gluon.  f&q/g  and  f&g/g  are  now  computable.  So  In  gl  3 .  Hence 
the  only  unknowns  in  eq.  (1)  are  the  hard  scattering  coefficients.  However,  since 
f&q/g  i8  of  order  O(as)  we  need  to  compute  §1  only  to  leading  order,  (9(1).  This  is 
already  known  (=  6(1  —  ar)/2).  This  leaves  #f  as  the  only  unknown  which  can  thus 
be  found. 


g{®h6(l-x). 


Figure  7.  The  gluon-virtual  photon  scattering  cross  section  in  the  factorisation  ap- 
proach. The  LHS  corresponds  to  fig.  6.  The  terms  in  the  RHS  are  the  quark  and  gluon 
contributions  respectively. 

Specifically,  with  gluon  replacing  the  nucleon  target  in  eq.  (1),  the  equation  can  be 
diagrammatically  represented  as  shown  in  fig.  7.  The  LHS  of  fig.  7  is  known  (it 
is  the  parton  model  result)  and  has  both  IR  and  collinear  divergences.  However, 
these  are  exactly  cancelled  by  corresponding  divergences  on  the  RHS.  However,  the 
RHS  is  renormalisation  scheme  dependent,  while  the  LHS  is  not.  The  result  is  that 
g^  is  hard,  prescription  independent,  and  scheme  dependent.  Due  to  this,  the  glu- 
onic  contribution  to  the  first  moment  of  the  nucleon  structure  function  (f  g\(x)dx) 
vanishes  or  does  not  (equals  — a,/2fl-)  depending  on  whether  the  computation  is 
performed  in  the  MS  or  MOM  scheme. 

This  means  that  meaningful  results  can  be  obtained  only  if  all  quantities  (F?, 
01,  etc)  are  defined  upto  next-to-leading  order  in  the  same  scheme.  Then  only  will 
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the  experimentally  measurable  cross  sections  be  scheme  invariant.  However,  we 
emphasise  that  the  spin  dependent  gluon  density  and  its  moment  A0  are  well  defined 
always  and  therefore  can  be  unambiguously  measured.  In  fact,  the  advantage  of  the 
factorisation  approach  is  that  it  yields  [25]  an  operator  definition  of  the  densities 
(both  spin  independent  and  spin  dependent).  It  is  the  coefficient  of  the  A<?  term 
in  the  cross  section  that  is  scheme  dependent. 

A  simple  way  of  seeing  this  is  to  write  the  spin  dependent  DIS  cross  section  for 
a  nucleon  target  of  mass,  M: 


where  q  =  f&9/N  and  g  =  f&9/N  as  before.    Replacing  N  by  a  gluon  target, 
factorisation  yields 


Here  ffq>g  are  target  independent  while  f&g/g  ~  5(1  —  x).  Substituting  for  ffg  from 
eq.  (3)  in  eq.  (2),  we  have 

ffN  =  fft  ®  q  +  ffg  <8>  9  -  ffq  <8>  f&qig  <8>  9  • 

The  last  term  can  be  thought  of  as  either  a  correction  to  q  or  a  correction  to  ffg. 
Hence  we  are  free  to  redefine 


OR 


q' 


That  is,  there  is  ambiguity  in  q  and/or  aff  to  order  O(at),  but  none  in  </  or  in  gi(x)- 
Hence,  although  0i  itself  may  not  be  able  to  distinguish  between  models  with  large 
and  small  A<j,  it  will  be  possible  (say,  in  direct  photon  production  in  polarised  pp 
collisions)  to  establish  the  gluonic  contribution  to  the  spin  of  the  nucleon. 

3.  Conclusions 

We  have  established  that  current  polarised  DIS  data  on  spin  dependent  nucleon 
structure  functions  are  consistent  with  predictions  in  the  nonsinglet  sector  (Bjorken 
Sum  Rule).  The  quark  contribution  to  the  spin  of  the  proton  remains  small  (about 
17%).  It  is  therefore  likely  that  gluons  may  contribute  substantially.  In  view  of 
this,  and  other  controversies  in  the  field,  we  have  reviewed  the  formalism  of  DIS  and 
the  contribution  of  the  gluonic  term  to  the  structure  functions,  in  the  factorisation 
approach.  We  emphasise  that,  although  DIS  cannot  measure  the  gluon  contribution 
to  the  proton  spin,  such  a  quantity  is  well-defined  and  can  be  measured  elsewhere. 
In  fact,  several  new  structure  functions  [25]  have  recently  been  defined,  occurring  in 
both  DIS  and  pp  collision  processes  in  longitudinal  as  well  as  transversely  polarised 
collisions,  that  may  throw  more  light  on  the  issue  of  the  proton  spin. 


References 

[1]     Two  informal  lectures  were  later  given  on  them  by  V.  Ravindran  and  I  in  the  group 

sessions. 

[2]     J.  Ashman  et  al.,  EMC,  Nucl.  Phys.  B328  (1989)  1 
[3]     B.  Adeva  et  al.,  SMC,  Phys.  Lett.  B302  (1993)  533 

[4]     P.L.  Anthony  et  al.,  E142  Collaboration,  Phys.  Rev.  Lett.  71  (1993)  959;  E.  Hughes, 
private  communication 

[5]     J.D.  Bjorken,  Phys.  Rev.  148  (1966)  1467;  Phys.  Rev.  Dl  (1970)  1376 
[6]     P.  Amaudruz  et  al.,  NMC,  Phys.  Lett.  B295  (1992)  159 

[7]     M.  Derrick,  et  al.,  ZEUS  Collaboration,  Phys.   Lett.  B316  (1993)  412;  I.  Abt,  et 
al.,  HI  Collaboration,  Nucl.  Phys.  B407  (1993)  515 

[8]  We  do  not  comment  on  possible  effects  of  Gottfried  sum  rule  violation  here. 

[9]  J.  Ellis  and  R.L.  Jaffe,  Phys.  Rev.  D9  (1974)  1444;  E:  DIG  (1974)  1669 

[10]  B.  Adeva  et  al.,  SMC,  Phys.  Lett.  B320  (1994)  400 

[11]  F.E.  Close  and  R.G.  Roberts,  Phys.  Lett.  B316  (1993)  165 

[12]  J.  Ellis  and  M.  Karliner,  Phys.  Lett.  B313  (1993)  131 

[13]  G.  Altarelli,  P.  Nason  and  G.  Ridolfi,  Phys.  Lett.  B320  (1994)  152 

[14]  L.W.  Whitlow,  Phys.  Lett.  B250  (1990)  193 

[15]  D.  Indumathi,  M.V.N.  Murthy,  V.  Ravindran,  Z.  Phys.  C56  (1992)  427 

[16]     R.  Carlitz  and  J.  Kaur,  Phys.    Rev.    Lett.    38  (1977)  673;  J.  Kaur,  Nucl.    Phys. 
B128  (1977)  219 

[17]     M.  Gluck,  E.  Reya,  A.  Vogt,  Z.  Phys.  C53  (1992)  127 

[18]     Since  we  are  working  at  Q2  =  5  GeV2,  higher  order  corrections  to  this  sum  rule  are 
negligible. 

[19]     S.  Gupta,  D.  Indumathi,  M.V.N.  Murthy,  Z.  Phys.  C47  (1990)  227 
[20]     P.  Amaudruz  et  al.,  NMC,  Phys.  Rev.  Lett.  66  (1991)  2712 

[21]     Ahmed  and  G.G.  Ross,  Phys.   Lett.  B56  (1975)  385;  G.  Altarelli  and  G.G.  Ross, 
Phys.  Lett.  B212  (1988)  391 

[22]     A.  Manohar,  Phys.  Rev.  Lett.  66  (1991)  289 

[23]     P.  Mathews  and  V.  Ravindran,  Phys.  Lett.  B278  (1992)  175 

[24]     U.  Ellwanger,  Phys.  Lett.  B259  (1991)  469 

[25]     R.L.  Jaffe  and  X.-Ji,  Nucl.  Phys.  B375  (1992)  527;  X.  Ji,  Phys.  Lett.  B284  (1992) 
137 


Oscillating  neutrinos:  theory  vs  experiment 

ANJAN  S-.  JOSHIPURA 

Theory  Group,  Physical  Research  Lab.,  Ahmedabad,  India. 


Abstract.  Possible  hints  on  neutrino  masses  are  reviewed.  They  come  from  the  deficits 
in  the  solar  as  well  as  atmospheric  neutrinos  and  from  need  of  a  significant  amount  of 
hot  component  in  the  dark  matter  of  the  universe.  The  role  of  three  generation  mixing  in 
simultaneously  solving  the  solar  and  atmospheric  neutrino  problem  is  discussed.  All  the 
three  hints  can  be  reconciled  if  three  neutrinos  are  almost  degenerate.  Models  for  neutrino 
masses  and  mixing  implied  by  the  above  hints  are  briefly  discussed. 


1.  Introduction 


We  discuss  in  this  talk  the  implications  of  present  experiments  on  the  properties 
-  masses  and  mixing  -  of  the  known  neutrinos.  So  far,  one  does  not  have  any 
conclusive  evidence  from  laboratory  in  favour  of  neutrino  masses.  But  there  exist 
a  number  of  hints  which  strongly  point  to  non- vanishing  neutrino  masses.  More 
importantly,  when  taken  in  totality,  these  hints  imply  a  definite  pattern  for  the 
neutrino  masses.  We  shall  review  [1]  these  hints  and  try  to  present  general  conclu- 
sions about  the  structure  of  neutrino  masses  following  from  them.  Then  we  shall 
briefly  review  theoretical  schemes  which  can  lead  to  the  desired  pattern  for  neutrino 
masses  and  mixing. 

2.  Hints  on  neutrino  masses 


The  hints  on  neutrino  masses  come  at  least  from  three  different  sources:  (1)  Solar 
neutrino  deficit  (2)  Atmospheric  neutrino  deficit  and  (3)  need  for  a  significant  hot 
component  in  the  dark  matter  of  the  universe.  Many  reviews  [2,  3,  4]  exist  which 
summarize  these  hints.  We  briefly  mention  salient  features  of  each  of  these  topics. 

2.1  Solar  neutrino  deficit 

Four  different  experiments  looking  for  the  electron  neutrino  of  the  solar  origin  have 
found  depletion  in  the  solar  ve  flux  compared  to  theoretical  expectations  based  on 
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the  standard  solar  model  (SSM).  The  recent  results  are  [3] 


=0.318  ±0.051    (Homestake) 

=  0.51  ±0.04  ±0.06    (Kamioka  II  +  III) 

=  0.58  ±0.07    (GALLEX-fSAGE) 


•  The  SSM  rates  are  calculated  using  the  flux  of  Bachall's  group. 

•  Different  experiments  observed  different  amount  of  depletion.    Since  these 
experiments  are  sensitive  to  different  parts  of  the  neutrino  energy  spectrum, 
the  above  results  can  be  understood  in  terms  of  some  underlying  mechanism 
which  can  cause  the  energy  dependent  suppression.  This  would  occur  if  the 
Mikhyev  Smirnov  Wolfenstein  (MSW)  [5]  conversion  or  the  long  wave  length 
vacuum  [6]  oscillations  cause  the  depletion  in  the  ve  flux.  In  contrast,  the  vac- 
uum oscillations  with  a  wavelength  much  shorter  than  the  Sun-Earth  distance 
cannot  explain  the  observed  energy  dependence  of  depletion. 

•  It  has  been  argued  [7]  that  various  experimental  results  are  indicative  of  some 
new  physics  rather  than  of  uncertainty  in  our  knowledge  of  the  Sun. 

•  If  one  assumes  neutrino  oscillations  to  be  responsible  for  the  depletion  of 
ve  then  the  relevant  parameter  space  is  restricted  to  the  following  in  case  of 
the  two  generation  mixing.  For  the  vacuum  solution,  one  has 

6m2~(0.5'5-l.l)x  10-10eV2     sin2  29  ~  0.75  -  1  (2) 

while  for  the  MSW  solution,  one  has 

6m2  -  6  x  10~6eV2       sin2  20  ~  0.007     (Non  -  Adiabatic) 

6m2  ~  9  x  ID'6     (eV2)  ;      sin2  20  ~  0.6(  Adiabatic)  (3) 

2.2  Atmospheric  neutrino  deficit 

The  atmospheric  neutrinos  are  produced  in  decays  of  pions,  kaons  and  muons  origi- 
nating from  interactions  of  cosmic  rays  with  the  atmosphere.  Observations  of  these 
neutrinos  seem  to  indicate  the  depletion  in  the  muon  to  electron  neutrino  flux  ratio 
compared  to  the  theoretical  expectations.  Experimental  findings  of  various  groups 
are  as  follows  [3]: 


R    =  =  Q.eOl'.s  ±  0.05    (KAM    SubGeV) 


(KAM    MultiGeV) 
=    0.54  ±0.05  ±0.12,    (1MB  -3) 
=    0.69  ±0.19  ±0.09,    (SOUDAN) 
=     0.87  ±0.16  ±0.08. 


\f      I  /  JW  C/  O  Q  *4%^     J.1.1.V/XA      V^     VrC»J.  iV/     0maU.lCk>UAVJ.l.     Uk71l4^      UJJ.V- 

theoretically  calculated  fluxes. 

The  following  remarks  are  in  order: 

•  The  absolute  flux  of  i/e  and  v^  have  about  30%  uncertainty  but  their  ratio 
is  much  less  uncertain  (~  5%).    Thus  the  experimental  indication  in  the 
depletion  in  R  can  be  pointing  to  genuine  effects  like  oscillations. 

9  The  deficit  is  seen  [8]  by  Kamioka  in  two  sets  of  data.  One  corresponding  to 
the  contained  events  with  energy  <  (9(GeV)  approximately.  The  other  set 
consists  of  contained  and  partially  contained  events  in  the  multi  GeV  range. 

«  1MB  group  has  also  looked  for  the  upward  going  muons.  Their  numbers  in 
this  case  are  consistent  with  no  oscillation  hypothesis. 

•  The  Kamioka  group  finds  [8]  strong  dependence  on  the  zenith  angle  in  the 
multi  GeV  data.   The  down  coming  neutrinos  do  not  show  much  depletion 
compared  to  the  upgoing  ones  which  are  depleted  by  about  50%.  Since,  the 
latter  travels  much  larger  distance  compared  to  the  down  going  neutrinos,  this 
zenith  angle  dependence  is  strongly  suggestive  of  the  oscillation  phenomena 
with  typical  wavelength  of  order  few  tens  of  km.  In  fact,  the  best  fit  values 
of  parameters  quoted  by  Kamioka  'assuming  the  oscillations  to  be  the  cause 
of  depletion  are 

Am2  ~  1.6  x  10~2  eV2  ,  sin2  20  ~  1  ,  (5) 

2.3  Dark  matter 

Combinations  of  experimental  evidences  and  strong  theoretical  arguments  lead  one 
to  believe  that  our  universe  is  mostly  made  up  of  non  baryonic  dark  matter  [9,  10]. 

Neutrinos  are  the  only  known  particles  among  various  candidates  for  the 
dark  matter  that  have  been  proposed.  But  it  is  likely  that  neutrinos  by  themselves 
may  not  be  responsible  for  all  the  non  baryonic  dark  matter.  This  follows  from  the 
arguments  based  on  the  structure  formation  [10]  in  the  early  universe.  Because  of 
relatively  small  masses,  neutrinos  are  relativistic  till  quite  late  in  the  evolution  of 
the  universe.  Free  streaming  of  these  neutrinos  cannot  allow  for  the  growth  of  small 
structures  which  could  have  evolved  to  the  galactic  sizes  at  present.  The  structures 
at  smaller  scales  arise  more  naturally  if  the  dark  matter  is  cold,  for  example,  heavy 
weakly  interacting  particle  like  the  lightest  supersymmetric  state.  But  this  by  itself 
is  not  successful  in  explaining  the  large  scale  structure.  This  has  given  rise  [11] 
to  suggestion  of  the  mixed  dark  matter  scenario  with  simultaneous  presence  of  the 
hot  and  cold  component  in  the  dark  matter.  Assuming  the  hot  component  to  be  in 
the  form  of  neutrinos  one  finds  that  roughly  30%  contribution  by  them  in  addition 
to  cold  matter  simultaneously  fit  the  COBE  and  IRAS  observations.  Taken  at  the 
face  value  this  suggests  that  the  sum  of  the  neutrino  masses  [11]  should  be 

,  -  7eV  (6) 
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It  is  likely  that  neutrino  masses  may  not  be  responsible  for  one  or  more  of  the  phe- 
nomena discussed  in  the  last  section  or  that  the  experimental  findings  themselves 
may  need  some  revision.  But  if  one  takes  all  these  hints  seriously  than  one  could 
draw  strong  theoretical  conclusions  about  the  pattern  of  neutrino  masses.  First, 
we  discuss  the  possible  pattern  of  neutrino  masses  and  mixing  necessary  in  order 
to  simultaneously  solve  the  solar  and  atmospheric  neutrino  problems.  If  one  also 
wants  neutrinos  to  provide  the  hot  component  in  the  dark  matter  then  the  struc- 
ture of  neutrino  masses  get  further  constrained.  We  discuss  this  in  the  subsequent 
subsection. 

3.1  Solar  and  atmospheric  neutrino  deficits 

The  restrictions  imposed  by  the  solar  and  atmospheric  neutrino  deficits  on  masses 
and  mixing  have  already  been  displayed  in  eqs.(2,3)  and  eq.  (5).  One  can  draw  the 
following  obvious  conclusions  from  these  equations: 

(i)  At  least  two  of  the  three  masses  and  mixing  angles  among  neutrinos  must  be 
non-zero. 

(it)  At  least  one  of  the  mixing  angles  must  be  large  and 

(in)  two  independent  (mass)2  differences  between  neutrinos  must  be  hierarchical. 
Such  hierarchy  can  result  if  the  non-zero  masses  themselves  are  hierarchical  or  if  two 
are  nearly  degenerate  with  the  common  (mass)2  around  10~2  (eV)2  and  (mass)2 
splitting  around  KT5  -  10~6  (eV)2. 

The  (i)  above  imply  that  all  three  neutrinos  would  be  involved  in  the  os- 
cillations and  conventional  analysis  of  the  neutrino  oscillations  in  terms  of  two 
generations  may  be  inadequate  in  general.  More  importantly,  in  many  cases  the 
observed  atmospheric  neutrino  deficit  can  influence  the  region  of  parameter  space 
allowed  by  the  solar  neutrino  experiments  and  vice  versa. 

A  priori,  the  analysis  of  neutrino  experiments  in  terms  of  three  neutrino 
generations  seems  complicated  since  in  general  two  independent  (mass)2  differences 
and  three  mixing  angles  are  involved.  But  the  fact  that  (mass)2  differences  are 
hierarchical  greatly  simplifies  the  analysis  of  oscillations  both  in  vacuum  as  well  as 
in  the  presence  of  the  matter.  Thus  it  becomes  possible  to  simultaneously  analyze 
the  combined  implications  of  the  solar  and  atmospheric  neutrino  data. 

Consider  first  the  vacuum  oscillations  and  the  simplest  mass  hierarchy  mVl  <C 
"V,  <  mvs  with  A32  ~  10~2  (eV)2,  Ai2  ~  10~5  (eV)2.  The  length  scale  associated 
with  An  is  ~  K^iooMe^)  km.  The  effects  of  such  oscillations  cannot  be  felt  in 
the  laboratory  or  in  the  atmospheric  neutrino  experiments.  Hence,  one  can  take 
mi  =  ^2  in  analyzing  these  phenomena  and  all  relevant  conversion  probabilities 
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[12]  depend  only  on  A32. 

Pve  ve   =  1  -  4sf3c?35;  PVe  Vt 


where  S  =  sin2  4aj,  .  Note  that  only  two  of  the  three  mixing  angles  appear  in  all 
the  probabilities.  Thus  one  has  effectively  one  more  parameter  compared  to  the 
conventional  analysis  in  terms  of  only  two  generations.  The  detailed  restrictions 
on  the  angles  #13  ,  023  and  on  A23  following  from  the  laboratory  search  of  the 
oscillations  can  be  found  for  example  in  ref.[12]. 

Consider  now  the  matter  effects  inside  the  Sun.  The  matter  can  enhance  the 
conversion  of  ve  if  the  latter  go  through  the  MSW  resonance  inside  the  Sun.  As  is 
well  known,  this  requires  the  relevant  (mass)2  differences  around  10~5  (eV)2.  The 
oscillations  characterized  by  A32  ~  10~2  (eV)2  do  not  go  through  a  resonance  and 
the  matter  effects  also  get  simplified  in  spite  of  the  presence  of  three  generations 
[12,  13,  14].  More  specifically,  the  effective  matter  dependent  (mass)2  matrix  can 
be  expressed  as 


-f-  .Ac13    *i2Ci2A2i       0 
c?2A2i          0 


2A32 

0  A31 


(8) 


where  A  =  2\/2GfF«e£I.  The  above  equation  is  formally  the  same  as  in  case  of  two 
generations.  There  are  two  important  differences  however.  The  resonance  condition 
following  from  eq.(8)  now  becomes 


(9) 

Secondly,  the  survival  probability  for  the  electron  neutrino  now  takes  the 
form 

sJ3  (10) 


where  P2(^i2>  Ai2)  is  the  standard  survival  probability  in  case  of  the  two  gener- 
ations. Both  eqs  (9)  and  (10)  differ  from  the  corresponding  two  genration  case 
because  of  the  presence  of  one  more  mixing  angle  namely,  #13. 

One  could  distinguish  between  two  physically  different  possibilities: 


(b)  Alternatively,  the  solar  neutrino  problem  can  be  solved  by  the  vt  vr 
oscillations  and  atmospheric  neutrino  by  the  ve  v^  [14].  The  survival  probability  is 
given  in  this  case  by 

Pv.v.   =cJ2P2(013,Ai3)  +  st2  (11) 

Note  that  in  case  (b),  the  additional  angle  entering  the  survival  probabilities  for  the 
solar  ve  is  required  to  be  large  from  the  atmospheric  neutrino  anomaly  in  contrast 
to  the  case  (a)  where  the  angle  #13  could  be  small.  Thus,  in  case  (b),  one  cannot 
treat  the  solar  neutrino  deficite  as  purely  two  generation  problem  and  the  allowed 
region  in  parameter  space  changes.  The  region  allowed  by  the  solar  neutrino  data 
including  the  energy  distribution  observed  by  the  Kamioka  is  determined  for  the 
case  (b)  in  [14].  The  allowed  region  is  considerably  larger  in  comparision  to  the  two 
generation  scenario. 

Note  that  the  cases  (b)  above  requires  a  mass  hierarchy  different  from  the 
conventional  one  namely  mVf  <C  mVll  <C  mVr  .  But  such  inverted  hierarchies  are 
not  uncommon  and  occur  for  example  in  one  of  the  most  popular  models  for  the 
neutrino  masses  namely,  the  Zee  model  [15].  The  detailed  mixing  pattern  predicted 
in  this  model  is  however  not  favoured  [16]  by  the  data  in  case  of  scenario  (b). 

3.2  Combined  solution 

While  it  is  possible  to  solve  the  atmospheric  and  solar  neutrino  problems  in  variety 
of  ways,  combined  solution  of  all  three  problems  imply  stringent  constraints  on  the 
neutrino  mass  spectrum.  The  requirement  that  neutrinos  provide  the  hot  compo- 
nent in  the  dark  matter  demands  that  at  least  one  of  the  neutrino  mass  should  be 
in  the  eV  range.  But  if  there  are  only  three  neutrinos  then  one  cannot  generate 
two  (mass)2  differences  both  of  which  are  much  smaller  then  0(eV)  unless  all  the 
neutrinos  are  degenerate  to  a  high  degree.  Alternatively,  one  needs  to  introduce 
additional  light  sterile  [4]  neutrino.  We  shall  consider  the  minimal  scenario  without 
any  light  sterile  neutrinos  in  the  following. 

4.  Models  for  neutrino  masses 


The  mass  terms  for  three  light  neutrinos  can  be  written  in  general  as  follows: 
-A™,,  =  -vl  mej}  VL  (12) 

&i 

where  mtjj  is  a  matrix  in  generation  space  transforming  under  5i7(2)i,  as  a  triplet. 
mejf  breaks  the  lepton  number  but  it  can  preserve  some  linear  combinations  of  the 
family  lepton  numbers.  mejj  can  be  generated  in  one  of  the  following  ways  [4]: 

•  Add  an  SU(2)  triplet  scalar  field  T  and  assign  non-zero  vacuum  expectation 
value  (vev)  to  its  neutral  component.  This  way  of  generating  mass  term  gives 


width  of  Z. 


•  Generate  an  effective  triplet  term  by  radiative  corrections.  A  proto-type  of 
this  way  of  mass  generation  is  provided  by  the  Zee  model  [15]. 

®  Couple  the  VL  to  one  or  more  heavy  neutrinos.  This  results  in  the  effective 
couplings  of  the  above  type.  The  prime  example  of  this  is  the  well  known  see- 
saw mechanism  which  is  characterized  by  the  following  mass  matrix  between 
VL  and  VR 


leading  to  the  effective  mass  matrix 

meff  =  -mDMRlmo  (14) 


Any  one  or  more  of  above  can  be  used  to  generate  masses  for  the  known  left-handed 
neutrinos. 

The  seesaw  models  naturally  arise  in  grand  unified  theories  like  50(10).  The 
simplest  of  such  theory  also  relates  the  neutrino  Dirac  masses  to  the  mass  matrix  for 
the  charged  2/3  quarks.  As  a  result,  the  neutrino  masses  display  strong  generation 
dependence  as  long  as  MR  is  generation  blind.  Typically, 


/1CX 

mv-   ~—  -    ]mVlt    ~  -rf-    ;mVr    ~  -r/-  (15) 

MR  * 


The  above  equations  imply 

(%)4  ~  108  (16) 

21         mc 

Hence  it  is  not  possible  to  solve  both  the  atmospheric  and  solar  neutrino  prob- 
lems simultaneously  in  the  seesaw  model  based  on  the  simplest  50(10).  It  should 
however  be  remembered  that  the  above  results,  many  times  taken  as  predictions 
of  the  seesaw  scheme,  are  strongly  model  dependent.  If  MR  shows  some  genera- 
tion dependence  then  the  predictions  could  change  drastically.  For  example,  it  is 
possible  [17]  to  construct  models  where  all  the  three  neutrinos  have  masses  in  the 
eV  range  and  two  of  them  have  mass  splitting  which  could  account  for  the  solar 
neutrino  deficit  through  the  MSW  mechanism.  Moreover,  such  structure  follows 
from  natural  assumptions  namely  mo  coinciding  with  the  up  quark  masses  and  all 
the  nonzero  elements  of  MR  having  the  same  order  of  magnitudes.  Such  models  are 
interesting  from  the  point  of  view  that  they  display  completely  different  pattern 
compared  to  the  standard  seesaw  prediction.  But,  one  cannot  still  obtain  almost 
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A  possible  mechanism  to  obtain  (almost)  degenerate  spectrum  and  the  re- 
quired departures  from  degeneracy  was  proposed  in  ref.[18,  19].  The  mechanism 
presented  there  is  quite  natural  in  the  context  of  the  left  right  symmetric  theories 
with  manifest  parity  invariance.  The  majorana  masses  for  the  right  handed  neu- 
trinos come  from  the  S77(2)#  triplet.  But  then  the  manifest  left  right  symmetry 
requires  the  presence  of  a  left  handed  triplet  field  with  non-zero  vev  resulting  in 
the  following  structure  instead  of  eq.(13) 


(17) 


where  mi  is  the  majorana  mass  for  the  left  handed  neutrinos.  This  generates  the 
following  effective  mass 

mejj  =  mjr  —  mrtM^mf)  (18) 

If  there  exists  some  horizontal  symmetry  [22]  which  makes  the  majorana  mass 
matrices  m^  ,  MR  proportional  to  identity  matrix  then  the  following  pattern  emerges 
for  the  masses 


TT-  (20) 

MR 


2 


mVr      ~     m0  -  —-  (21) 

MR 

(22) 

This  leads  to  the  following  prediction  [19] 


(23) 

^     > 


i2 


Thus  the  hierarchy  needed  to  solve  the  solar  and  atmospheric  problems  simultane- 
ously is  automatically  reproduced  in  this  picture.  Moreover,  the  common  degener- 
ate mass  mo  can  be.  writen  as 


(ire) 


Hence  if  the  Ai2  is  in  the  range  appropriate  for  a  solution  to  the  solar  neutrino 
problem  through  the  MSW  mechanism  and  MR  around  the  GUT  scale  then  the 
degenerate  mass  m0  is  automatically  fixed  in  the  eV  range  needed  to  provide  the 
hot  component  in  the  dark  matter. 
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v^uc  jjvjooiuic  pujuicoi  iiu  uiic  C&LNJVC  scueme  is  uimcuiiy  m  generating  &  large 
mixing  needed  to  solve  the  atmospheric  neutrino  problem.  The  mixing  in  the  lepton 
sector  is  correlated  to  mixing  in  the  quark  sector  in  50(10)  type  models.  In  case  of 
the  ordinary  seesaw  model,  one  could  destroy  this  correlation  by  making  the  MR 
[23]  to  be  generation  dependent.  This  does  not  happen  in  the  present  case  since 
mL  and  hence  MR  is  required  to  be  proportional  to  the  identity  matrix.  This  can 
be  avoided  if  one  introduce  extra  singlet  [20]  fermions  or  makes  the  Higgs  structure 
complex  enough  to  destroy  the  interesting  relations  between  femion  masses  existing 
in  the  simple  50(10)  scenario. 

The  above  scheme  predicts  a  non-zero  effective  mass  of  the  O(2  eV)  for  the 
neutrinoless  double  beta  decay  amplitude.  This  is  to  be  contrasted  with  the  latest 
[24]  limit  of  about  0.68  eV.  Thus  even  after  allowing  for  a  factor  of  two  uncer- 
tainty in  the  nuclear  matrix  elements,  the  simplest  model  seems  to  be  ruled  out. 
This  however  does  not  exclude  the  theoretically  interesting  possibility  of  degenerate 
neutrino  spectrum  since  in  principal,  there  could  be  cancellations  among  different 
neutrino  contributions.  Constructing  a  realistic  model  displaying  this  cancellation 
is  however  quite  challanging. 

5.  Conclusions 


We  have  reviewed  the  hints  on  neutrino  masses  following  from  present  experiments 
and  have  discussed  their  implications.  All  the  hints  can  be  simultaneously  satisfied 
within  the  minimal  scenario  only  if  all  the  three  neutrinos  are  almost  degenerate. 
This  seemingly  different  pattern  compared  to  other  fermion  masses  can  be  naturally 
understood  in  the  seesaw  picture  with  additional  discrete  symmetry.  The  present 
models  for  degenerate  masses  do  not  seem  to  be  favoured  by  the  neutrinoless  double 
beta  decay  results  if  the  degenerate  mass  is  to  provide  the  hot  dark  matter.  This 
may  be  a  pointer  to  the  existence  of  more  than  three  light  neutrinos. 
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Topics  in  chiral  perturbation  theory 
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Abstract.  I  consider  some  selected  topics  in  chiral  perturbation  theory  (CHPT).  For 
the  meson  sector,  emphasis  is  put  on  processes  involving  pions  in  the  isospin  zero  S-wave 
which  require  multi-loop  calculations.  The  advantages  and  shortcomings  of  heavy  baryon 
CHPT  are  discussed.  Some  recent  results  on  the  structure  of  the  baryons  are  also  pre- 
sented. 


1.  Introduction 

This  talk  will  be  concerned  with  certain  aspects  of  the  standard  model  in  the  long- 
distance regime.  I  will  argue  that  there  exists  a  rigorous  calculational  scheme  and 
that  plenty  of  interesting  and  fundamental  problems  await  a  solution.  I  hope  this 
will  trigger  further  detailed  studies  of  these  topics  (and  others  which  can  only  be 
mentioned  en  passant). 

Our  starting  point  is  the  observation  that  in  the  three  flavor  sector,  the  QCD 
Hamiltonian  can  be  written  as 


—      /  dzx{muuu  +  rriddd  +  mtss}  (I) 

with  #QCD  symmetric  under  chiral  SU(3)i,x  SU(3)fl.  On  a  typical  hadronic  scale, 
say  Mp  =  770  MeV,  the  current  quark  masses  mq  =  mu,  m^,  m,  can  be  considered 
as  perturbations.  The  chiral  symmetry  of  the  Hamiltonian  is  spontaneously  broken 
down  to  its  vectorial  subgroup  SU(3)v  with  the  occurence  of  eight  (almost)  massless 
pseudoscalar  mesons,  the  Goldstone  bosons  (ip  =  TT+,  TTO,  ir~  ,  K+,  K~  ,  K°,  K°,  rf) 

M$=mqB  +  0(m2q)  (2) 

with  B  =  —  <  0|<7g|0  >  /F%  and  F*  the  pion  decay  constant.  In  the  confine- 
ment (long-distance)  regime,  the  properties  of  the  standard  model  related  to  this 
symmetry  can  be  unambigously  worked  out  in  terms  of  an  effective  Lagrangian, 

CqcD  =  JC^[U,d,U,...,M]  (3) 

with  M.  =  diag(mu,7rt<j)  m,)  the  quark  mass  matrix  and  the  Goldstone  bosons  are 
collected  in  the  matrix-valued  field  U(x)  =  exp{i^:_1  ya(x)Aa/FT}.  Of  course, 
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there  is  an  infinity  of  possibilities  of  representing  the  non-linearly  realized  chiral 
symmetry.  While  the  QCD  Lagrangian  is  formulated  in  terms  of  quark  and  gluon 
fields  and  the  rapid  rise  of  the  strong  coupling  constant  as(Q2}  with  decreasing  Q2 
forbids  a  systematic  perturbative  expansion,  matters  are  different  for  the  effective 
field  theory  (EFT)  based  on  the  effective  Lagrangian  (3).  It  can  be  written  as  a 
string  of  terms  with  increasing  dimension, 


if  one  counts  the  quark  masses  as  energy  squared.  To  lowest  order,  the  effec- 
tive Lagrangian  contains  two  parameters,  Fv  and  B.  It  is  worth  to  stress  that  B 
never  appears  alone  but  only  in  combination  with  the  quark  mass  matrix,  alas  the 
pseudoscalar  meson  masses.  Consequently,  any  matrix-element  <  ME  >  for  the 
interactions  between  the  pseudoscalars  can  be  written  as 


<  ME  >=  c0  (  — 

^        L»=i 

This  is  obviously  an  energy  expansion  or,  more  precisely,  a  simultaneous  expansion 
in  small  external  momenta  and  quark  masses.  The  first  term  on  the  r.h.s.  of  (5) 
leads  to  nothing  but  the  well-known  current  algebra  results,  the  pertinent  coeffi- 
cient CQ  can  be  entirely  expressed  in  terms  of  JFV,  the  Goldstone  masses  and  some 
numerical  constants.  As  one  of  the  most  famous  examples  I  quote  Weinberg's  result 
for  the  S-wave,  isospin  zero  scattering  length  [1], 

ao  =  _M_  (6) 

which  is  such  an  interesting  observable  because  it  vanishes  in  the  chiral  limit  Mx  — •» 
0.  At  next-to-leading  order,  life  is  somewhat  more  complicated.  As  first  shown  by 
Weinberg  [2]  and  discussed  in  detail  by  Gasser  and  Leutwyler  [3],  one  has  to  account 
for  meson  loops  which  are  naturally  generated  by  the  interactions.  These  lead  to 
what  I  called  "non-local"  in  (5).  In  fact,  it  can  be  shown  straightforwardly  that 
any  N-loop  contribution  is  suppressed  with  respect  to  the  leading  order  result  by 
(E/A.)2N .  At  O(E4},  the  loop  contributions  do  not  introduce  any  new  parameters. 
However,  one  also  has  to  account  for  the  contact  terms  of  dimension  four  which 
are  accompanied  by  a  priori  unknown  coupling  constants  (the  GI,-  in  (5)).  These 
so-called  low-energy  constants  serve  to  renormalize  the  infinities  related  to  the 
pion  loops.  Their  finite  pieces  are  then  fixed  from  some  experimental  input.  In 
the  case  of  flavor  SU(2),  one  has  n  =  7.  Two  of  these  constants  are  related  to 
interactions  between  the  pseudoscalars,  three  to  quark  mass  insertions  and  the 
remaining  two  have  to  be  determined  from  current  matrix  elements.  The  inclusion 
of  gauge  boson  couplings  to  the  Goldstone  bosons  is  most  simply  and  economically 
done  in  the  framework  of  external  background  sources.  Notice  also  that  at  order  E4 
the  chiral  anomaly  can  be  unambigously  included  in  the  EFT.  At  order  E6 ,  one  has 
to  consider  loop  diagrams  with  insertions  from  £^  and  £^t  **&  we^  ^  contact  terms 
from  Qff  which  introduces  new  couplings.  Once  the  low-energy  constants  are  fixed, 
the  aspects  of  the  dynamics  of  the  standard  model  related  to  the  chiral  symmetry 
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more  cumbersome  (but  can't  always  be  avoided  as  will  be  discussed  below).  This 
is  the  basic  framework  of  CHPT  in  a  nutshell.  For  more  details,  I  refer  to  refs.[2,3], 
my  review  [4]  and  the  extensive  list  of  references  given  therein.  It  is  worth  pointing 
out  that  Leutwyler  has  recently  given  a  more  sound  foundation  of  the  effective 
Lagrangian  approach  by  relating  it  directly  to  the  pertinent  Ward-Identities  [5]. 

2.  Meson-meson  scattering  and  the  mode  of  quark  condensation 

Pion-pion  and  pion-kaon  scattering  are  the  purest  reactions  between  the  pseu- 
doscalar  Goldstone  bosons.  The  Goldstone  theorem  mandates  that  as  the  energy 
goes  to  zero,  the  interaction  between  the  pseudoscalars  vanishes.  Consequently,  TTTT 
and  ftK  scattering  are  the  optimal  testing  grounds  for  CHPT. 

Let  me  first  consider  the  chiral  expansion  of  the  isospin  zero  S-wave  in  TTTT 
scattering.  In  the  standard  formulation  of  CHPT,  Gasser  and  Leutwyler  have 
derived  a  low-energy  theorem  generalizing  Weinberg's  result  (6)  [6], 

aj  =  0.20  ±0.01  (7) 

which  is  compatible  with  the  data,  ag  =  0.23  ±  0.08  [7].  The  theoretical  value  (7) 
rests  on  the  assumption  that  B  is  large,  i.e.  of  the  order  of  1  GeV  (from  current 
values  of  the  scalar  quark  condensates).  However,  if  B  happens  to  be  small,  say  of 
the  order  of  FT,  one  has  to  generalize  the  CHPT  framework  as  proposed  by  Stern 
et  al.[8].  In  that  case,  the  quark  mass  expansion  of  the  Goldstone  bosons  takes  the 
form 

M*=mqB  +  rntA  +  0(m3q)  (8) 

with  the  second  term  of  comparable  size  to  the  first  one.  In  ref.[9],  this  framework 
is  discussed  in  more  detail  and  a  novel  representation  of  the  TTTT  amplitude  which  is 
exact  including  order  E6  and  allows  to  represent  the  whole  TT.TT  scattering  amplitude 
in  terms  of  the  S-  and  P-waves  and  six  subtraction  constants  is  given.  The  presently 
available  data  are  not  sufficiently  accurate  to  disentangle  these  two  possibilities. 
More  light  might  be  shed  on  this  when  the  ^-factory  DA$NE  will  be  in  operation 
(via  precise  measurements  of  jFC^-decays)  or  if  the  proposed  experiment  to  measure 
the  lifetime  of  pionic  molecules  [10]  will  be  done.  It  should  also  be  pointed  out 
that  recent  lattice  QCD  results  seem  to  be  at  variance  with  the  expansion  (8), 
but  this  can  only  be  considered  as  indicative  [11].  Also,  the  experimentally  well- 
fulfilled  GMO  relation  for  the  pseudoscalar  meson  masses  arises  naturally  in  the 
conventional  CHPT  framework  but  requires  parameter  fine-tuning  in  case  of  a  small 
value  of  B  ~  100  MeV.  Novel  high  precision  experiments  at  low  energies  are  called 
for.  This  is  an  important  question  concerning  our  understanding  of  the  standard 
model  and  it  definitively  should  deserve  more  attention.  For  more  details,.  I  refer 
to  sections  4.1  and  4.2  of  ref.[4]  as  well  as  ref.[9]. 

In  fig.l,  I  show  the  phase-shift  6§  from  threshold  (280  MeV)  to  approximately 
600  MeV  [12].  One  notices  the  rapid  rise  of  the  phase  shift,  and  at  600  MeV  it  is 
already  as  large  as  55  degrees  and  passes  through  90  degrees  at  about  850  MeV. 
At  energies  below  600  MeV,  the  other  partial  waves  do  not  exceed  15  degrees  (in 
magnitude).  This  behaviour  of  8$  is  attributed  to  the  so-called  strong  pionic  final 
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Figure  1.  TT  scattering  phase  shift  6%(s).  The  dashed  line  gives  the  tree  result  and 
the  dashed-dotted  the  one-loop  prediction.  Also  shown  is  the  Roy  equation  band.  The 
data  can  be  traced  back  from  ref.[l2].  The  double-dashed  line  corresponds  to  the  one-loop 
result  based  on  another  definition  of  the  phase-shift  which  differs  at  order  E6  from  the  one 
leading  to  the  dashed-dotted  line  (and  thus  gives  a  measure  of  higher  order  corrections). 
On  the  right  side  of  the  hatched  area,  the  one-loop  corrections  exceed  50  per  cent  of  the 
tree  result. 
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phase  of  the  CP-violation  parameter  e'[12], 

=  (45  ±  6)°  (9) 

This  is  due  to  the  fact  that  the  corrections  to  6$  are  of  the  same  sign  as  the  ones 
to  <5§  and  thus  cancel.  At  tree  level,  <3>(e')  =  37°.  The  accuracy  of  the  theoretical 
prediction  is  as  good  as  the  resent  empirical  one,  ^(e^exp  =  (43  ±  8)°  [7].  Notice 
that  it  is  much  more  difficult  to  get  a  precise  number  on  $(f')  from  K  — *  2?r  decays 
because  of  the  variety  of  isospin  breaking  effects  one  has  to  account  for  (this  theme 
is  touched  upon  in  ref.[12]). 

I  briefly  turn  to  the  case  of  nK  scattering.  Here,  the  empirical  situation  is  even 
worse,  which  is  very  unfortunate.  In  the  framework  of  conventional  CHPT,  the 
threshold  behaviour  of  the  low  partial  waves  can  be  unambigously  predicted  [13] 
since  all  low-energy  constants  in  SU(3)  are  fixed.  Furthermore,  since  the  mass  of 
the  strange  quark  is  of  the  order  of  the  QCD  scale-parameter,  it  is  less  obvious  that 
the  chiral  expansion  at  next-to-leading  order  will  be  sufficiently  accurate.  Much 
improved  empirical  information  of  these  threshold  parameters  might  therefore  lead 
to  a  better  understanding  of  the  three  flavor  CHPT.  Another  possibility  is  that  the 
threshold  of  xK  scattering  at  635  MeV  is  alreday  so  high  that  one  has  to  connect 
CHPT  constraints  with  dispersion  theory.  This  concept  has  investigated  in  detail 
by  Dobado  and  Pelaez  [14]  and  certainly  improves  the  prediction  in  the  P-wave 
drastically.  Another  way  of  extending  the  EFT  through  the  implicit  inclusion  of 
resonance  degrees  of  freedom  is  discussed  in  ref.[62].  On  the  experimental  side,  a 
measurement  of  rrK  molecule  decays  would  certainly  help  to  clarify  the  situation 
[11]. 

3.  Two  loops  and  beyond 

The  simplest  object  to  study  in  detail  the  strong  pionic  final  state  interactions  in 
the  isospin  zero  S-wave  is  a  three-point  function,  namely  the  so-called  scalar  form 
factor  (ff)  of  the  pion, 

<  ira(pV(p)|m(«tz  +  <W)|0  >=  Sab  T,(s)M^  (10) 

with  s  =  (p'  +p)2.  To  one  loop  order,  the  scalar  ff  F^i2(s)  has  been  given  in  ref.[3]. 
As  shown  in  fig. 2,  closely  about  the  two-pion  cut,  the  real  as  well  as  the  imaginary 
part  of  the  one  loop  representation  are  at  variance  with  the  empirical  information 
obtained  from  a  dispersion-theoretical  analysis  [15].  However,  unitarity  allows  one 
to  write  down  a  two-loop  representation  [16], 


S- 

*   ./4 


(11) 

4M  •»  s  ~        . 

where  TQ  2  and  TQ  4  are  the  tree  and  one  loop  representations  of  the  TTT  S-wave, 
isospin  zero  scattering  matrix.    Notice  that  the  imaginary  part  of  rT(s)  to  two 
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loops  is  entirely  given  in  terms  ot  known  one  loop  amplitudes.  The  three  subtrac- 
tion constants  appearing  in  (11)  can  be  fixed  from  the  empirical  knowledge  of  the 
normalization,  the  slope  and  the  curvature  of  the  scalar  ff  at  the  origin.  In  the  chi- 
ral  expansion,  these  numbers  are  combinations  of  two  low-energy  constants  from 
£  and  two  from  £ . 
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Figure  2.  Scalar  form  factor  of  the  pion.  The  curves  labelled  T,  '2',  'O'  and  'B' 
correspond  to  the  chiral  prediction  to  one-loop,  to  two-loops,  the  modified  Omnes  repre- 
sentation and  the  result  of  the  dispersive  analysis,  respectively.  The  real  part  is  shown  in 
(a)  and  the  imaginary  part  in  (b). 

The  turnover  of  the  scalar  ff  at  around  550  MeV  can  be  understood  if  one 
rewrites  (11)  in  an  exponential  form, 


Rerr(s)  =  P(s)exp[ReA0(s)]  cos  6°  +  O(E6) 


(12) 


with  Im  A0(s)  =  6J  +  O(E6)  fulfilling  the  final-state  theorem  at  next-to-leading 
order.  Although  this  representation  is  not  unique,  it  allows  to  understand  the 
vanishing  of  ReFT(s)  at  680  MeV  since  the  phase  (in  the  loop  approximation) 
passes  through  90°  at  this  energy  thus  forcing  the  turnover.  Expanding  cos  6g  = 
l-(^o)  +•  •  •  =  l  +  O(s2/F2}  it  becomes  clear  why  this  behaviour  can  only  show  up 
at  two  loop  order  (and  higher).  One  can  do  even  better  and  sum  up  all  leading  and 
next-to-leading  logarithms  by  means  of  an  Omnes  representation  [16].  This  leads  to 


IQfi 
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a  further  improvement  in  RerV(s)  and  allows  to  understand  that  the  very  accurate 
two  loop  result  for  ImF^s)  is  not  spoiled  by  higher  orders,  these  can  be  estimated 
from  the  improved  chiral  expansion  of  the  scalar  ff  and  are  found  to  be  small  below 
550  MeV.  The  physics  behind  all  this  is  that  the  two-loop  corrections  lead  to  the 
two-pion  cut  with  proper  strength  which  dominates  the  scalar  ff  below  600  MeV. 
To  go  further  one  would  have  to  include  inelasticities  (which  start  at  order  E8),  in 
particular  the  strong  coupling  to  the  KK  channel.  It  is  also  worth  pointing  out 
that  the  scalar  ff  can  only  be  represented  by  a  polynomial  below  s  =  4M^.  Notice 
that  in  this  energy  range  the  normalized  scalar  ff  varies  from  1  to  1.4,  signaling 
a  large  scalar  radius  of  the  pion.  For  comparison,  the  vector  ff  changes  from  1  to 
1.15  for  0  <  s  <  4M%.  In  this  way,  unitarity  allows  to  extend  the  range  of  CHPT, 
however,  one  has  to  be  able  to  fix  the  pertinent  subtraction  constants  (which  is  the 
equivalent  to  determining  the  corresponding  low-energy  constants). 

Another  reaction  which  has  attracted  much  attention  recently  is  77  —  *•  7r°7r°  in 
the  threshold  region.  It  belongs  to  the  rare  class  of  processes  which  are  vanishing 
at  tree  level  (since  the  photon  can  only  couple  to  charged  pions,  one  needs  at  least 
one  loop)  and  do  not  involve  any  of  the  low-energy  couplings  from  £^  at  one  loop 
order.  Some  years  ago,  Bijnens  and  Cornet  [17]  and  Donoghue,  Holstein  and  Lin 
[18]  calculated  the  one-loop  cross  section  and  found  that  it  is  at  variance  with  the 
Crystal  ball  data  [19]  even  close  to  threshold  (see  fig.3), 

This  is  another  case  where  one  has  to  account  for  the  strong  pionic  final  state 
interactions.  At  400  MeV,  one  has 


=1-3  (13) 


which  is  a  typical  correction  in  this  channel  (see  discussion  above  on  a°  and  the 
scalar  ff).  In  fact,  dispersion  theoretical  calculations  supplemented  with  current 
algebra  constraints  by  Pennington  [20]  tend  to  give  the  trend  of  the  data  (see  the 
shaded  area  in  fig.3).  An  improved  combination  of  chiral  machinery  and  dispersion 
theory  has  been  given  by  Donoghue  and  Holstein  [21].  Even  better,  Bellucci,  Gasser 
and  Sainio  [22]  have  performed  a  full  two  loop  calculation.  It  involves  some  massive 
algebra  and  three  new  low-energy  constants  have  been  estimated  from  resonance 
exchange  (the  main  contribution  comes  form  the  w).  These  couplings  play,  however, 
no  role  below  400  MeV.  The  solid  line  in  fig.3  shows  the  two-loop  result  for  the 
central  values  of  the  coupling  constants.  One  finds  a  good  agreement  with  the 
data  up  to  Err  =  700  MeV.  This  resolves  the  long-standing  discrepancy  between 
the  chiral  prediction  and  the  data  in  the  threshold  region.  For  a  more  detailed 
discussion  of  these  topics  and  the  related  neutral  pion  polarizabilities,  I  refer  to 
ref.[22]. 

The  last  topic  I  briefly  want  to  mention  is  the  radiative  kaon  decay  KL  —+  7r°77 
which  has  no  tree-level  contribution  and  is  given  by  a  finite  one-loop  calculation  at 
order  EA  [23].  The  predicted  two-photon  invariant  mass  spectrum  turned  out  to  be 
in  amazing  agreement  with  the  later  measurements  [24].  However,  the  branching 
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Figure  3.  Cross  section  for  77  — •»•  T°T°.  The  chiral  one  and  two  loop  predictions  are  given 
by  the  dotted  and  the  solid  line,  in  order.  The  hatched  area  is  a  dispersion-theoretical  fit. 
The  Crystal  ball  data  are  also  shown.  From  [22]. 


4.  Inclusion  of  baryons 

While  the  chiral  Lagrangian  is  particularly  suited  to  investigate  the  properties  of 
the  pseudoscalar  mesons,  it  can  also  be  used  to  gain  insight  into  the  structure  of 
the  low-lying  baryons.  I  will  be  brief  on  the  formal  aspects  but  rather  refer  to  the 
reviews  [4,27]  and  the  extensive  papers  by  Gasser,  Sainio  and  Svarc  [28]  and  Krause 
[29]. 

Let  me  first  restrict  myself  to  the  two  flavor  sector,  the  pion-nucleon  system. 
To  lowest  order  O(E},  the  effective  Lagrangian  takes  the  form 

£*N  =  VfaD1*  -  m  +       ATjiTsM")*  (14) 


with  Dp  the  covariant  derivative,  m  the  nucleon  mass  (in  the  chiral  limit),  g& 
the  axial-vector  coupling  constant  (in  the  chiral  limit)  and  u^  —  iu^V^Uu^  ,  u  = 
\/t7  and  #  embodies  the  proton  and  neutron  fields.  The  physics  becomes  most 
transparent  if  one  expands  the  various  terms  in  powers  of  the  pion  and  external 
fields  (like  e.g.  the  photon).  The  vectorial  coupling  includes  e.g.  the  photon- 
nucleon  vertex  and  the  two-pion  seagull  ("Weinberg  term")  whereas  the  axial- 
vector  term  in  (14)  leads  to  the  pseudovector  irN  coupling  and  the  famous  Kroll- 
Rudermann  vertex  (among  others).  The  presence  of  the  nucleon  mass  term,  which  is 
of  comparable  size  to  the  chiral  symmetry  breaking  scale,  does  not  allow  the  nucleon 
four-momentum  to  be  treated  as  small.  This  spoils  the  one-to-one  correspondence 
between  the  loop  and  the  energy  expansion  (for  more  details  see  refs.[27,28]).  As 
pointed  out  in  particular  by  Jenkins  and  Manohar  [30],  heavy  quark  EFT  methods 
help  to  overcome  this  problem.  Consider  the  nucleon  as  a  very  heavy,  static  source, 
i.e.  non-relativistically.  In  that  case,  one  can  write  its  four-momentum  as 

Pn  =  mvp  -f  l^  (15) 

with  v^  the  four-velocity  (v2  =  1)  and  l^  a  small  off-shell  momemtum,  v  •  /  <C  m. 
One  can  therefore  write  the  nucleon  wave  function  in  terms  of  velocity  eigenstates, 

V  =  exp{-imv>x}(H  +  h)  (16) 

with  i/H  =  H  and  i/h  —  —h  (notice  that  I  have  interchanged  the  H  and  the  h  in 
comparison  to  the  standard  (funny)  notation).  If  one  now  eliminates  h  by  use  of 
its  equation  of  motion  (or  by  a  Foldy-Wouthuysen  transformation),  one  finds  (for 
details,  see  e.g.  ref.[31]) 

CvN  =  H(iv>D  +  gAu-S)H  +  O(l/m)  (17) 

with  Sft  the  covariant  spin-operator,  5P  =  i^crlivvv  /2,  known  to  our  anchestors 
also  as  the  Pauli-Lubanski  vector.  The  cumbersome  baryon  mass  term  has  disap- 
peared and  thus  a  consistent  power  counting  emerges  (this  is  discussed  very  nicely 
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terms  of  t;^  and  S^  thus  faciliating  the  algebra  enormeously.  The  one  loop  graphs 
contribute  at  order  E3.  As  will  be  discussed  below,  it  is  however  mandatory  to 
include  the  terms  of  order  .  E4  in  the  effective  Lagrangian  for  accurate  one-loop 
calculations,  i.e. 


(18) 


where  I  have  not  exhibited  the  meson  Lagrangian  discussed  before.  The  coefficients 
accompanying  Qft  are  all  finite  since  the  loops  start  to  contribute  at  order  E3.  A 
complete  analysis  of  the  divergence  structure  at  order  E3  will  soon  be  available  [33]. 
A  systematic  analysis  of  nucleon  properties  to  order  E3  can  be  found  in  [31]  and 
some  E4  calculations  have  recently  been  performed.  I  will  discuss  one  particular 
case  in  some  more  detail  below.  I  will  also  elaborate  on  two  yet  unsolved  problems 
in  the  heavy  mass  approach,  one  is  related  to  the  analytical  structure  of  S-matrix 
elements  (which  does  not  appear  in  the  relativistic  formulation,  see  section  8)  and 
the  other  is  the  extension  to  flavor  SU(3)  and  the  inclusion  of  decuplet  fields  (see 
sections  6  and  7). 

5.  Baryon  compton  scattering 

Compton  scattering  off  the  nucleon  at  low  energies  offers  important  information 
about  the  structure  of  these  particles  in  the  non-perturbative  regime  of  QCD.  The 
spin-averaged  forward  scattering  amplitude  for  real  photons  in  the  nucleon  rest 
frame  can  be  expanded  as  a  power  series  in  the  photon  energy  u, 

T(u)  =  fl(uz)c}.ci,     /1(u,2)  =  a0-fa1u;2-fa2u,4  +  ...  (19) 

where  QJ  are  the  polarization  vectors  of  the  initial  and  final  photon,  respectively, 
and  due  to  crossing  symmetry  only  even  powers  of  cj  occur.  The  Taylor  coefficients 
a,-  encode  the  information  about  the  nucleon  structure.  The  first  term  in  eq.(19), 
flo  =  —  e2Z2/47rm,  dominates  as  the  photon  energy  approaches  zero,  it  is  only 
sensitive  to  the  charge  Z  and  the  mass  m  of  the  particle  the  photon  scatters  off 
(the  Thomson  limit).  The  term  quadratic  in  the  energy  is  equal  to  the  sum  of 
the  so-called  electric  (a)  and  magnetic  (/?)  Compton  polarizabilities,  ai  =  a  +  J3. 
Corrections  of  higher  order  in  u>  start  out  with  the  term  proportional  to  02-  Over  the 
last  years,  high  precision  measurements  at  Mainz,  Illinois,  Oak  Ridge  and  Saskatoon 
[34]  have  lead  to  the  following  empirical  values:  Oj,  =  (10.4  ±  0.6)  •  10~4fm3  ,  0P  = 
(S.STO^-lO-^m3,^  =  (12.3±1.3)-K)-4fm3,/?n  =  (3.5  T  1-3)  -lO^fm3  making 
use  of  the  dispersion  sum  rules  [35]  (a+/?)p  =  (14.  2  ±0.3)  -10~4  fm3  and  («  +  /?)„  = 
(15.8±0.5)j  10~4  fm3.  The  two  outstanding  features  of  these  numbers  are  the  fact 
that  (a  4-  0}P  ~  (a  +  /3)n  and  that  the  proton  as  well  as  the  neutron  behave 
essentially  as  (induced)  electric  dipoles. 

In  CHPT,  it  was  found  that  to  leading  order  in  the  chiral  expansion  the  nucleon 
em  polarizabilities  are  given  by  a  few  one  loop  diagrams  (whose  sum  is  finite) 
[36]  with  no  counter  term  contributions  (much  like  the  reactions  KL  —  »•  7r°77  and 
77  —  »•  T°ir0  discussed  before).  This  calculation  was  later  redone  in  the  heavy  mass 
approach  [31].  The  pertinent  diagrams  are  shown  in  fig.4. 
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Figure  4.  One-loop  diagrams  which  lead  to  the  nucleon  em  polarizabilities  (20). 

By  isospin  arguments,  one  finds  that  they  will  lead  to  the  same  polarizabilities 
for  the  proton  and  the  neutron.  The  resulting  expressions  for  apitl  and  J3p>n  contain 

therefore  only  parameters  from  the  lowest  order  effective  Lagrangian 
[31,36], 
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In  the  chiral  limit,  the  em  polarizabilities  diverge.  This  is  expected  since  the  Yukawa 
suppression  for  massive  pions  turns  into  a  long-range  power-law  fall-off  as  M*  —  *• 
0.  Clearly,  the  leading  order  CHPT  results  (20)  explain  the  trends  of  the  data. 
However,  one  might  argue  that  the  result  for  the  magnetic  polarizabilities  is  not 
very  meaningful  since  one  has  not  accounted  for  the  strong  NA  Ml  transition. 
In  fact,  this  starts  to  contribute  at  order  E4  (and  higher)  in  agreement  with  the 
decoupling  theorem  [37].  In  ref.[38],  a  complete  calculation  of  the  em  polarizabilities 
to  O(E4)  was  given.  At  this  order,  contact  terms  from  £;.^r  en*er-  Some  of  them 
can  be  directly  related  to  empirical  information  (viz  TrJV  scattering  [39]),  others 
are  estimated  from  resonance  exchange,  and  this  is  where  the  A(1232)  comes  in. 
It  should  be  stressed  that  the  resonance  saturation  hypothesis  to  understand  the 
values  of  the  low-energy  constants  has  only  strictly  been  tested  in  the  meson  sector 
[40].  In  the  absence  of  sufficiently  many  accurate  low-energy  data,  it  serves  as  a 
working  hypothesis  in  the  baryon  sector  (this  situation  will  be  improved  when  more 
data  will  become  available).  The  em  polarizabilities  to  order  E4  take  the  form 


(a, 


+  C2  In  MT  +  C3 


(21) 


with  C\  =  5e2<724/3847T2JF12.  The  coefficient  Cz  contains  some  low-energy  constants 
from  C^  and  63  four  novel  ones  from  £^.  The  second  and  the  third  term  in  (21) 
are  the  new  E4  contributions.  The  A(1232)  strongly  dominates  the  constant  Cs. 
The  non-analytic  loop  contribution  ~  In  MT  is  potentially  large.  Indeed,  in  the 
case  of  J3P  the  In  contribution  is  negative  with  a  large  coefficient  and  cancels  most 
of  the  large  positive  one  related  to  the  A(1232)  exchange.  In  ref.[41],  a  thorough 
study  of  the  theoretical  uncertainties  entering  the  E4  calculation  was  performed 
and  the  following  results  emerged 


ap  =  10.5  ±  2.0  ,  A,  =  3.5  ±  3.6  ,  an  =  13.4  ±  1.5  ,  0n  =  7.8  ±  3.6 , 


(22) 


IYI  o_    T 


1Q3 


In  fig.5,  1  show  the  real  part  of  the  proton  forward  Compton  scattering  ampli- 
tude Ap(u>)  =  —  4T/i(u;2)  in  comparison  to  the  data  which  follow  from  the  total 
photonucleon  absorption  cross  section  via 


(23) 

uo         u-u> 

The  amplitude  has  a  branch  point  related  to  the  threshold  energy  for  single  pion 
photoproduction  at 

wo  =  AMl  +  ^)  (24) 

Notice  that  to  order  Ez  the  chiral  representation  [31]  has  a  cut  starting  at  u>o  =  Mr 
and  only  at  O(E*)  one  recovers  the  recoil  correction  ~  1/m  in  (24).  This  problem 
is  inherent  to  the  heavy  mass  formulation  of  baryon  CHPT.  In  the  relativistic 
formulation,  the  corresponding  analytic  structures  (location  of  cut  singularities) 
are  always  given  correctly  and  do  not  depend  on  the  order  of  the  chiral  expansion. 
I  will  pick  up  this  theme  in  section  8.  Fig.5  also  shows  that  the  E4  result  for  Ap(u] 
reproduces  the  cusp  at  w0.  However,  above  that  energy,  the  chiral  prediction  is  at 
variance  with  the  data.  This  can  be  traced  back  to  the  fact  that  the  imaginary 
part  changes  sign  at  u  ~  180  MeV.  One  would  have  to  go  to  two  loops  to  get  an 
improved  prediction  for  the  imaginary  part  as  alreday  stressed  in  section  3. 

It  is  fairly  straightforward  to  extend  the  lowest  order  E3  calculation  to  the  three 
flavor  sector.  This  allows  to  predict  the  hyperon  polarizabilities  which  eventually 
will  be  measured  at  Fermilab  and  CERN  via  the  Primakoff  effect.  In  SU(3),  one  has 
two  axial  couplings  and  thus  the  lowest  order  effective  meson-baryon  Lagrangian 
reads 

F  Tr  (££>>,,,£])  (25) 


where  U(x]  now  contains  the  eight  pseudoscalar  fields  (TT,  K,  rj)  and  B  is  a  3  x  3 
matrix  containing  the  low-lying  baryon  octet, 


B= 


(26) 


The  numerical  values  for  the  two  axial  couplings  D  and  F  are  D  ~  3/4  and  F  ~  1/2. 
subject  to  the  constraint  D  +  F  =  gA  =  1.26.  The  hyperon  polarizabilities  can  be 
calculated  from  the  diagrams  in  fig.4  with  ir  and  K  loops  and  one  finds  e.g.  [42] 

a£+  =  9,  (27) 


(in  canonical  units),  i.e.  the  £+  is  expected  to  have  a  larger  electric  polarizability 
then  the  E~  due  to  the  kaon  loop  contributions.   Quark  model  estimates  give  a 
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igure  5.  Forward  spin-averaged  Compton  amplitude  for  the  proton  in  comparison  to 
ie  data,  (a)  0  <  w  <  140  MeV  and  (b)  140  <  w  <  210  MeV. 
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similar  pattern  [43].  The  S+  is  made  of  w  and  s  quarks  (which  have  opposite 
charges)  and  this  allows  for  internal  electric  dipole  excitations.  In  contrast,  the 
charge-like  d  and  »  quarks  in  the  E~  tend  to  hinder  such  excitations  leading  to 
a  small  electric  polarizability.  The  numbers  given  in  (27)  should  be  considered  as 
first  estimates  since  a  complete  E4  calculation  in  SU(3)  has  yet  to  be  performed.  I 
will  now  take  a  critical  look  at  the  status  of  three  flavor  baryon  CHPT. 

6.  Bat  yon  masses  and  cr-terms 

The  simplest  observbles  to  investigate  in  flavor  SU(3)  are  the  baryon  masses 
mA>  r^io  ™2  and  the  three  proton  er-terms  defined  via 


=    m  <  p'|««  +  dd\p  > 

|p>  (28) 


f 

VKifW    =     2^  +  m*)  <  P'l  -  ««  +  2<W  +  ss\p  > 

with  t  =  (p1  —  p)2  the  invariant  momentum  transfer  squared  and  m  =  (mu  +  m<f  )/2 
the  average  light  quark  mass.  At  zero  momentum  transfer,  the  strange  quark 
contribution  to  the  proton  mass  is  given  by  [44] 


m,  <  p\ss\p  >=  (1  -  jKjf.)  [3<r«,(0)  +  •&(<>)]  +  (| 


making  use  of  the  leading  order  meson  mass  formulae  M%  =  2mB  and 

(rn-f  ms)B.  This  defines  the  scalar  sector  of  baryon  CHPT.  To  calculate  the  mass 

spectrum  to  order  J53,  we  need  the  symmetry  breaking  terms  from 

4$B  =  ^  ?i(B{x+,B})  +  bF  Tr(%+,B])  +  &0  Tr(5B)Tr(x+)  (30) 

with  x+  =  u^xu^  +  wx^w  and  x  —  2B(M+S)  where  S  denotes  the  nonet  of  external 
scalar  sources.  The  constants  6jo,  bp  and  60  can  be  fixed  from  the  knowledge  of  the 
baryon  masses  and  the  wN  <r-term  (or  one  of  the  KN  cr-terms).  The  constant  bo 
can  not  be  determined  from  the  baryon  mass  spectrum  alone  since  it  contributes 
to  all  octet  members  in  the  same  way.  To  this  order  in  the  chiral  expansion,  any 
baryon  mass  takes  the  form  [44,45] 


The  first  term  on  the  right  hand  side  of  (31)  is  the  average  octet  mass  in  the  chiral 
limit,  the  second  one  comprises  the  Goldstone  boson  loop  contributions  and  the 
third  term  stems  from  the  counter  terms  in  (30)  ("resonance  physics").  Notice  that 
the  loop  contribution  is  ultraviolet  finite  and  non-analytic  in  the  quark  masses  since 

0/2 

Mj  ~  m/  .  The  constants  ip,  6p  and  feo  are  therefore  finite.  A  typical  result  at 
O(E3}  from  a  least-square  fit  to  mjv,  TOA,  ms,  m=  and  <rTjv(0)  =  45  Mev  [46]  is 
[44] 

mN    =    (0.965  -0.018  -0.264  +  0.248)  GeV  =  0.936  GeV 
-  n.RRft  4.  n  ?4.r>  rw  -^  ^ 


terms.  j.u  imvc  a,  mure  weu—  uenaveu  cmrai  expansion,  one  migni  warn  to  include 
the  low-lying  decuplet  baryons  as  will  be  discussed  below.  At  this  order  and  within 
the  accuracy  of  the  E3  calculation,  the  KN  <r-terms  turn  out  to  be 

<r£k(0)  ~  200  ±  50  MeV,     <r^(0)  ~  140  ±  40  MeV  (33) 

which  is  comparable  to  the  first  order  perturbation  theory  analysis  having  no 
strange  quarks,  <r£J,(0)  =  205  MeV  and  <r^(0)  =  63  MeV  [47].  At  present, 
the  KN  <r-terms  are  not  well  determined.  Since  most  of  the  phase  shift  data  stem 
from  kaon-nucleus  scattering,  it  is  of  advantage  to  define  them  in  terms  of  nuclear 
isospin,  ff'KN  =  (3<r^  -f  0tf  ]^)/4  and  <r'^N  =  (a^N  -  cr^N)/2.  The  best  determi- 
nations available  gives  ff'KN(0)  =  599  ±  377  MeV  and  ^^(0)  =  87  ±  66  MeV  [48]. 
This  translates  into  <r^(0)  =  469  ±  390  MeV  and  <r$N(Q)  =  643  ±  378  MeV.  Let 
me  finish  this  section  with  a  remark  on  the  calculation  of  mjv  in  a  recent  lattice 
simulation  in  quenched  baryon  CHPT  [49]  (which  is  discussed  in  some  detail  by 
Golterman  in  these  proceedings), 
mCHPT  _  (o.97- 


(34) 

where  LFIT  denotes  the  fit  to  the  lattice  data.  It  is  most  significant  that  the 
negative  curvature  due  to  the  M%  term  from  quenched  CHPT  and  from  the  lattice 
fit  are  of  the  same  magnitude.  There  is,  however,  a  problem  at  small  M*  .  The 
lattice  data  give  a  too  large  pion  mass  so  that  one  has  to  plot  m^-  versus  M*  and 
interpolate  to  the  physical  pion  mass.  On  finds  a  hook  at  the  lower  end  of  this 
curve  which  sheds  some  doubts  on  the  accuracy  of  the  recently  reported  results  by 
Butler  et  al.  [63].  For  more  details  on  this,  see  Golterman  [64]. 

7.  Inclusion  of  the  decouplet  in  the  EFT 

The  low-lying  decuplet  is  only  separated  by  A  =  231  MeV  from  the  octet  baryons, 
which  is  just  (5/2)  F*  (notice  that  I  have  not  used  the  conventional  argument 
A  =  (5/3)MT  since  the  splitting  stays  finite  in  the  chiral  limit  much  like  F*  and  not 
at  all  like  M*)  and  considerably  smaller  than  the  kaon  or  eta  mass.  One  therefore 
expects  the  excitations  of  these  resonances  to  play  an  important  role  even  at  low 
energies.  This  is  also  backed  by  phenomenological  models  of  the  nucleon  in  which 
the  A(1232)  excitations  play  an  important  role.  In  addition,  there  are  large  NC 
arguments  [50]  which,  however,  have  to  be  taken  cum  grano  salts  since  the  chiral 
and  infinite  number  of  colors  limites  do  not  commute.  In  the  meson  sector,  the 
first  resonances  are  the  vector  mesons  p  and  u>  at  about  800  MeV,  i.e.  they  are 
considerably  heavier  than  the  Goldstone  bosons.  However,  it  is  not  only  the  small 
octet-decuplet  splitting  which  plays  a  role.  One  should  also  notice  that  the  A(1232) 
coupling  to  the  wN  system  is  very  large,  g*&N  ~  %9irNN  with  g*NN  —  13.5.  Sim- 
ilarly, the  7AAT  coupling  is  very  strong.  Would  the  A(1232)  (or  the  decuplet)  be 
weakly  coupled  to  the  nucleon  (octet),  its  role  would  be  very  different.  It  was  there- 
fore argued  by  Jenkins  and  Manohar  [51]  to  include  the  spin-3/2  decuplet  in  the 
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at  lowest  order  reads 

CMBT  =  -iT*  v  -  V  T^.+  A  7*%  +"£(f  %fl  -I-  Bt 

A 

where  we  have  suppressed  the  flavor  SU(3)  indices.  For  an  explicit  expressi 
ref.[65].  Notic  that  there  is  a  remaining  mass  dependence  which  comes  frc 
average  decuplet-octet  splitting  A  which  does  not  vanish  in  the  chiral  limil 
constant  C  is  fixed  from  the  decay  A  — »•  Nv  or  the  average  of  some  strong  de 
decays,  \C\  —  1.5...  1.9.  The  decuplet  propagator  carries  the  information 
the  mass  splitting  A  and  reads 


t;  •  /  -  A  +  ie 

with  P^  a  projector  (for  a  review,  see  ref.[52]).  The  appearance  of  the  maa 
ting  A  spoils  the  exact  one-to-one  correspondence  between  the  loop  and  low- 
expansion.  The  two  scales  F*  and  A  which  are  both  non- vanishing  in  the 
limit  enter  the  loop  calculations  and  they  can  combine  in  the  form  (A/jFV)' 
breakdown  of  the  consistent  chiral  counting  in  the  presence  of  the  decuplet 
in  the  loop  contribution  to  the  baryon  mass.  The  loop  diagrams  with  inte 
ate  decuplets  states  which  naively  count  as  order  E4  renormalize  the  averag< 
baryon  mass  even  in  the  chiral  limit  by  an  infinite  amount.  Therefore  one 
add  a  counter  term  of  chiral  power  E°  to  keep  the  value  mo  fixed 

S£$B     =     -<^"»oTr(55) 


,  10C2A3r  1    ,     2A      5, 

6mo  =   1"*T  L  +  T6^(InX-6> 


Ad~4  r  i      i 

L    =     —^  - — -  +  _(7E^ln47r- 
IDTT^  [a  —  4       2 

with  A  the  scale  introduced   in  dimensional  regularization  and  JE   the 
Mascheroni  constant.  This  mass  shift  is  similar  to  the  one  in  the  relativistic  ^ 
of  pion-nucleon  CHPT,  where  the  non-vanishing  nucleon  mass  in  the  chira 
leads  to  the  same  kind  of  complications  [28].  The  inclusion  of  the  decuplei 
has  three  effects  on  the  mass  formulae  (31).  First  there  is  an  infinite  loop 
bution  with  decuplet  intermediate  states  and,  second,  an  infinite  renormal 
of  the  order  E2  of  the  low-energy  constants  6p,  6jp  and  bQ  plus  a  finite  contri 
which  starts  out  at  O(E4}.  The  constants  6p,  bp  and  bQ  have  to  be  renorn 
as  follows: 


where  the  finite  pieces  b$  FiQ(\)  are  then  determined  by  the  fitting  procedur 
explicit  form  of  the  decuplet  contributions  to  the  octet  masses  can  be  fo 
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tadpole  diagrams  with  insertions  from  CjB  have  also  been  included  but  that  does 
not  alter  these  conclusions).  As  already  stressed  a  couple  of  times,  a  complete  E4 
calculation  should  be  performed.  For  doing  that,  it  might  be  easier  to  use  the 
decuplet  to  estimate  some  low-energy  constants  rather  than  taking  it  as  dynamical 
dof  's  in  the  EFT.  The  role  of  the  decuplet  contributions  has  also  been  critically 
examined  by  Luty  and  White  [53]. 


Table  1.  Results  of  the  calculation  including  the  full  decuplet  intermediate  states.  The 
values  of  D,  F,  A  and  C  are  input.  All  dimensionful  numbers  are  in  MeV.  Empiri- 
cally, A(TTjv  =  15  MeV  and  S?  =  ma  <  p\ss\p  >=  130  MeV  [46]. 
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8.  Spectral  distribution  of  the  nucleon  form  factors 

When  discussing  the  forward  Compton  amplitude,  I  mentioned  that  the  correspond- 
ing branch  point  related  to  the  one-pion  threshold  has  itself  a  1/m  expansion  in  the 
heavy  mass  formulation  which  disturbs  the  analytical  structure  of  the  amplitude. 
To  take  a  closer  look  at  this  problem,  let  us  consider  the  chiral  expansion  of  the 
so-called  (isovector)  Pauli  form  factor  F^(t).  It  is  defined  by  the  matrix-element 
of  the  isovector- vector  quark  current, 


7, 


2m 


(39) 


with  k  =  p'  —  p  and  t  =  Jk2.  As  first  observed  by  Frazer  and  Fulco  [54]  and  discussed 
in  detail  by  Hohler  and  Pietarinen  [55]  the  imaginary  part  of  F^  (<)  exhibits  a  strong 
enhancement  very  close  to  threshold  (t  =  4M^)  as  shown  in  fig.6a.  The  imaginary 
part  of  the  isovector  nucleon  form  factors  inherit  the  singularity  on  the  second  sheet 
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due  to  the  projection  of  the  Born  term  (at  <0  =  4M2(1  -  M2/4m2)  =  3.98M*)  in 
TrN  scattering  (from  diagrams  of  the  type  7  —  +•  TTT  —  »•  NN). 

Let  us  first  consider  the  chiral  expansion  of  Im  F%(t)  in  the  relativistic  formu- 
lation of  baryon  CHPT.  Following  Gasser  et  al.  [28],  one  has 

ImF?(t)  =  Mm4  4ImT4(<)  +  Imr4(ol  (40) 


where  the  loop  functions  and  their  imaginary  parts  74  and  F4  are  given  in  ref.[28]. 
For  our  purpose,  we  only  need  Im  74  (i)  since  its  threshold  is  the  two-pion  cut 
whereas  Im  T^t)  only  starts  to  contribute  at  t  —  4m2.  The  resulting  imaginary 
part  for  Im  Fi(t)/t2  is  shown  in  fig.6b  (solid  line).  One  sees  that  the  strong  in- 
crease at  threshold  is  reproduced  (see  also  the  remarks  in  ref.[28])  since  the  chiral 
representation  of  Im  74(2)  indeed  has  the  proper  analytical  structure,  i.e.  the  sin- 
gularity on  the  second  sheet  at  <0-  The  chiral  representation  of  Im  F%(t)/t2  does 
not  stay  constant  on  the  left  shoulder  of  the  />-resonance  but  rather  drops.  This 
is  due  to  the  fact  that  in  the  one  loop  approximation,  one  is  only  sensitive  to  the 
first  term  in  the  chiral  expansion  of  the  pion  charge  form  factor  F%(i).  In  fig.Ga  I 
also  show  calculations  with  F?(t)  =  1  (dashed  line)  and  F?(t}  -  1+  <  r\  >  </6 
(dash-dotted  line).  These  curves  resemble  very  much  the  chiral  expansion.  To 
reiterate,  this  particular  example  shows  that  in  the  relativistic  version  of  baryon 
CHPT  the  pertinent  analytical  structures  of  current  and  S-matrix  elements  are 
given  correctly. 

Let  us  now  consider  the  heavy  mass  approach.   The  corresponding  imaginary 
part  follows  from  ref.[31], 

(41) 

Here,  the  imaginary  part  comes  form  a  ln(2Mr  —  %/f)  which  has  a  branch  point  at 
t  =  4M2  (chiefly  because  to  lowest  order  in  the  1/m  expansion  the  threshold  energy 
of  TJV  scattering  is  w0  =  MT  [39]  and  the  corresponding  left-handed  cut  starts 
there).  This  also  leads  to  an  enhancement  of  the  imaginary  part  of  F%(i)  as  shown 
by  the  dashed  line  in  fig.6b.  The  enhancement  is  stronger  than  in  the  relativistic 
case.  Stated  differently,  to  this  order  in  the  chiral  expansion  the  analytic  structure 
is  not  given  correctly  much  like  in  the  case  of  the  forward  Compton  scattering 
amplitude  discussed  before.  One  should  therefore  perform  an  order  E5  calculation 
in  the  heavy  mass  approach  to  have  a  sufficently  accurate  and  correct  representation 
of  the  isovector  nucleon  form  factors.  A  two  loop  calculation  will  also  answer  the  yet 
unresolved  question  whether  or  not  in  the  isoscalar  channel  there  is  an  enhancement 
around  t  =  9M2.  State  of  the  art  dispersion  theoretical  analysis  of  the  nucleon  form 
factors  assume  only  a  set  of  poles  in  the  corresponding  spectral  distributions  [56]. 
Finally,  I  wish  to  stress  that  in  this  context  the  matching  formalism  discussed  in 
tef.[31]  starts  to  play  a  role  (which  allows  to  relate  matrix-elements  in  the  heavy 
mass  and  relativistic  formulation  of  CHPT). 

9.  Aspects  of  electroweak  pion  production 

In  this  section,  I  will  discuss  some  physics  aspects  related  to  threshold  pion  produc- 

9fifl  T>-..wn<.nA     T    rn,.,™      o  ----  1  ____  j.  T  _____     -.^^^ 
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lire  6.  (a)  Dispersion-theoretical  result  for  Im  F¥(t)/t*  (t  in  units  of  the  pion  mass, 
>ted  fi  here).  As  D-function,  the  inverse  pion  form  factor  is  used,  D(<)  =  l/Fn  (<). 
dash-dotted  and  dashed  lines  are  explained  in  the  text,  (b)  Chiral  representation 
lie  relativistic  formulation  of  baryon  CHPT  [28]  (solid  line)  and  in  the  heavy  mass 
roach  (dashed  line)  [31]. 


Let  me  nrst  consider  uie  prouucuoii  01  one 
current.  As  particularly  stressed  by  Adler  [57],  a  unified  treatment  of  em  and  weak 
pion  production  allows  to  relate  information  from  neutrino-nucleon  and  electron- 
nucleon  scattering  experiments.  Obviously,  by  using  PCAC,  the  coupling  of  the 
weak  axial  current  to  the  nucleon  in  the  initial  state  and  a  nucleon  plus  a  pion 
in  the  final  state  is  closely  related  to  vN  scattering.  Consider  now  a  reaction  of 
the  type  i/(*i)  +  #(Pi)  -*  W  +  #M  +  *a(?)  and  define  k  =  *i  -  *2-  The 
threshold  energy  squared  is  s  -  (pt  +  Jb)2  =  (m  +  M*)2.  At  threshold  and  in  the 
irN  cms  frame,  one  can  express  the  pertinent  matrix-element  in  terms  of  six  S-wave 
multipoles, 


•  e  = 


with  xi,2  two-component  Paul!  spinors,  /i  =  MT/m  the  ratio  of  the  pion  and 
nucleon  mass  and  c^  ~  11/757,,  Uj/  the  axial  polarization  vector.  The  superscript  '±' 
refers  to  the  isospin  even  /odd  part  of  the  amplitude.  In  ref.[58],  we  derived  the 
chiral  expansions  of  these  threshold  multipoles  to  order  E3.  Of  particular  interest 
is  the  multipole  L^f  since  it  is  directly  proportional  to  the  so-called  scalar  form 
factor  of  the  nucleon,  0>#(2)  ~  <  p'\m(uu  +  dd)\p  >  [58], 


The  constant  C^  subsumes  numerous  P-independent  kinematical,  loop  and 
counter  term  contributions.  If  one  assumes  C^  to  be  of  the  order  of  1  GeV~3,  the 
term  proportional  to  the  scalar  form  factor  dominates  the  amplitude  (43)  in  the 
threshold  region.  This  might  offer  another  determination  of  this  much  discussed 
quantity.  However,  an  analysis  including  also  higher  order  effects  has  to  be  per- 
formed to  find  out  how  cleanly  this  multipole  can  be  separated  in  neutrino-induced 
pion  production  and  how  large  the  corresponding  cross  section  is.  It  is  furthermore 
interesting  to  note  that  although  L$  vanishes  at  the  photon  point  k2  =  0  in  the 
chiral  limit,  its  slope  nevertheless  stays  finite  -  this  is  a  particular  effect  due  to 
chiral  loops. 

Another  reaction  of  interest  is  the  photoproduction  of  two  pions  in  the  threshold 
region.  It  gives  complimentary  information  to  the  extensive  studies  of  single  pion 
photo-  and  electroproduction  performed  over  the  last  few  years.  Dahm  and  Drech- 
sel  [59]  were  the  first  to  systematically  study  the  process  7^  —  *  TTrAT  in  a  chiral 
field  theory.  To  be  specific,  they  considered  Weinberg's  pion-nucleon  Lagrangian 
[60]  coupling  in  the  photon  via  minimal  substitution.  At  threshold,  the  transition 
current  takes  the  form 


'  €thr=  ™  '  (e  x  i)  [MiSat  +  M28ahr3  +  M3(ra663  +  n6a3)]  (44) 

where  'a.b'  are  the  pion  isospin  indices  and  the  r's  act  on  the  nucleon.  The  explicit 
form  of  the  corresponding  five-fold  differential  cross  section  is  given  in  [59].  Here, 
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their  expansion  in  powers  of  M*.  In  general,  the  reaction  fN  -+  irirN  involves  five 
independent  Mandelstam  variables.  At  threshold,  the  kinematics  is  simplified  since 
the  pion  four-momenta  are  equal,  qi  =  <?2  =  (M*,  0, 0,  0).  In  heavy  fermion  CHPT 
the  calculation  furthermore  simplifies  if  one  works  in  the  Coulomb  gauge  e  •  v  =  0 
and  realizes  that  S  •  q\  =  S  •  q?  =  0  [61].  The  lowest  order  result  stems  from  tree 
diagrams  with  one  insertion  from  £^  as  shown  in  fig. 7. 


Figure  7.  Tree  diagrams  which  lead  to  eq.(45).  The  box  denotes  an  insertion  from 
Solid,  dashed  and  wiggly  lines  denote  nucleons,  pions  and  photons,  respectively. 


One  finds 


=  -^  •  (45) 


These  differ  from  the  results  in  [59]  by  the  terms  proportional  to  KV  and  are  a  factor 
two  smaller  in  magnitude.  The  reason  is  that  gauging  the  Weinberg  Lagrangian  can 
not  generate  the  anomalous  couplings  of  the  photon.  Of  course,  simply  calculating 
these  tree  diagrams  is  not  sufficient,  one  has  to  at  least  work  out  the  O(M-K]  correc- 
tions. These  are  (i)  kinematical  corrections  of  the  type  MT/m,  (n)  contributions 

(*}   *\\ 

from  one-loop  diagrams,  (Hi)  further  insertions  from  £^  and  (iv)  contributions 
from  tree  diagrams  with  intermediate  A(1232)  states.  As  a  preliminary  result,  let 
me  consider  the  corrections  of  the  first  two  types  (a  more  thorough  discussion  can 
be  found  in  ref.[61]).  Although  the  A(1232)  is  very  close  to  the  two  pion  produc- 
tion threshold  (the  energy  difference  being  17  MeV),  the  potentially  large  diagrams 
with  small  energy  denominators  of  the  type  m&  —  (m#  -f  2MT)  are  suppressed  by 
corresponding  numerators.  The  corrections  of  type  (i)  and  («)  lead  to  [61] 


Mi     = 


M2    = 
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Ulf-G.  Meaner 


=    (0.159  +  «  0.082)  fm3 


8m2  F* 

(0.032  -ft  0.020)  fm3 


-^-fs(2v/3-ln(2-f  V3)) 


(46) 


For  MI,  the  loops  do  not  contribute  to  order  M*  where  as  for  M^,3  they  lead  to  a 
complex  correction.  This  due  to  some  loop  diagrams  involving  two  or  three  pion 
propagators  which  acquire  an  imaginary  part  for  u  >  Mv  (here,  u  —  2Afr).  The 
first  term  on  the  r.h.s.  of  (46)  stems  from  the  kinematical  corrections.  Comparing 
the  numbers  in  (46)  to  the  lowest  order  results  M\  —  0,  MI  —  -2Ma  =  0.084  fm3, 
one  sees  that  the  Mr  corrections  are  large.  One  might  therefore  question  the  whole 
approach,  but  it  is  conceivable  that  once  the  type  (u>)  corrections  from  the  A  are 
included,  the  dominant  physics  will  be  under  control  and  subsequent  higher  order 
corrections  play  a  minor  role.  This  topic  is  under  investigation  [61].  To  get  an  idea 
about  the  role  of  the  loop  and  kinematical  corrections,  let  us  consider  the  specific 
final  states  like  in  jp  —*  v°v°p.  In  that  case,  the  cross  section  is  proportional  to 
the  quantity  \M00\  =  [(Mi  -f  M2  +  2M3)(Mi  +  M2  +  2M3)*]1/2  and  similarly  for  the 
other  channels.  In  table  2,  I  show  \Mij\  in  fm3  for  the  lowest  order  (45)  and  with 
the  MT  corrections  (46).  The  most  prominent  result  is  that  the  double  TTO  channels, 
which  are  vanishing  to  lowest  order,  are  in  fact  dominant  after  the  inclusion  of  the 
corrections. 


Table  2.  Contribution  of  the  threshold  multipoles  to  two  pion  production  channels  in 
fm3  (no  phase  space  factors  are  accounted  for). 


process 

C?(l)     O(1)  +  O(MT) 

7P  —  +•  ir+ir~p 

0.084 

0.125 

jp  —  t-  T+7r°n 

0.059 

0.108 

W-^p 

0.000 

0.271 

7n  —  »•  7r"l"7r~n 

0.084 

0.099 

7W  —  »•  7T~  7T  p 

0.059 

0.108 

Tn^°n 

0.000 

0.238 

This  is  completely  different  from  the  single  photoproduction  case  in  which  the  final 
states  including  a  charged  pion  have  the  largest  cross  sections.  An  experimental 
verification  of  this  pattern  would  be  of  utmost  interest.  It  also  oersists  when  one 


with  EI  the  photon  energy  in  the  lab  frame,  91,2,3  isospin  factors  (like  e.g.  771  = 
»72  =  1,  Va  -  2  for  the  pic0*0  final  state)  and  £  is  a  Bose  factor  (  =  1/2  in  case  of 
equal  particles  in  the  final  state,  one  otherwise).  The  threshold  energy  is  E*hr  = 
2M»(1  +  p.)  =  320.7  MeV  with  p  =  Mw/m.  The  formula  (47)  is  only  valid  close  to 
threshold  assuming  that  the  amplitude  in  the  threshold  region  can  be  approximated 
by  the  threshold  amplitude.  Furthermore,  the  three-body  phase  space  has  been 
approximated  by  an  analytical  expression  which  is  good  within  a  few  percent.  For 
JEy  =  330  MeV,  i.e.  10  MeV  above  threshold,  the  total  cross  section  is  0.35,  0.15 
and  0.11  nbarn  for  *fp  — +•  pv°Tf°,p7r+v~  and  n7r+7T°,  in  order.  Of  course,  these 
numbers  should  only  be  taken  as  a  first  approximation  due  to  the  assumptions 
going  into  their  calculation.  It  is  therefore  a  tough  experimental  task  to  measure 
these  reactions  close  to  threshold  and  verify  the  effects  of  the  chiral  loops. 

10.  Brief  outlook 

In  this  lecture  I  could  only  give  a  glimpse  of  the  many  facets  of  chiral  perturba- 
tion theory.  The  role  of  effective  Lagrangian  methods  in  the  parametrization  of 
physics  beyond  the  standard  model  and  in  the  context  of  longitudinal  vector  boson 
scattering  to  test  the  Higgs  sector  of  the  electroweak  symmetry  breaking  has  been 
discussed  here  by  Burgess  [66]  and  Phillips  [67].  Another  widely  discussed  topic  is 
the  combined  application  of  chiral  symmetry  and  heavy  quark  effective  field  theory 
methods  (some  references  can  be  traced  back  from  [4]).  Furthermore,  these  meth- 
ods allow  also  to  make  precise  statements  for  finite  temperature  and  volume  effects 
and  much  more.  There  remain  many  open  theoretical  problems  and  challenging 
experimental  tasks  to  further  tighten  our  understanding  of  the  strong  interactions 
at  momentum  scales  were  they  are  really  strong. 

I  would  like  to  thank  the  organizers,  in  particular  Dr.  Uma  Sankar,  for  the 
warm  hospitality  extended  to  me  and  an  efficient  organization.  I  also  thank  the 
Saha  Institute  for  Nuclear  Physics,  in  particular  Prof.  S.  Mallik,  for  providing  a 
stimulating  atmosphere  and  support  to  perform  part  of  this  work. 
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M.  K.  PARIDA 

Physics  Department,  North-Easatern  Hill  University, 

P.  O.  Box  21,  Laitumkhrah,  Shillong  793  003. 

Abstract.  We  discuss  recent  contributions  on  threshold  effects  in  grand  unfiied  theories 
including  minimal  SUSY  SU(5),  non-SUSY  modifications  of  the  grand  desert  in  SU(5)  and 
SO(10),  and  SO(10)  with  single  intermediate  symmetires.  Consequences  of  theorems  on 
vanishing  GUT-scale  corrections  to  sin2  6W  in  SO(10)  with  SU(2)z,  XSU(2)j?  XSU(4)C 
(g2i-  =  g2/0  intermediate  symmetry  are  discussed  and  vanishing  corrections  on  the  inter- 
mediate scale  are  explicitly  demonstrated  where  predictions  are  more  precise.  Threshold 
and  higher  dimensional  operator  effects  in  SUSY  SU(5)  recently  derived  by  a  number  of 
authors  are  presented. 


1.  Introduction 

Experimental  measurements  at  the  CERN-LEP  at  the  Z-peak  and  improved  estima- 
tion of  the  finestructure  constant  have  rpovided  more  precise  values  of  sin2  9\y ,  as 
and  a"1, 

sin26w     =     0.2324     ±     0.0006 

as_1          =     0.12         ±     0.01  (1) 

a  =     127.9       ±     0.2 

leading  to  very  accurate  determination  of  the  standard  model  gauge  couplings, 

01     =     0.016887    ±     0.00004 

c*2     =     0.03322      ±     0.000025  (2) 

<*3       =       OtS 

at  the  Z-mass.  These  have  revived  interests  in  grand  unified  theores  [1]  resulting  in 
more  precise  predictions  than  before[2].  In  addition  to  one-  and  two-loop  contribu- 
tions to  the  gauge-symmetry-breaking  scales  threshold  [3]  and  higher  dimensional 
operator  effects  [4]  have  been  estimated  and  their  impact  on  GUT-prediction  on 
sin20iy,as,  proton  lifetime,  and  neutrino  masses  have  been  calculated.  It  has 
also  been  found  in  a  class  of  GUTs  that  in  addition  tot  he  vanishing  of  multi- 
loop  corrections  at  high  mass  scales,  the  unknown  uncertainties  on  sin  Q\y  and 
the  intermediate  scale  due  to  threshold  and  higher  dimensional  operator  effects  are 
absent  leading  to  more  precise  predictions[4-6j.  The  purpose  of  the  present  talk 
is  to  review  these  works  and  other  corrections  obtained  in  SUSY  and  nonSUSY 
GUTS.  In  addition  we  demonstrate  explicitly  the  stability  of  the  G^p-breaking 
scale  under  threshold  and  higher  dimensional  operator  effects  leading  to  a  more 
precise  predictions  of  degenerate  and  see-saw  contributions  to  neutrino  masses  in  a 
class  of  GUTs. 
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generalised  formulas  for  mass  scales  and  threshold  effects.  In  Sec. 2  we  discuss  in 
detail  the  threshold  effects  in  modified  grand  desert  models  such  as  SU(5)  and 
SO(10).  In  Sec.3  we  derive  threshold  corrections  in  modified  grand  desert  models 
[7,8].  In  Sec.4  we  estimate  threshold  effects  in  SO(10)  with  G224P  intermediate 
symmetry.  Threshold  effects  in  other  single  intermediate  scale  models  of  SO(10) 
are  summarized  in  Sec. 5.  Section  6  is  devoted  to  discussions  on  corrections  in  SUSY 
SU(5).  We  provide  a  brief  summary  and  conclusions  in  Sec. 7. 

2.  Generalized  formulas  for  mass  scales 

Although  minimal  nonSUSY  SU(5)  is  ruled  out  experimentally,  the  modified  grand 
desert  models  based  upon  SU(5),  S0(10)  and  others  with  light  degrees  of  freedom 
are  consistent  with  all  the  available  data.  Evidences  on  neutrino  masses  would 
rule  out  SU(5)-based  modifications  of  the  grand  desert,  but  a  number  of  other 
GUT  would  survive.  In  the  absence  of  theoretical  formulas  for  the  mass  scales  or 
sin2  QW  it  might  be  still  possible  to  study  the  unification  constraints  by  plotting  the 
coupling  constants  numerically,  but  accurate  estimations  of  the  model  predictions 
including  experimental  and  theoretical  uncertainties  analytic  formulas  are  essential. 
Such  formulas  can  be  obtained  more  easily  for  single  step  breaking  of  any  GUT  to 
the  standard  with  or  without  SUSY[9,10j.  We  derive  here  generalised  theoretical 
formulas  for  mass  scales  for  modified  grand  desert  models  and  two  step  breakings 
in  GUTs.  These  formulas  contain  explicitly  the  loop  contributions  to  each  order 
although  at  present  contributions  upto  two-loops  only  are  calculated.  More  impor- 
tant is  that  the  formulas  have  analytic  expressions  for  different  types  of  corrections 
arising  at  the  intermediate  or  the  GUT-scales[4.7,8,13j. 
At  first  we  consider  the  following  class  of  models, 

(a)  50(10)  ^  G2i3  ^  Gi3 

where  G2i3(=  SU(2)LXU(l)YXSU(3)c)  is  the  standard  model  (SM),  Gi3  = 
U(l)emXSU(3)c'  and  the  presence  of  light  degrees  of  freedom  corresponding  to 
additional  Higgs  scalars  or  fermions  has  been  assumed.  We  also  consider  the  fol- 
lowing class  of  models  based  upon  5O(10), 

(b)  5O(10)  —3-  GI  — '•+  G2i3  — %•  Gi3 

where  G  =  SU(2)LXU(l)hpXSU(4)c(=  GM)[l^SU(2)LXSU(2)RXSU(4)c 

(—  G22iaXP,  P  =  Parity  =  Left-right  discrete  symmetry,  02L  =  02fl))  G224p 
2L  —  <72fi)-  It  may  be  noted  that  SO(10)  also  permits  the  left-right  dis- 
crete symmetry  to  be  broken  at  a  higher  scale  without  breaking  SU(1)RXU(l}B-L 
or  SU(2}piXSU(4.)c  which  can  survive  down  to  lower  scales  as  in  the  cases  of  G22i3 
or  G224.  In  such  a  situation  the  parity  and  the  5C/(2)2R-breakings  are  decoupled 
[11]. 

In  models  of  the  type  (a)  or  (b)  the  renormalisation  group  equations  (RGEs) 
for  the  gauge  couplings  of  G2is  or  G/  can  be  written  in  the  following  manner  in 
the  two  mass  ranges, 
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MZ  <  I*  <  Mj 

<*i(Mz)          oii(Mi)       2?r     MZ       4ir       12r 

In  model  (a)  M/  =  degenerate  mass  of  additional  degrees  of  freedom  in  the  desert. 
MI  <  n  <  MU 

1  1  n^  " 


where  QG  is  the  GUT-coupling  constant  and  the  two-loop  contributions  are  repre- 
sented by  terms  containing  Pi-functions, 


(5) 


In  (3)-(5)  a;(6,-j)  and  a^(6^)  are  the  one(two)-loop  coefficients  of  the  /3-function  in 
the  two  mass  ranges.  The  A,-functions  in  (3)  and  (4)  represent  threshold  effects 
due  to  heavy  or  superheavy  particles  whose  masses  are  of  the  order  M/  or  MU-  But 
as  these  masses  are  unknown  theoretically,  the  threshold  effects  due  to  heavy  or 
superheavy  particles  whose  masses  are  of  the  order  M/  or  My  .  But  as  these  masses 
are  unknown  theoretically,  the  threshold  effects  contribute  to  the  uncertainties  of 
the  model  predictions  on  the  mass  scales,  sin2  dw,  and  as  etc.  For  fj,  fa  MI  of  M(/, 
the  general  expression  for  A,-(/i)  in  nonSUSY  theories  is[14], 


(6) 


where  tw,tiF,  and  tis  are  the  generators  in  the  representations  of  the  heavy  or 
superheavy  gauge  bosons(V),  fermions(F),  and  scalars(S),  respectively.  In  our  no- 
tation 

Af   =   Al-(Ai  =  M/)l     Af  =  AI-(/i  =  Mtr)  (7) 


Physically  the  first  two  terms  are  the  vacuum  polarisation  contribution  of  the  heavy 
gauge  bosons(V)  and  the  next  two  are  similar  contributions  due  to  fermions(F)  and 
Higgs  scalars(S)  in  the  theory.  Using  (3)  and  (4)  and  suitable  linear  combinations 
of  the  SM  gauge  couplings  we  derive  and  following  generalised  formulas  for  mass 
scales  for  models(a)  and  (b), 

MZ 

ln~rr~  —  [(AifLg  —  BuLs)  4-  (K^Au  —  J?,Bu}  -\-  (J\Bu  —  K\Au)]/D       (9) 
where 


D  =  AuBj  -  AjBu  .  Ls  =    -       a^1  -  -z 

o      \  o 


In  (8)  and  (9)  the  first,  second,  and  the  thrid  terms  represent  the  one-loop,  two-lo< 
and  the  threshold  contributions,  respectively  and  the  values  Ai,  B,-,  J,-  and  K{  diff 
from  one  model  to  the  other, 

MODEL  (a) 

AI      =     a2L  +  fay  -  |a3c  -  (a'2£  +  fa'y  -  fa3C) 
B,     ~     |  (ay  -  a2L  -  a'y  +  a'2L 


(1 


=     a'2L  +  f  a'y  -  f  aj 
|(a'y-a'2L) 


4-  |L?  -  f 


MODEL  (b) 

(l)  G/    =    G^ 


=       Q      - 


fay  -  2a'4C  -f  a72L  -f  a'2 


Au     =     2a'4C  -  a'2L 


=      la/2L  ~  a2 


(1 


(ii)  G/  =  Gz2i3  or  C?22i3P 

Generalised  formulas  for  mass  scales  are  obtained  from  (11)  of  model  (b)  (i) 
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the  replacements, 


2Lfc     —     |1&     -     \IFBL     ,     §L?C     -*     jlgL  (12) 

opt/        _  .       8pC7        _       2nU  2pt7        _  .       2  pl7 

Z/4C         *•     a^ac  a^BL     >     3^4C     —  >     S^BL 

(iii)  G/  =  G2i4  =  SU(2)LXU(l)l3KXSU(4)c 

Generalised  formulas  for  mass  scales  are  obtained  from  (7),  (8)  and  (10)  by 
replacing 


pU      _      pU         TU       _  fc     TU 
FIR    -  »•    *IB  ,   L2R    -  f    LIR 

(iv)  G/  =  Gana  =  SU(2)LXU(l)i9RXU(l)B-LXSU(^c 

The  generalised  formulas  in  this  case  are  obtained  by  combining  (12)  and  (13). 

Before  closing  this  section  it  is  worth  pointing  out  that  other  corrections  due  to 
Yukawa  couplings,  top  quark  mass,  higher  dimensional  operators  etc.  can  be  derived 
following  the  same  procedure  as  threshold  effects  with  appropriate  replacements  of 
the  A-functions. 

3.  Threshold  effects  in  modified  grand  desert  models 

It  has  been  found  that  the  SU(5)  model  with  additional  light  degrees  of  freedom 
corresponding  to  fermions  or  Higgs  scalars  is  consistent  with  the  CERN-LEP  data 
and  proton  lifetime  for  the  p  —  >•  e+7r°  mode  [8,15-160, 

(rp)e«P«.    >   3  x  1032  yrs.  (14) 

The  added  presence  of  new  degrees  of  freedom  such  as  the  Higgs  scalars  signifi- 
cantly below  MU  needs  additional  finetuning  of  parameters  which  is  not  natural. 
In  order  to  keep  this  unnatural  act  to  a  minimum  we  have  successfully  implemented 
new  minimal  modification  of  the  grand  desert  which  needs  only  one  additional  SM 
irreducible  real  scalar  representation  transforming  as  ("(3,0,8)  and  needs  only  one 
additional  finetuning  of  parameters  [7]  .  Moreover  the  model  has  been  implemented 
in  SU(5)  and  the  popular  GUT  like  SO(10)  thereby  accounting  for  the  CERN-LEP 
data,  the  proton  lifetime  and  small  neutrino  masses  necessary  to  explain  the  solar 
neutrino  flux  and  the  dark  matter  of  the  Universe  in  the  latter  case.  The  additional 
Higgs  scalar  C(3,0,8)  C  75  of  SU(5)  and  210  of  SO(10).  Thus  the  implementation 
needs  the  Higgs  representations  24,  5  and  75  in  SU(5)  and  45,  126,  10  and  210 
in  S0(10).  Thus  keeping  MZ  <  Mj  =  M^  <  MU  and  evaluating  the  coefficients  in 
(7)-(10)  yields  the  following  analytic  expressions  for  the  mass  scales. 


ranaa 


f  (Pa^  +  PaVfCf  +  AC)1  +  dn(25A>  -48A2 


+ 


where  the  invrse  of  the  GUT-coupling  constant  a^1  has  been  derived  from  the 
finestructure  constant  relation, 


and  the  RGBs  for  the  gauge  couplings.  Clearly  the  threshold  effects  are, 


II 
AG    =    Aa-JG     = 


(25A1-48A2-23A3) 


where  Af  =  A-7  which  are  evaluated  using  the  decomposition  of  the  superheavy 
components  of  the  Higgs  representations  under  G2i3. 

3.1  The  SU(5)  model 

24    D    D1(3,0,1)+£>2(1,0,8) 

5      D    C(l,-|,3) 
75     D 


Defining  rj,  =  In--*-,  extremisation  of  Ay  requires 


TJc        ~      T]Dl    =    TJE3    =    TIE*    =    T)E7    =    »7~      =    =F 

leading  to 

oync 


Atr  =  iO.38  In0  ,  Ac   =  ±1.0 

Here  the  significance  of  ft  is  that  the  superheavy  scalar  components  can  have  a 
mass  (3Mu(Mu/P),P  =  1  —  10  in  analogy  with  the  standard  model  Higgs  boson. 
Without  threshold  effects  the  solutions  are, 


Mo  =   10io.a*o.6     Mo   =   itfs^GeV  .  a0'1   =  39.05  ±0.7 


a"~li(fJi)  are  prevented  to  cross  below  the  unification  mass  because  of  the  presence  of 
C  at  M}  =  Mc°  =  Wl°-2±0-3GeV  which  changes  the  slopes  of  a^foi)  and  a^(/*)  for 
\i  >  M®.  The  unification  mass  has  been  increased  by  almost  an  order  of  magnitude 
increasing  the  proton  lifetime  consistent  with  the  experimental  limit.  Inlcuding 
threshold  effects  the  values  of  mass  sclaes  and  coupling  constants  are  given  in  Table 
1.  Clearly  the  model  predictions  with  /?  =  10  saturates  the  Suprkamiokande  limit 
forp — >e+  +  7r°. 

Table  1.  Threshold  effects  on  mass  scales,  GUT  coupling  and  proton  lifetime  predic- 
tions in  the  degenerate(D)  and  nondegenerate(ND)  cases  of  superheavy  scalar s  in 
nonSUSY  SU(5)  with  modified  grand  desert. 


-1.0  +.08  +2.9  +.30 

10P)        10+10        10-02        -1.9      10-08 


10+2-2        10±29        +'.4       10±-92 


10±38        +'.7       10±1-52 


3.2  The  SO(10)  model 

One  important  advantage  of  SO(10)  over  SU(5)  is  the  generation  of  Majorana 
neutrino  masses  over  a  wide  range  of  values.  The  real  scalar  £(3,  0,  8)  C  210  of 
SO(1).  The  implementation  of  the  modified  grand  desert  model  in  SO(10)  requires 
the  Higgs  representation  10,  45,  126,  and  210  although  the  model  also  works  by 
replacing  45  by  another  210.  The  superhevy  components  are  identified  in  each 
representation.  Then  following  the  procedure  explained  for  SU(5)  we  obtain [7], 

Au     =     [-M]ln(3 

(22) 

A,      =     [+2'.65]/n/3 

Numerical  values  on  threshold  effects  on  mass  scales,  GUT  coupling  contants  and 
proton  lifetimes  are  presented  in  Table  2  in  the  degenerate  (D)  and  nondegenerate 
cases  for  ft  —  5  —  10  where  ft  =  MifMu-M.  being  the  superheavy  Higgs  scalar 
masses.  It  is  clear  that  for  superheavy  masses  differing  by  a  factor  10(1/10)  from 
MU  the  increase  in  the  proton  lifetime  could  be  as  large  as  50  times  the  uncorrected 
value  and  the  model  can  not  be  rules  out  by  the  planned  experiments  in  near  future. 
The  Majorna  netrino  masses  in  the  jodel  can  arise  in  various  ways.  For  example 
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m,,   =   \h>  <  4°  <2<  A^  >  /MlL  ,•  =  «,/*,  r  (23) 

with 

mVm     =    (2.5  xlO~6     —    2.5xlO~4)eVr 

m^     =    (7.5  xlO-4     —     2.5xlO~2)eVr  (24) 

m,,r     =     (7  x  10~2        —     7)eV 

where  A^  is  the  neutral  comonent  in  A/{(1,  3,  fO)  carrying  J3-L  =  2  and  A/;(3, 1, 10) 
is  the  corresponding  left-handed  triplet.  Both  are  contained  in  126  of  SO(10).  Here 
<£°  is  the  netural  comonent  of  the  standard  doublet  C  10  of  SO(10)  and  A(/i,-)  is 
the  Higgs  quartic  (Yukawa)  coupling  between  126  and  10.  Here  we  have  used 
hi  <  <f>°  >=  mq . .  It  is  evident  that  while  mVe  and  mn,-M  are  compatible  with  the 
solar  neutrino  flux  nVr  value  can  be  made  to  be  consistent  with  the  dard  matter  of 
the  universe.  The  model  can  be  also  modified  slightly  to  produce  degenerate  and 
see-saw  contributions[17-18j. 

Table  2.  Same  as  Table  1  but  for  nonSUSY  SO(10). 


+  .85  -.03  _12  0  -•a< 

10(£>)       10-°°        10+-°°       +0.0      10+-00 


+a.<  -.46  j_4  5  -I.T 

10~2-6        10+-30       -7.2      10+0-8 


1Q(ND)      lO-^7        10-6-8      ^16680    10+1'"1 


Although  corrections  to  the  intermediate  mass  M(  is  quite  significant  that  on 
and  hence  on  TP  are  small.  Such  results  have  been  also  obtained[8]  in  computing 
threshold  effects  in  SU(5)  with  split  multiplets[16].  We  observe  from  these  analyses 
that  the  presence  of  £213  below  the  GUT-scale  reduces  uncertainty  on  the  proton 
lifetime  prediction. 

4.  Threshold  effects  in  SO(10)  with  intermediate  scale 
and  theorems  on  vanishing  corrections 


Here  we  discuss  threshold  effects  in  SO(10)  with  G^p  intermediate  symmetry  in 
detail  and  breifly  mention  the  results  with  other  intermediate  symmetries. 
(a)  GIMP  Intermediate  Symmetry 
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With  the  choice  of  minimal  Higgs  representations  54,  126,  and  10,  we  have 
of   =   (11/3, 11/3, -14/3),  Au  =  -50/3,  Bv   =  50/9 
AI   =   -17/3  ,  Bj   =  -53/3 


584/3        3         765/2 
3        584/3     765/2 
_ 153/2     153/2     1759/6 


(25) 
L53/2     153/2     1759/6^ 

Using  these  in  (8)-(10)  we  obtain, 


— \'u 

(26) 


It  is  important  to  note  that  the  presence  of  L-R  symmetry  for  p.  >  Mj  demands 
^2L  =  ^2R  which  leads  to  vanishing  GUT-threshold  corrections  on  A/  or  equiv- 
alently  on  Mj.  This  is  a  manifestation  of  theorems  on  vanishing  corrections  on 
sin2  6w  to  be  discussed  a  little  later. 

We  make  the  simplifying  assumption  that  all  the  superheavy  components  beling- 
ing  to  a  single  GUT  representation  have  the  same  mass.  Also  we  use  Mii  —  ML 
and  MR.  =  MR  leading  to  r,Li  =  r)L,r)Ri  =  rjR, 


nooVAJL'/oi  ~  8877126  —  447710  -r  Il77<£  —  13177.R  -f 

(28) 
A/      =     ±(9riR  • 


(a.l)  Theorem  on  vanishing  corrections  on  sin2  6\y 

It  is  important  to  note  that  the  GUT  threshold  contributions  such  as  7710,7754 
and  77126  are  absent  in  A/.  Also  the  contributions  of  two  loop  functions  P¥r ,  P¥D 

--  *•  £if  *        J~il\. 

and  P%c  in  A/  cancel  out.  These  are  due  to  theorems  proved  recently  in  SO(10) 
with  G224P  intermediate  symmetry  occuring  at  the  highest  intermeidate  scale[5,6]. 
Theorem  1  In  all  GUTs  where  the  G224.P  symmetry  breaks  at  the  highest  interme- 
diate scale  (M/),  the  GUT  -  threshold  contribution  to  sin2  QW  vanishes[5j. 
Theorem  2  In  all  GUTs  where  the  <J224P  symmetry  breaks  at  the  highest  interme- 
diate scale  (M/),  the  gauge  boson  renormalization  to  every  m  -  loop  order  (m  >  2) 
in  the  mass  range  MI  —  MU  has  vanishing  contribution  to  sin2  6\y . 
(1.2)   Vanishing  corrections  on  A/n(M//M^)  and  precise  ptediction  on  neutrino 
masses 

In  ref[5]  it  was  shown  that  all  perturbative  and  non-pertubative  corrections 
including  gravity-induced  higher  dimensional  operator  effects  occuring  at  the  GUT 
sclae  also  have  vanishing  contributions  on  sin2  0\\r  making  the  intermediate  scale 
Mj  quite  stable.  Here  we  provide  a  proof  on  the  stability  of  Mj  against  all  such 
uncertainties.  Since  left-right  symmetry  is  restored  in  the  presence  of  GII*P->  we 
have 

/        /  \/U     \iU        s~*U      /-*[/  fe)Q\ 

Un  r        "-"•••       Q*>  I>    i      "O  T        """"       ^O  D     i     ^-^  1  T        """"       »— '  1 D  \  ^  */  / 

4JL/  Z/T      '  X  JL  £.I\.      >  £,Lt  £,ft  \  / 


contrioutions  aue  to  nigner-airnensionai  operators,  me  corrections  to  tne  coupling 
constants  at  fi  —  M\j  can  be  written  in  the  form 


<*i(Mu)    ~    <*G  12          •  , 

Then  all  the  formulas  derived  so  far  hold  hold  if  we  replace  \'f  —  >  A(-y  +  Cf7.  In 
eq.(9), 

Au     =    2«c  -  a'2L)  , 


5  f\(    f  I     \ 

_  **>          3  *     4O  2iLtJ 

(30) 
B/      =     || 


which  by  virtue  of  (29)  lead  to 

A?   =  0  (31) 

This  can  also  be  verified  through  eqs.(17)  and  (29).  It  is  to  be  noted  that  (31) 
holds  irrespective  of  the  nature  of  the  correction  arising  at  the  GUT  scale  and  it  is 
independent  of  the  presence  or  absence  of  supersymmetry.  The  cancellation  occurs 
for  any  possible  nonperturbative  contributions  also.  Thus  the  Gmp  -breaking 
scale  is  not  affected  due  to  any  corrections  emerging  from  the  GUT-sclae  although 
the  GUT-scale  itself  is  affected  by  the  corrections.  In  the  absence  of  threshold 
corrections  we  obtain 

A/g     =     1015±0-25GeV 

Mo      =     i0i3.6*o.2GeV>  (32) 

a'1     =    40.6*0.2 

where  the  uncertainties  are  due  to  those  in  the  input  parameters. 

The  threshold  effects  are  computed  by  extremising  AC/  or  A/  while  taking  into 
account  the  parity  restoration  constraint  for  p,  >  MU  with 

Numerical  values  of  corrections  are  given  in  Table  2  for  ft  =  5  —  10.  Predictions  for 
proton  lifetime  for  the  p  — »•  e+7r°  is  amde  using[21] 

SU(S) 

"G 


rso<io) 
For  /?  =  10  the  model  predicts 

rp   =   1.44  x  1032-i:fc-7±1-o:fcl-9yrs.  (34) 
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parametrs,  and  threshold  effects.  It  is  clear  that  the  maximum  value  for  rf  exceeds 
the  current  Super-kamiokande  limit  by  almost  one  order.  Thus,  as  against  the 
conclusions  of  ref[19],  this  model  can  not  be  ruled  out  by  any  improvement  on 
proton-lifetime  measurement. 


Table  3.  Threshold  effects  in  SO(10)  with  G^4P  intermediate  symmetry. 


G1        Tp/T? 


-.30 

-.30 

—  1  4 

5(ND) 

io+-26 

10-h.27 

-fO.2 

1QT1.2 

-.30 

-.44 

-2.0 

W(ND) 

10+.40 

io+-40 

+0.2 

lQTl-7 

Threshold  effect  on  M/  shown  in  Table  3  is  due  to  that  at  the  intermediate-scale 
boundary  only. 

=  ±0.4 


The  higher-dimensional  operators  scaled  by  the  Planck  mass  do  not  modify  the 
intermediate  scale.  Also  imposition  of  a  horizontal  symmetry  does  not  change  the 
scale  unless  additional  light  scalars  are  introduced  below  M/.  Even  if  the  left- 
handed  triplet  is  not  given  a  VEV  explicitly  [28],  it  can  be  induced[29,30], 


mv     =     7         =   (2  -  200)  x  10±0  2±0  Af\eV 
=     (0.5  -  800)/AeV  , 

where  we  have  evaluated[30] 

T  «  10/A//?2  ,  ft  =  0.1  -  1  , 
/(A)  being  a  Yukawa(quartic)  coupling.  The  see-saw  contributions  are, 

mVe     =     5  x  io-w-«±-*±-«eV  , 

mv^     =     1.6  x  10-3  -6±  2±AeV  ,  (36) 

mV      =     0.5  x  !Q-*±-* 


These  neutrino  masses  are  capable  of  explaining  the  solar  and  the  atmospheric 
neutrino  oscullations  while  offering  neutrinos  as  strong  candidates  for  the  dark 
matter  of  the  Universe.  In  addition  the  degenerate  neutrino  mass  for  i/e  could  be 
confirmed  by  the  neutrinoless  double-beta  decay  experiments. 
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Threshold  effects  in  SO(10)  with  GI  =  G?224  or  C?22i3  without  parity  (g-^L 
have  been  extensively  discussed  in  ref.[3].  Although  larger  undertainty  was  noted 
earlier  [22]  allowing  for  parity  restoration  and  hence  complete  SO(10)  symmetry  at 
^  >  (10  —  30)M(7,  it  has  been  found  in  ref[3]  that  the  imposition  of  the  parity 
restoration  constraint  at  the  GUT-scale  can  reduce  the  uncertainties  significantly. 
Among  the  two  the  models,  the  one  with  GI  =  £224  appear  to  be  more  favoured 
for  explaining  the  solar  neutrino  puzzle. 

For  GI  =  G2213P1  the  threshold  effects  have  been  computed  in  ref.[20]  and  for 
GI  =  £2  H  they  have  been  computed  in  ref.[12].  The  computations  in  all  cases 
can  be  made  following  the  methods  outlined  in  Secs.2-4.  We  summarise  the  results 
without  going  into  details  in  Table  4.  The  predictions  on  the  proton  lifetime  in 
different  cases  with  /?  =  10  are  given  below  for  extremised  A{// 


Table  4.  Threshold  effects  in  SO(10)  with  single  intermediate  symmetries:  Gm  (model 
A),  G22i3  (model  B),  G22i3p  (model  C),  and  G2i4  (model  D)  for  ft  =  10  in  the 
nondegenerate  case. 


^224          ^72213       G2213P 

(A)         (B)         (C)         (D) 


Mj  /Ml     lO-™7     10--*1       10±-1 


lo-"'"4    lo--'5 


+  0.5  +0.8  +3.6 

-5-°      10~2-°      10±L7     10~°'8 


(A)  GI  =  G224 

Tp     =    1.44  X 

(B)  GI  =  G2213 

(C)  GI  =  G2213P 

rp   =  l^xlO34'2*7*-8*1'7}'™. 

(D)  GI  —  G214 

Some  comments  and  conclusions  are  in  order.  Including  threshold  effects  we  find 
that  even  if  ft  =  1  the  model  (D)  is  consistent  with  the  available  experimental  limit 
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(MSSM).  In  one  approach  the  superpartner  scale  has  been  introduced  at  MS  which 
is  expected  not  to  exceed  feyr  TeVs.  Below  MS  global  SUSY  is  assumed  to  have  bro- 
ken down  but  the  effects  of  superpartner  masses  is  taken  into  account  by  threshold 
correction  at  that  scale, 

(i)  Stf(5)  x  SUSY  ^  G2i3  x  SUSY  ^  G2i3  ^  Gi3  In  the  other  case  it  is 
assumed  that  SUSY  survives  down  to  MZ  and  the  effects  of  superpartner  masses 
are  taken  into  account  through  threshold  effects  at  MZ. 

(ii)  5(7(5)  x  SUSY  -^  G2i3  x  SUSY  ^  G13 

Threshold  effects  due  to  superheavy  masses  are  estimated  at  MU  and  the  cor- 
rections due  to  the  top  quark  is  computed  as  in  the  SM  at  MZ  in  both  cases. 

6.1  Threshold  effects  on  Ms 

With  the  chain(i)  the  generalized  form  of  eqs.(3)-(5)  and  (8)  -  (9)  apply  with  M/  = 
MS  but  different  numerical  values  of  the  coefficients  for  i,j  =  1,2,3' 


a.-    = 


33/5 

1 

-3 


and  with 


where 


7.96-   5.4    17.6 
1.8      25       24 
2.2.      9        14 


Af 


,u  _ 


CNV 


+ 


SH 


(37) 


The  first  term  in  the  RHS  of  (37)  represents  conversion  from  DR  to  MS  scheme 
and  the  second  term  includes  the  superheavy  particle  effects  near  MU  .  In  order  to 
find  the  latter,  it  is  noted  that  the  scalar  multiplets  have  the  components, 

24     =    M,(l,  0,8)  +  ^(3,0,1)4^(2,  5/6,  3)  +  Mv(2,  -5/6.3)  +  M0(l,0,  1)  , 


5      = 


At  first  evaluating  only  the  GUT-threshold  corrections  to  the  gauge  couplings  by 
using  the  superheavy-particle  -  one-loop  ^-functions  yields 


A 
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The  superheavy-mass  contributions  to  the  corrections  are  given  in  Table  5.  Using 
(38)  and  the  standard  procedure  gives 


(39) 


Table  5.  Threshold  and  5-dim.  operator  contributions  to  the  matching  functions  [9]. 


ACNV  ^L  ^SH  A? 


5—  dim 


A!         0          £'n$£     -_     mv      u 

-.028r/ 


— f-CA17 

(40) 


(41) 


Ignoring  all  threshold  corrections  except  &%NV  and  using  the  input  parameters 
from  eq.(l)  in  (38)-(40)  gives 

Afg   =   1016'1:fc-49GeV;  Mj   =   102-44±1'85(7e^  0^=26.5  ±.3          (42) 

Two  loop  contributions  which  were  ignored  from  M§  estimation  in  [24]  is  found  to 
be  comparable  to  the  one-loop  contribution  in 


/n(AfJ/Mz)   =   -2.53  +  3.66  ±4.26  (43) 


GUT-threshold  corrections  are 


The  GUT-threshold  effects  are  estimated  in  Table  6  in  the  degenerate(D),  and 
nondegenerate(ND)  cases  assuming  the  superheavy  masses  to  vary  between  MU  10 
and  j3Mu  with  /?  =  5  --  10  where,  as  per  the  conclusion  of  ref.[24],  the  uncertainty 
in  Ms  is  of  the  order  10*1-2  for  /9  =  10. 


Table  6.  GUT-threshold  effects  on  the  unification  mass  and  superpartner  scale  in  MSSM 
in  the  degenerate(D)  and  nondegenerate(ND)  cases. 


MS/MI 


5(£>)  10*-1  10±44 

10(D)  10?-14  lO*-66 

5(ND)  10±33  10±66 

10(D)  10±-47  10±1'26 

6.2  Constraint  on  superheavy  masses  from  the  CERN-LEP  data 

Hisano,  Murayama  and  Yanagida  [25]  have  observed  that  the  combination  p/o-i  -f 
q/ct2  +  »*/«3  is  dependent  only  on  the  Higgs  colour-triplet  mass(M/fc)  if  (p,  q,  r)  — 
(—1,3,  —2).  Similarly  the  other  combination  (5,  -3,  —2)  depends  on  My  and  ME 
leading  to, 

f 


Incorporating  Af  effects  leads  to  the  replacements  of  second  terms  in  the  RHS  of 
(45)  by  the  logarithmic  functions  of  super  partner  and  the  second  Higgs-doublet 
masses  [25].  Restricting  the  model  to  the  minimal  supergravity  SU(5),  the  particle 
spectrum  and  the  CERN-LEP  measurements  impose  the  following  constraints  on 


the  colour  troplet  mass  and  the  effective  unification  scale 

2.2  x  Wl3GeV       <     MHo  <  2.3  x  Wl7GeV 

(46) 
0.95  x  !Ql6GeV     <     (MMs1/3   <  3.3  x  Wl*GeV 


for  the  gluino  mass  100  GeV  -  ITeV. 

6.3  Higher-  dimensional  operator  effects 

The  effects  of  the  5-dim.  operator, 

(47) 

on  the  MSSM  predictions  have  been  analysed  by  Langacker  and  Polonsky  [9]  and 
Hall  and  Sarid  [26].  The  analytic  expressions  for  the  strong  interaction  coupling 
and  MHc  are  [26], 

a,        =     .132(1  -  .024<r  - 


-.02517(1  -  .la)(m1/2/Mz)-2/9^-1/2}  (48) 

=    (3  x 


where  MQ  =  (STrGjv)"1'2  =  2.4  x  lQl8GeV,  MHI  =  second  Higgs  doublet  mass 
mi/2  =  universal  gaugino  mass,  p.  =  two-doublets  mixing  parameter,  <r  —  (s2  — 
.2326)/.008,  and  AS  and  A24  are  the  trilinear  couplings  in  the  superpotential, 

W  = 

It  is  clear  from  (48)  that  as  increases  with  logarithmic  increment  of  the  colour 
triplet  mass  when  rj  —  0.  Due  to  gravitational  smearing  (e.g.,  77  =  —  1  to  4-1)  the 
allowed  region  in  the  a,  vs.  M//c  plot  is  found  to  be  enhanced  and  blurred. 

6.4  Threshold  effects  through  effective  mass  parameters 

Using  the  symmetry  breaking  pattern(i)  Langacker  and  Polonsky[9]  have  computed 
SUSY  particle  threshold  effects  near  the  boundary  MZ,  in  addition  to  the  GUT- 
boundary  corrections [9]  by  parametrising  these  corrections  in  terms  two  separate 
sets  of  effective  masses.  The  RGEs  for  the  three  gauge  couplings  can  be  written  as, 

— 7TTT    = r-  —  In— -   +   — P[  -  A,-  (49) 

&i(Mz)         CXQ       2x     Mz         4ir 

The  correction  term  A,-  r  dudes  several  contributions, 

Af   =  Af  4  A*°  4-  Aj  4  A^  4  Af 

where  Af  is  the  same  as  (37),  Af  is  the  anlogue  of  Af  but  evaluated  at  the  bound- 
ary MZ,  and  the  others  are  the  Higgs(ho),  the  top  quartk(t),  and  the  Yukawa 
coupling(Y)  contributions  evaluated  at  the  boundary  MZ-  Using  the  RGEs  and 
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lytic  expressions  which  are  independent  of  the  GUT  symmetry  but  only  dependent 
upon  the  grand  desert  hypothesis.  We  furnish  analytic  expressions  on  threshold 
effects  for  one  set  of  predictions, 

4-  6u 


+(a3  *  aD/3  +  (ai  -  aa)Pj5 


(50) 

The  threshold  corrections  are  given  below  which  can  be  computed  numerically  from 
the  Tables  given  in  ref.[9], 

-f  3A2  +  8A3)/F 


6G     =     [(5a;  +  3a'2)A3  -  (5Ai  +  3A2)a3]/F 

F      =     5ai  +  3a'2  -  8a£ 

(51) 
Using.  a-^Mz)   =   127.9,  ata(Mz)   =  0.12  gives 

Mu   =   lO159^^^1   =  23.5,  sin2  ^  =  0.2335. 


Table  7.  The  top  quark  and  the  Yukawa  coupling  corrections.  Here  mt  is  in  GeV  and  ht 
is  the  Higg8-top  quark  Yukawa  coupling  [9]. 


Top  quark  corrections                    Yukawa  coupling 
corrections 

Ai       -.15  +  .13/n(mt/138)  -f  .15(m*/138)2  .17/1? 

A2     -.25  +  .065/n(m(/138)-f  .025(mt/138)2  .20/i2 

A3  .04+.105/n(mt/138) 


The  top  quark  and  Yukawa  coupling  corrections  which  are  the  same  as  the 
Standard  Model  are  given  in  Table  7[9].  The  corrections  at  Af^-boundary  due  to 
the  SUSY  particle  masses  have  been  parametrized  in  terms  of  three  effective  mass 
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nSUSY  _  aSM 

(52) 

where  afUSY  —  a(-,a£  is  the  analogue  of  a;  but  for  particle  £  only,  and  M^  is  the 
mass  of  C-  Using  the  SUSY  particle  spectrum  [27]  the  effective  mass  parameters 
have  been  evaluated  :  for  example  a  possible  solution  is 

mt   =  l&OGev  ,  MI   =  ZQIGev  ,  M2   =  2Q7Gev  ,  M3  =  352Geu 

For  evaluation  of  threshold  effects  at  MU  a  parametrisation  similar  to  (52)  has  also 
been  suggested, 


where  £  stands  for  superheavy  particles  near  the  GUT-scale.  The  prediction  for 
sin2  Qw(Mz}  turns  out  to  be  [9], 

+.0013 

sin2  Ow  (Mz}  =  0.2334  ±  .0025  ±  .0014  ±  .0006  -.0005  ±.0016  (53) 

To  predict  (53)  one  starts  with  the  input  parameters  as(Mz)  and  a(Mz)-  One 
can  start  with  the  parameters  sm^Ow^z)  and  o(Mz)  to  predict  ots(Mz\Mu 
and  a<?[9], 

+.005  ,      „ 

<*s(Mz)  =  .125  ±.001  ±.005  ±.002  -.002  ±.006  (54) 

The  first  number  on  the  RHS  of  (53)  or  (54)  is  for  mt  =  138GeK.  The  second 
entry  in  (53)  ((54))  is  due  to  the  uncertainty  in  the  input  parameter  as(sin2  Ow}- 
The  third,  fouth,  fifth,  and  sixth  entries  are  due  to  SUSY  threshold.  mt  and  rn/,0) 
GUT-threshold  and  5-dimensional  operator  effects.  Predicted  values  of  MU  and  ac 
with  uncertainties  are  given  in  ref.[9j. 

7.  Summary  and  conclusions 

We  have  discussed  threshold  effects  in  modified  nonSUSY  grand  desert  models 
with  SU(5)  and  SO(10)  GUTs,  single  intermediate  scale  models  of  SO(10),  and  in 
the  minimal  SUSY  SU(5).  We  have  also  estimated  neutrino  masses  predicted  in 
nonSUSY  S0(10)  with  G214P  intermediate  symmetry  where  vanishing  corrections 
on  the  intermediate  scale  has  been  explicitly  demonstrated.  We  conlcude  that 

1.  A  minimal  extension  of  the  grand  desert  by  the  introduction  of  a  single  real 
scalar  C(3,0,8)  makes  SU(5)  consistent  with  the  CERN-LEP  data  and  the 
proton  lifetime  measurment.  In  addition  SO(10)  can  account  for  small  neu- 
trino masses  needed  for  solar  neutrino  oscillation  and  the  dark  matter  of  the 
Universe.  Also  the  model  has  the  potentiality  to  provide  degenerate  neutrino 
masses. 

2.  When  the  GUT  symmetry  breaks  down  to  the  SM  gauge  group  in  one  step 
threshold  uncertainty  appears  to  be  smaller  on  the  proton  lifetime  prediction. 
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6.  Certain  moaeis  appearing  10  DC  ruiea  out  oy  two-loop  calculations  coma  sur- 
vive when  threshold  and  5-dim.  operator  effects  are  taken  into  account.  It 
is  essential  to  include  such  effects  while  questioning  their  survival  against  the 
CERN-LEP  data  and  the  proton  lifetime. 

4.  Including  threshold  effects  alone  the  SO(10)  model  with  £214  intermediate 
symmetry  is  consistent  with  the  CERN-LEP  data,  proton  lifetime  and  small 
neutrino  masses. 


5.  In  a  number  of  GUTs  with  Gj24p  intermediate  symmetry  we  prove  explicitly 
due  to  high-sclae-loop  effects,  GUT-threshold  and  higher  dimensional  opera- 
tor effects  with 

Mi  =    1013  6±°-2± 


leading  to  more  precise  values  of  degenerate  and  see-saw  values  of  neutrino 
masses  needed  to  explain  the  solar  and  atmospheric  neutrino  oscillations 
and  the  dark  matter  of  the  universe.  The  degenerate  mass  predictions, 
mVf  =  0(l)eV\  could  be  confirmed  by  the  neutrinoless  double  beta-decay 
experiments.  This  prediction  is  universal  to  a  large  class  of  grand  unified  the- 
ories such  as  SO(10),  SO(18),  SU(16),  E6,SU(8)L  x  5*7(8)*  and  all  SO(2N) 
with  N  >  5. 

6.  As  shown  by  Hisano  et.al.  [25]  the  Higgs-colour-  triplet  mass  and  the  effective 
unification  mass  in  MSSM  can  be  constrained  by  the  CERN-LEP  data. 

7.  There  could  be  gravitational  smearing  effects  in  MSSM  on  a,  prediction  due 
to  5-dim.  operator  effects[26]. 

8.  There  threshold  uncertainties  on  MSSM  predictions  can  also  be  parametrised 
in  terms  of  effective  mass  parameters[9]  . 
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1.  Introduction 

This  lecture  is  about  aspects  of  Electroweak  Symmetry  Breaking  (EWSB).  There 
are  sections  on  Higgs  searches  in  the  Standard  Model  (SM),  on  WW,  WZ  and 
ZZ  scattering  as  probes  of  EWSB,  and  on  Higgs  phenomenology  in  the  Minimal 
Supersymmetric  Standard  Model  (MSSM). 

The  SM  has  one  complex  SU(2)-doublet  of  scalar  fields, 


with  vacuum  expectation  value  v  arising  from  the  self-interaction 

t;2/2]2. 


In  the  resulting  EWSB  pattern,  where  W  and  Z  fields  acquire  masses,  the  scalar 
field  components  w^  and  z  become  the  new  longitudinal  components  WL  and  ZL, 
while  H  survives  as  a  real  SU(2)-singlet  scalar  field  -  the  Higgs  boson  -  with  mass 
mfi  —  \/2At;.  To  recover  standard  Fermi  couplings  we  must  have  v  =  (V^Gf)"1/2, 
but  the  quartic  coupling  A  is  not  determined  a  priori. 

How  can  we  probe  the  EWSB  dynamics? 

(i)  Look  for  H°  ;  its  mass  measures  A;  its  couplings  to  W,Z  and  fermions  are  pre- 
scribed by  v  and  the  particle  masses,  and  should  be  checked. 
(ii)  Study  Wj~  and  ZL  scattering;  they  remember  their  origins  as  would-be  Gold- 
stone  bosons  w±  and  z,  and  their  scattering  via  V($)  will  be  strong  if  A  is  large. 
These  strategies,  motivated  here  by  the  SM  case,  extend  naturally  to  more  general 
scenarios.  We  shall  mostly  illustrate  the  possibilities  for  the  proposed  Large  Hadron 
Collider  LHC  at  CERN,  but  for  convenience  shall  also  take  some  examples  from 
studies  for  the  comparable  but  cancelled  SSC  project. 

2.  SM  Higgs  searches  at  hadron  colliders 

Higgs  hadroproduction  goes  mainly  through  the  diagrams  in  Fig.l  [1]: 

(a)  gg  —  »•  H  via  heavy  quark  loops  usually  gives  the  biggest  rate; 

fM  nn(n\  _*  H  via  WW  (r\r  7,  7,\  fusion  is  alsn  hitr   and  has  tameable  iets: 
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Figure  1.  SM  higgs  hadroproduction  subprocesses 

tag  the  leptons  from  W  — >•  tv  and  t  —+  bW  —>  btv. 

Fig. 2  shows  the  corresponding  lowest-order  LHC  cross  sections,  for  mt  =  140  GeV, 

and  Fig.3  gives  the  principal  decay  branching  fractions. 


cr(fb) 
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Figure  2.  SM  expectation  for  Higgs  hadroproduction 


2.1  Intermediate  mass  range. 

Light  Higgses  (mH  ^  Mz)  should  be  detectable  at  LEP1  or  LEP2,  but  the  inter- 
mediate mass  range  MZ  ~  m#  <  2Afz  needs  higher  energy.  The  dominant  H  — »•  66 
mode  was  widely  believed  to  be  undetectable  at  hadron  colliders,  because  of  enor- 
mous 66  backgrounds  from  QCD;  the  rf  mode  too  is  interesting  but  problematical. 
Until  recently,  most  attention  was  therefore  focussed  on  the  more  promising  signals 
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Figure  3.  SM  Higgs  branching  fractions 

Inclusive  H  —  *  77  signals  are  difficult  to  see  above  large  backgrounds  from 
99^19  —*  77  production  and  also  from  jet  +  jet  and  jet  +  7  states  where  jet  —  »•  TT° 
fakes  7.  To  extract  a  signal  requires  excellent  mass  resolution  6M77  <  1.5  GeV  and 
rejection  factor  R(jet  —  »•  fakej)  <  10~4  (see  GEM  studies  for  SSC[2]).  Fig.4  shows 
typical  predicted  M77  distributions,  before  and  after  background  subtraction.  Note 
the  very  low  signal/background  ratios  here,  requiring  very  high  statistics. 

Lepton-tagged  H  —  *  77  signals,  based  on  the  tiff  and  WH  production  chan- 
nels, are  smaller  but  much  cleaner  since  the  lepton  requirement  strongly  suppresses 
backgrounds,  so  less  stringent  mass  resolution  <5M77  <  3  GeV  and  rejection  factor 
R(jet  -*  fake  7)  <  5  x  10~4  may  suffice  (see  SDC  studies  for  SSC[3]).  Fig.5  shows 
typical  M77  signal  and  background  distributions. 

Four-lepton  signals,  based  on  H  —  >•  ZZ*  decays  with  one  Z  off-shell,  are  also 
very  clean  and  have  a  big  enough  branching  fraction  for  mjf  >  120  GeV  -  see  Fig.  3; 
the  dip  near  m/f  =  IMw  =  160  GeV  is  due  to  competition  from  the  H  —  »•  W+W~ 
channel  that  opens  here.  After  suitable^  pr  and  isolation  cuts  on  the  leptons,  the 
backgrounds  from  continuum  ZZ*  ,  Zbb,  Ztf  and  tf  production  are  small;  Fig.  6 
shows  typical  signal  and  background  curves  for  an  SSC  example[3]. 

Very  recently  it  has  been  argued  that  H  —  »  bb  dijet  signals  may  be  detectable 
after  all,  with  the  backgrounds  sufficiently  suppressed  by  suitable  tagging. 
(a)  Tag  two  leptonic  W-decays  in  the  gg  —  »•  tfH  —*  (bW+)(bW~)(bb)  channel;  this 
gives  a  2-lepton  +  4-jet  -f  missing-J^T  signature,  with  a  peak  at  m#  in  the  dijet 
invariant  mass  m(jj)  distribution  [4].  There  is  a  combinatorial  background  (from 
'wrong'  dijet  pairings)  and  a  much  bigger  background  from  ttjj  production;  a  nar- 


»-.  A 


Kilt      t  V> 


10000 


5000 


a)  H-*r7  :  BaF2 
After  Shower  Shape 


b)  Back.  Subtracted 


250 


100 


150 


150 


Figure  4.    H  -»  77  signals  for  mH  =  80,100,120,150  GeV  [2].    Plotted  on  y-axis: 
Events/0.4  GeV/SSC-year. 
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Figure  5.  Lepton-tagged  5"  -»•  77  signals  for  m#  =  80, 100,  ...160  GeV  [3] 
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Figure  6.  H  -»•  tf/'f'  signals  for  mH  =  120,  130,  ...170  GeV  [3] 


SSC  illustration  in  Fig.7  suggests  this  approach  could  possibly  work. 

(b)  Tag  3  or  4  6-jets  in  the  gg  ->  UH  -+  (bW+)(bW-)(bb)  channel,  using  a 
microvertex  detector  to  pick  out  the  displaced  vertices  from  6-decays,  and  look  for  a 
peak  in  the  26  invariant  mass  spectrum[5].  With  sufficiently  high  6-tag  efficiency  and 
purity,  all  background  channels  except  ttZ  (with  Z  —  *•  66)  and  tfbb  are  suppressed. 
Fig.  8  shows  an  SSC  illustration  with  3-6-tagging;  the  upper  curves  assume  tagging 
efficiencies  6&  =  0.4  and  cg  =  0.005  for  6-jets  and  gluoh-jets,  respectively;  the  lower 
curves  (displaced  by  100  units)  assume  values  0.3  and  0.01  instead.  Note  once 
more  the  low  signal/background  ratios,  requiring  high  statistics  in  this  game  !  For 
LHC,  taking  mn  =  100  GeV  and  mt  =  140  GeV  and  the  less  optimistic  tagging 
efficiencies,  we  need  integrated  luminosity  L  =  200  /6"1  to  get  a  signal  S  =  600 
events  over  a  background  B  =  14000  events  with  statistical  significance  S/\/B  = 
5  standard  deviations.  But  the  situation  improves  if  t  is  heavier;  with  mt  =  180 
GeV,  L  =  60  /6-1  is  enough  to  give  5  =  210,  B  =  1800,  and  S/JB  =  5  again[5]. 
(c)  Rapidity-gap  tagging  may  also  work  [6].  The  qq  —  *•  qqH  mechanism  has  no  colour 
exchange  and  is  therefore  unlikely  to  radiate  additional  soft  gluons  or  qq  pairs  in 
the  central  region,  unlike  the  other  H  production  processes  or  typical  background 
channels.  By  requiring  that  there  is  no  particle  production  (apart  from  an  H  —  »•  66 
dijet  candidate)  in  a  wide  central  interval  of  rapidity,  it  may  be  possible  to  suppress 
backgrounds  enough  to  extract  a  qq  —  >•  qqH  \H  —*•  66  dijet  signal.  But  such  rapidity 
gaps  can  get  filled  in  by  soft  interactions  of  spectator  partons  ('the  underlying 
event'),  or  faked  by  statistical  fluctuations.  Detailed  studies  are  needed  [6];  the 
intermediate-  mass  Higgs  case  has  received  little  attention  so  far[7]. 

We  see  that  one  or  other  of  the  popular  77  and  4-lepton  signals  should  be 
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Figure  7.  Doubk-lepton  tagged  H  —<•  bb  dijet  signal  and  background  for  mn  =  100 
[4].  tfjff  and  tfZ  contributions  are  shaded. 
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Figure  8.    H  -*•  W  dijet  signals  and  background  with  3  -  fc-jet  tagging  for 
80, 100, 120, 140  GeV  [5] 


For  heavy  Higgses  (rnjj  >  2Afz),  the  decay  chain  H  -»•  ZZ  —»•  4£  offers  a  very  clean 
'gold-plated'  signal  with  a  nice  peak  in  the  invariant  mass  m(££££)  and  relatively 
little  background,  mostly  from  continuum  ZZ  production.  Fig.  9  shows  cross  sec- 
tions for  this  signal  versus  m/y  ;  note  that  they  are  an  order  of  magnitude  smaller  at 
LHC  than  at  the  late  lamented  SSC,  requiring  that  much  more  luminosity.  Fig.  10 
shows  the  mass  peaks  for  ma  =  600  and  800  GeV  at  LHC  after  suitable  cuts[8]. 
We  see  that  the  mass  peak  broadens  and  becomes  harder  to  detect  as  mjj  increases; 
in  fact 


Thus  the  range 


TH  ^  (0.5TW)(mH/lTeV03. 
<  0.6  TeV  is  relatively  easy;  m#  >  0.8  TeV  is  hard. 
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Figure  9.  Cross  sections  for  the  H  —  »•  lU'l'  gold-plated  signals  at  SSC  and  LHC 


An  alternative  'silver-plated'  H  -*  ZZ  -*•  (t+t~}(vv)  signal  is  also  promising. 
It  has  a  somewhat  less  clean  signature,  namely  one  leptonic  Z  plus  missing-  ET, 
but  has  six  times  bigger  branching  fraction  than  the  gold-plated  signal  -  which 
could  be  helpful  at  large  m/j  .  The  main  backgrounds  from  Z  +  jets,  it  and  ZZ 
continuum  production  can  be  suppressed  by  suitable  cuts,  especially  by  requiring 
large  missing-  ET,  since  the  H  —  »  ZZ  decay  gives  a  Jacobian  peak  near  pr(Z)  = 
~  \ynijj  —  4M%  that  translates  into  a  peak  in  missing-  ET',  see  Fig.ll.  The  mass  mjf 
can  be  estimated  from  the  missing-  ET  distribution,  but  a  cleaner  estimate  (from 
a  narrower  peak)  can  be  made  using  the  'transverse  mass'  variable  mr^,  ^^)[9] 
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where  pr(yv)  =  #T-nrissing.  Both  pr(£fy  and  pr(^^)  have  Jacobian  peaks  that  are 
smeared  by  the  transverse  motion  of  H,  but  this  smearing  approximately  cancels 
in  the  combination  my. 
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Figure  11.  Silver-plated  H  -»  ZZ 
ET  plot  (an  SSC  example  [3]). 


llvv  signals  appearing  as- shoulders  in  the  missing- 


The  H  — »•  W+ W~  — »•  (tv}((.'v'}  signal  has  the  biggest  branching  fraction  of  the 
all-leptonic  modes;  the  signature  is  two  isolated  large-pr  leptons  plus  missing- ET-> 
but  there  is  a  horrendous  background  from  tt  — *  bW+bW~.  A  strategy  of  jet- 
tagging  and  anti-tagging  (vetoing)  has  been  proposed  to  rescue  this  signal.  Note 
first  that  the  important  qq  -+  qqH  production  subprocess  (see  Figs.l  and  2)  contains 
two  jets  with  typically  large  pseudo-rapidity  |tj|  ~  3  -  5,  but  no  central  jets  when 
fj  _>  tvKv1.  In  contrast,  the  QCD  background  process  gg,qq  -*  tt  -*  bW+bW~ 
contains  typically  no  forward  jets  but  has  two  central  6-jets.  Hence,  by  requiring 
at  least  one  forward  jet  with  3  <  \rj\  <  5  and  no  central  jet  with  |iyj  <  3,  the 
background  can  be  suppressed  relative  to  the  signal.  [Jet  tagging/vetoing  can  help 
to  reduce  backgrounds  to  the  previous  H  — *  ZZ  signals  also].  It  has  been  shown 
that  the  H  -+  WW  signal  can  be  separated  by  these  and  other  suitable  cuts,  at  least 
for  0.6  £  m#  ~  1  TeV[10j.  The  mass  mn  can  then  be  estimated  from  the  dilepton 
invariant  mass  m(tt' )  or  another  transverse  mass  m(tt' ,  vv'}  defined  by[ll] 
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because  we  no  longer  nave  the  constraints  m(tt)  —  m(yv)  —  MZ  and  indeed  m(yv  ) 
cannot  be  measured.  These  variables  peak  at  m(t?)  ~  ^m/f  and  mf(f.(!  \w'}  ~ 
f  m/f  ,  respectively. 

Combining  the  coverage  of  H  -*•  ZZ  ->  U,  H  -+  ZZ  -»  2^2i/  and  H  ->  WW  -+ 
signals,  we  see  that  heavy  Higgses  will  be  detectable  for  m#  %  1  TeV. 


2.3.  Studying  biggs  couplings  separately. 

In  principle,  the  strengths  of  H  couplings  to  WW,  ZZ,  it,  bb  can  be  separated  in 
various  ways;  examples  follow.  Note  that  the  signal  rate  in  each  particular  channel 
has  the  general  form 

<r(pp  -+  H  —+  channel}  —  <r(pp  —*  H)  x.  T(H  —  >•  channel)/Ta 

(a)  The  dominant  untagged  or  iepton-tagged  77  signals  depend  directly  on  the  Hit 
coupling,  both  at  the  H  production  vertex  (<-quark  loop  or  associated  production 
with  if)  and  at  the  H  decay  (via_<-quark  loop),  but  the  total  decay  width  in  the 
denominator  is  dominated  by  Hbb  at  the  relevant  low-intermediate  masses.  Thus 
the  rate  determines  (Hit}4  /(Hbb)2  approximately. 

(b)  If  the  WH  and  ttH  Iepton-tagged  77  signals  can  be  separated  via  final-state 
criteria,  their  ratio  will  determine 

(WH  -+  77  rate)  /(tiH  -+  77  rate)  ~  (HWW)2  /(Hit)2 

(c)  The  ttH  -*  tt(bb)  tagged  dijet  signals  d_epend  on  the  H  if  coupling  for  H  pro- 
duction, with  branching  fraction  B(H  —  »•  66)  ~  1  for  decay.  Thus  the  signal  rate 
determines  (Hit)2  approximately,  and  the  ratio  to  77  rates  determines 

(tagged  bb  dijet  rate)/(yj  rate)  ~  (Hbb)2  /  (Hit)2. 

(d)  For  high-intermediate  masses  m#  >  120  GeV,  the  total  width  Tjf  depends 
non-  trivially  on  Hbb,  HWW  and  HZZ  couplings,  so  nothing  simple  can  be  said 
about  the  H  —  *  4i  signals  here. 

(e)  For  heavy  Higgses,  the  ratio  of  WW/ZZ  signals  could  determine  the  ratio  of 
(HWW)f(HZZ)  couplings. 

(f)  The  ratio  of  jet-tagged/untagged  heavy  Higgs  signals  could  determine  the  ratio 
of  qq  —  *•  qqH  and  gg  —  *  H  production  mechanisms,  that  depends  directly  on  the 
(HVV)/(Ht?)  coupling  ratio,  where  V  =  W,  Z  but  W  dominates. 

(g)  TH  for  a  heavy  Higgs,  directly  measurable  from  the  width  of  the  mass  peak, 
should  be  dominated  by  the  HWW  coupling  with  smaller  contributions  from  H  ZZ 
and  Hit. 

3.  WW,  WZ  and  ZZ  scattering 

In  the  SM,  the  coupling  strength  of  the  scalar  potential  is 


A  =          =  r        H     I2 
2u2       L348GeVrJ 
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which  is  large  if  H  is  heavy,  implying  strong  scattering  of  the  Goldstone  bosons  w± 
and  z  that  become  Wj-f  and  ZL-  This  can  also  happen  more  generally,  outside  the 
SM.  Figure  12  shows  a  generic  WL,  ZL  scattering  process  at  LHC.  Different  final 
states  are  sensitive  to  different  physics:  W^W^  and  Z[,ZL  states  probe  Higgs-like 
resonances;  W^W^  and  W£Wj~  states  probe  non-resonant  scattering  (except  in 
some  exotic  models);  WI,ZL  states  probe  techni-/>-type  resonances. 


Figure  12.  Longitudinal  weak  boson  scattering  at  a  pp  collider 

The  idea  is  that  enhanced  WW,  WZ,  ZZ  production  may  be  attributed  to 
strong  WL  and  ZL  scattering,  since  the  transversely  polarized  states  WT  and  ZT 
only  have  the  weak  5(7(2)  x  t/(l)  gauge-  interactions.  Details  of  the  enhancement 
will  shed  light  on  EWSB.  To  discuss  this  scattering,  we  must  first  introduce  a 
number  of  ideas,  approximations  and  constraints. 

3.1  The  effective  W  approximation  (EWA). 

In  analogy  with  the  Weizsacker-  Williams  approximation  for  photons,  the  incident 
quark  beams  can  be  replaced  by  beams  of  approximately  on-shell  W  and  Z  bosons. 
Writing  P^,(x)  as  the  known  probability  distribution  (splitting  function)  for  find- 
ing a  W  with  polarization  a  and  momentum  fraction  x  in  quark  q,  the  subprocess 
cross  section  for  q^q-2  —*•  qaq^W^W^  is 


The  physical  cross  section  for  pp  —  »•  q^q^W^W^X  is  found  by  further  integrating 
over  incident  quark  distributions  and  summing  over  final  W  spins.  The  splitting 
functions  for  WL  and  ZL  are 

pL  _  pL  _   pL          _   pL  _ 

+      -  ^-      -  -  -TV-  /a  - 


L    _L 


where  xw  —  sin2  Ow  —  0.23,  Q  is  the  electric  charge  of  quark  q  and  g2  = 

is  the  SU(2)  gauge  coupling  squared.  The  EWA  is  the  only  known  way  to  separate 

the  contributions  of  WL  and  ZL  scattering  in  a  high  energy  process.  The  approxi- 


3.2  The  Goldstone  boson  equivalence  theorem  (GBET). 

This  theorem  [12]  relates  any  scattering  amplitude  for  real  WL  and  ZL  to  the  am- 
plitude for  the  corresponding  Goldstone  bosons  w  and  z: 

M[WL(Pl),  Wjtfo),  ...]  -  M[w(Pl),  w(p2),  ...}R  +  0(Mw/Ew), 

where  R  indicates  a  renormalizable  R%  gauge.  The  p,-  are  momenta  (masses  are 
neglected  here).  The  EWA  provides  essentially  real  incoming  WL  and  ZL\  *-ne 
GBET  allows  us  to  compute  their  scattering  from  any  given  model  of  the  Goldstone 
boson  sector,  up  to  corrections  of  order 


3.3  Isospin,  crossing  and  unitarity. 

The  various  ww,  wz  and  zz  scattering  amplitudes  are  all  related  by  SU(2)  isospin 
and  crossing  symmetries,  just  like  TTTT  scattering.  In  terms  of  components  it;,-  (i  — 
1,  2,  3;  w±  =  (wi  :p  ti;2)/\/2,  z  =  u>3),  the  general  amplitude  has  the  form 

M(waWb  —  »•  wcWd)  =  A(s,  t,  u}6at,8cd  +  A(t  ,  s,  u)8ac^bd  +  A(u,  t,  s)8ad8bc, 

where  s  =  (pa  -f  p6)2,  t  —  (pa  —  pc)2  ,  u  =  (pa  -  pd)2  and  A  is  symmetrical  in  its 
last  two  arguments:  A(s,t,u)  =  A(s,u,t).  In  terms  of  this  single  basic  amplitude 
A,  the  various  channel  amplitudes  are 

M(w+w~  —  >•  w+w~)  =  A(s,t,  u)  +  A(t,s,  u)., 

M(w+w~  —  *  zz}  =  A(s,t,u), 

M(zz-^zz)  -  J4(«,<,ti)  +  A(<,s,«)  +  A(«,<,*)> 

M(w±z  —  )•  w±z)  =  A(t,s,u), 

Af(w+w+  -+w+w+)  =  A(<>sltt)  +  A(u,f,s). 

The  amplitudes  for  s-channel  isospin  T  =  0,  1,  2  are 


Af(T=l)     = 


These  pure-isospin  channels  diagonalize  the  S-matrix,  so  their  s-channel  partial 
wave  amplitudes  a^  defined  by 


M(T)  =  327r(2L  +  1)  aj  PL  (cos  9) 

L 

should  satisfy  the  unitarity  condition  |2a£  —  i\  <  1,  i.e.  lie  within  a  circle  in  the 
complex  plane. 

3.4  Low  energy  theorem  (LET). 

Ifi  the  low-energy  limit,  Goldstone  boson  scattering  is  expected  to  be  controlled  by 
the  dynamics  of  EWSB.  Since  EWSB  in  the  SM  is  essentially  the  non-linear  sigma 
model  with  chiral  5f/(2)  x  S£7(2)  symmetry,  it  has  been  pointed  out[13]  that  the 
low-energy  theorems  of  TTTT  scattering[14]  can  be  directly  applied  to  the  case  of  w 
and  z  scattering,  simply  replacing  the  pion  decay  constant  Fr  —  94  MeV  by  the 

t)Af\  T»  ______         T       T*l  ____         C"  ____  1  ____  j.    T  _____        t  nnf 


A(S,l,  U)  ~  S/V    . 

Hence  the  low-energy  partial  wave  amplitudes  are 

a°  =       s        a1  =       s        a2 

This  linear  rise  with  s  cannot  be  extrapolated  indefinitely  without  violating  uni- 
tarity. Strictly  speaking,  since  these  formulae  give  purely  real  cr[,  any  non-zero 
value  lies  outside  the  unitarity  circle,  but  this  could  at  first  be  cured  by  adding 
a  small  imaginary  part.  However,  the  real  part  of  OQ  exceeds  its  unitarity  limit 
SftetjT  <  7}  when  v/s  >  V&nv  =  1.2  TeV,  while  the  T=2  amplitude  exceeds  this  limit 
for  A/S  >  1.7  TeV.  We  see  that  unitarity  is  likely  to  be  an  important  constraint  in 
the  TeV  range. 

3.5  Models  for  Goldstone  boson  scattering. 

A  wide  variety  of  models  have  been  proposed,  to  illustrate  and  parameterize  the 
scattering  of  w±  and  z.    We  confine  ourselves  here  to  the  models  evaluated  in  a 
recent  survey  by  Bagger  et  al[15]  of  possible  strong  WW,  WZ,  ZZ  scattering  at 
hadron  colliders, 
(i)  Standard  Model. 
This  gives  the  form 

_    2  m2 

A(s,t,u)  =  —^     (1  +  -         2    i  -•— F:    fl77\)> 
v  s  —  nifr  -f 


where  m/y  is  the  Higgs  mass,  v  =  246  GeV  is  the  usual  vev,  and  a  Breit-Wigner 

form  has  been  imposed  for  the  Higgs  resonance.  Q  is  the  step-function:  0(s)  =  1(0) 

for  s  >  Q(s  <  0).    A(t,s,u)  and  A(u,t,s)  are  obtained  by  permuting  s,t,u.   For 

illustration  we  shall  take  m/f  =  1  TeV[15]. 

(ii)  O(2N)  Model  [16]. 

Motivated  by  the  isomorphism  between  SU(2)L  *  SU(2}n  and  0(4)  symmetries, 

and  using  a  large-N  approximation,  this  model  gives 


ln(A2/k|)  +  i*0(8)]  ' 

where  Q(s)  is  the  step-  function  as  before.  Following  Ref.[15],  we  take  the  case  TV  =  2 

with  physical  cut-off  A  =  3  TeV. 

(iii)  Chirally-Coupled  Scalar  Model[15]. 

This  model  describes  the  low-energy  behaviour  of  a  technicolor-type  model  with  a 

techni-sigma  scalar  resonance,  through  the  amplitude 


where  MS  and  FS  are  the  scalar  resonance  mass  and  its  decay  width  into  Goldstone 
fields,  Ts  =  3<72M|/(327rv2).  We  choose  Ms  =  1.0  TeV  and  Fs  =  0.35  TeV, 
following  Ref.  [15]. 
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inis  model  describes  the  low-energy  behaviour  of  a  technicolor-  type  model  witn  a 
techni-rho  vector  resonance,  through  the  amplitude 

.         s  aM    .  u  —  s  t  —  s  , 


v 


where  My  and  TV  are  the  vector  resonance  mass  and  width  while  a  = 

We  choose  the  two  cases  Mv  =  2.0  TeV  with  Tv  =0.7  TeV  and  Mv  -  2.5  TeV 

with  IV  =  1.3  TeV,  following  Ref.[15]. 

(v)  LET-CG  Model[13] 

This  model  simply  extrapolates  the  low-energy  amplitudes,  prescribed  by  low- 
energy  theorems  (LET)  in  terms  of  the  vev  v: 

A(s,t,u)  =  s/v2. 

These  amplitudes  eventually  violate  unitarity,  as  we  saw  above,  so  the  partial  wave 

amplitudes  are  held  constant  after  they  reach  the  bound  |a£|  <  1  (less  stringent 

than  the  bound  SfajT  <  ^  we  mentioned  earlier). 

(vi)  LET-K  Model[18]. 

Here  the  LET  amplitudes  above  are  unitarized  instead  by  the  K-matrix  prescription, 


that  enforces  the  elastic  unitarity  condition  |2a£  —  i\  =  1. 

(vii)  Delay-K  Model  [15]. 

This  is  an  example  of  non-resonant  models,  based  on  an  effective  Lagrangian 

method[19].  To  order  p4  in  the  momentum  expansion,  all  model  amplitudes  have 

the  form 


where  fi  is  the  renormalization  scale  and  £,-(/i)  are  renormalized  coefficients  in  the 
effective  Lagrangian.  The  parameters  £<i(/-0,  ^(A*)  and  p  essentially  cover  all  pos- 
sible non-resonant  physics  in  this  approximation.  Following  Ref.  [15]  we  choose  the 
example  LI(^)  =  -0.26,  L^n]  —  0.23  with  fi  -  1.5  TeV,  for  which  unitarity  break- 
down is  delayed  until  ^fs  —  2  TeV.  These  amplitudes  are  complex;  for  simplicity, 
unitarity  is  imposed  by  the  K-matrix  prescription  based  on  the  real  part  of  ot£,  i.e. 


3.6  Model  results 

At  a  hadron  collider  we  can  only  hope  to  recognize  W  —>•  tv  and  Z  —*•  it  lep- 
tonic  final  states  in  the  face  of  QCD  multijet  backgrounds.  Even  so,  there  re- 
main backgrounds  from  electroweak  W  and  Z  production,  from  Drell-Yan  lepton 
pairs  and  from  the  semileptonic  decays  of  heavy  quarks.  Selection  cuts  to  enhance 
signal/background  include  lepton  isolation,  high  lepton  pr,  central  leptons  with 
\rj(l)\  £  2  ,  high  invariant  mass  m(it'}  for  WW  and  m(ZZ)  for  ZZ  ;  in  WW  cases 
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it  also  helps  to  require  the  two  leptons  to  be  back-to-back  in  azimuth  and  have 
high  relative  momentum.  Finally,  it  can  be  helpful  to  tag  one  of  the  spectator  jets 
and/or  veto  central  jets  with  \ij(j)\  <  3  (as  in  Section  2.2  for  heavy  Higgs  searches). 
The  following  table  shows  results  from  Ref.[15]  for  the  LHC,  with  typical  realistic 
cuts.  The  signals  here  are  the  total  production  of  WL^L  —  *  ^vtv  (etc);  the  back- 
ground column  (Bkgd)  includes  SM  production  of  Wj^V/Tt  WxWi  (etc)  calculated 
using  small  m//. 


Table  1.  Events  for  luminosity  100/6"1,  assuming  mt  =  140  GeV,  v^=  16  TeV. 


Case 
ZZ 

w+w~ 
w+z 

W+W+ 

Case 
ZZ 

w+w~ 
w+z 

W+W+ 

Vec2.Q 
0.4 
7.4 
3.3 

4.8 

Bkgd 
0.1 
15 
0.3 
1.7 

Kec2 
0.6 
6.1 
1.8 
7.3 

SM     Scalar 
3.9        2.7 
32          21 
0.3        0.4 
3.7        5.2 

.5     LETCG 
1.1 
8.3 
1.6 
16 

O(2N 
1.8 
16 
0.3 
4.3 

LETK 
0.9 
6.3 
1.4 

14 

) 

DelayK 
0.6 
5.5 
1.7 
8.3 

We  see  that  every  model  predicts  a  signal  in  at  least  one  channel,  although 
more  luminosity  would  be  desirable,  and  that  the  different  channels  are  helpfully 
complementary.  There  seem  to  be  good  prospects  for  probing  EWSB  via  weak 
boson  scattering. 

Fig.  13  illustrates  some  of  the  more  promising  signals  versus  invariant  mass 
m(ZZ)  for  ZZ,  m(ti'}  for  WW  and  a  suitable  tttv  transverse  mass  variable  mT  for 
WZ  cases  [15].  These  illustrations  are  for  the  defunct  SSC,  but  something  similar 
should  hold  for  LHC. 


4.  SUSY  higgs  possibilities  at  LEP  and  LHC 

The  Higgs  sector  is  the  shakiest  part  of  the  SM,  where  something  extra  may  well 
be  added.  Let  us  consider  the  new  degrees  of  freedom  coming  from  some  possible 
extensions[l]. 

(a)  SM  has  one  complex  scalar  doublet  $  with  vev  \/2  <  (j>°  >  =  t;  =  246  GeV. 
Its  4  degrees  of  freedom  show  up  as  W£,  W£~ ,  ZL  and  H°.   There  is  just  1  free 
parameter  m//  in  the  scalar  sector. 

(b)  MSSM  (minimal  SUSY  SM)  has  two  scalar  doublets  with  v/2  <  $  >=  v  cos  /? 
and  \/2  <  ^  >=  vsin(3.    Their  8  degrees  of  freedom  show  up  as  W£ ,  W£",  ZL 
plus  five  spin-0  Higgs  bosons  h° ,  H°,  A°,  H+ ,  H~ .    h°  and  H°  have  CP  =  +1 
(by  convention  mh  <  m//)  and  A  has  CP  =  -I.  There  are  now  2  free  parameters, 
often  taken  to  be  rn.*  and  tan  Q  = 


100 


100 


0.01 


2000 


Figure  13.  Typical  weak  boson  scattering  signals  (unshaded)  and  backgrounds  (shaded) 
[15].  Plotted  on  y-axis:  Events/50  GeV/SSC-year. 
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n  with  vev  <  n  >  =  x  non-zero  but  free.  The  10  degrees  of  freedom  show  up 
as  W£,  Wl,  ZL  plus  seven  spin-0  Higgs  bosons  h°,  tff,  #J,  A\,  A$,  H+  ,  H~ 
(where  A\  and  A?  have  CP  —  —  1).  The  number  of  free  parameters  now  jumps  to 
6,  however,  because  there  are  suddenly  many  more  possible  terms  in  the  general 
scalar  potential.  This  model  has  a  richer  spectrum  and  also  looser  limits. 
For  today  we  shall  concentrate  on  the  more  constrained  example  of  MSSM. 

4.1  MSSM  at  tree  level. 

All  MSSM  Higgs  masses  and  couplings,  plus  a  mixing  angle  «  that  relates  h  and  H 
to  Sft^i  and  Sfofo,  are  determined  at  tree  level  by  m^  and  tan  ft: 


tan  la     =     tan  2/?  (m2A  +  A/|)/(m^  -  M|) 


=        [ml  +  M  \  T       m    +  ^2  -         M    cos' 


with  —  £  <  a  <  0.  Hence  at  tree  level 

£,        "•"•  —— 


Thus  there  is  always  at  least  one  Higgs  boson  h  lighter  than  Z.  However,  in  the 
limit  mji  >  m^,  all  the  other  bosons  H,  A  and  H±  become  very  heavy  and 
approximately  degenerate  while  mh  ~  MZ  cos  2/?,'so  it  is  conceivable  that  only  this 
one  light  boson  may  be  accessible. 

4.2  MSSM  with  radiative  corrections. 


/      N 

,1  V 

/ 


Figure  14.  Typical  t  and  t  one-loop  graphs. 

MSSM  Higgs  physics  does  not  depend  only  on  the  scalar-sector  parameters  mA  and 
tan  0,  but  also  has  important  loop  corrections  because  the  top  quark  is  so  heavy 
(Fig.14).  These  radiative  corrections  depend  on  mt,  m(-  (the  mean  <-squark  mass), 
mQ,  mu  and  mD  (soft  SUSY-breaking  squark  masses),  At  and  Ab  (coefficients  of 
trilinear  iLiRfa  and  IL^R^I  soft  SUSY-breaking  terms),  and  n  (coefficient  of  fafo 
mixing  in  the  superpotential);  see  e.g.  Ref.[20j.  This  dependence  is  mainly  on  mt 
and  (logarithmically)  on  mf,  neglecting  the  other  parameters,  the  upper  limit  on 
m/i  becomes 

M|  +  2   ln(mf/m<), 


^  mZ 

lw 


in  one-loop  approximation. 

In  the  following  we  shall  estimate  these  radiative  corrections  by  assuming  that 
m(  ~  150  GeV,  mt-  ~  1  TeV,  and  that  squark  mixing  is  moderate  (explicitly 
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mQ  =  mu  -  mD  =  1At  -  lAi  =  4/i  =  1  TeV,  following  Ref.[21j).  With  this 
choice,  the  m&  upper  limit  is  m^  ^S  115  GeV;  the  full  dependence  of  mj,,  mjj  and 
m,H±  on  the  remaining  parameters  m^  and  tan/?  is  shown  in  Fig.l5[21].  Searches 
for  h  at  LEP1  (via  e+e~  -+  Z/»,  Ah)  have  already  covered  the  region  m/,  £  50  GeV, 
depending  somewhat  on  tan/?;  the  higher  energy  v^  ~  170  —  190  GeV  at  LEP2 
will  arguably  allow  searches  up  to  m/,  ~  90  GeV;  but  Fig.  15  shows  there  is  a  large 
region  of  parameter  space  (m^  £  90  GeV,  tan/?  £  2)  where  h  and  all  the  heavier 
Higgses  could  be  hiding  undetected.  This  is  LHC  territory. 
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Figure  15.  Behaviour  of  h,  H  and  H  *  masses  in  the 
corrections  [21]. 


,  tan  ft]  plane  including  radiative 


Compared  to  the  SM  Higgs  coupling  to  fermions  and  gauge  bosons,  given  by 
L  =  -(mi/v)fifiH  +  ±g2vWWH  +  ±(02  +  g'^vZZH,  the  MSSM  neutral  Higgs 
couplings  have  the  following  extra  factors: 


h  H  A 

it  cos  a  I  sin/?  sin  a/  sin/?     —175  cot  0 

>,  rf        —sin  a  I  cos/?  cos  a/ cos/?     —175  tan/? 

££       sin(/?  —  or)  cos(/?  —  cr) 

cos(/?  —  a)  sin(/?  —  a) 


We  see  that  h  and  #  share  their  coupling  strengths,  that  A  does  not  couple  directly 
to  WW,  ZZ  or  ZA,  and  that  for  large  tan/?  (small  cos/?)  all  the  66  couplings  are 


1  ne  charged  scalars  n  -1-  couple  to  typical  iermions  by 
L  =        ^M    H+[mtVt*  cot/3  t(l  -  J5)b  +  mbVtb  tan/?  f(l  +  78)* 
+meVe,  cot  /?  c(l  -  7s)s  +  mr  tan  /?  t/(l  -f  7s  )r], 


but  have  no  tree-level  coupling  to 

These  new  coupling  factors  and  new  kinds  of  coupling  show  that  the  production 
and  decay  of  MSSM  Higgses  are  not  just  trivial  extensions  of  the  SM  situation,  in 
general.  There  is  however  one  particularly  simple  region,  where  m^  becomes  large 
(  £  300  GeV);  here  A,  H  and  /f*  become  approximately  degenerate,  a  approaches 
ft  —  7T/2,  and  all  h  couplings  approach  SM  values.  In  this  region,  h  fakes  HSM 
while  the  heavier  Higgses  are  largely  irrelevant;  we  essentially  recover  the  SM  but 
with  the  extra  constraint  m^  ~  130  GeV  (from  MSSM  with  radiative  corrections), 
whereas  the  SM  alone  has  a  much  weaker  bound  mjfSM  ~  600  —  1000  GeV  from 
triviality  and  perturbative-unitarity  considerations. 

4.3  Parameter  constraints. 

What  are  the  constraints  on  the  MSSM  parameters  tan  ft  and  VYIA  ? 

(a)  Requiring  the  charged-Higgs  couplings  to  tb  above  to  remain  perturbative  gives 
mt/500GeV^  tan/??$500GeV7m&  [22];  the  precise  limits  are  somewhat  subjective. 
Essentially,  values  tan/?  «  1  and  tan/?  »  30  are  disfavoured. 

(b)  Proton  decay  sets  limits  in  SUSY-GUT  models,  typically  tan/?  <  85  [23]. 

(c)  Since  up-type  (down-  type)  quarks  get  their  masses  via  the  vev  v2  (vi),  the 
empirical  mass  hierarchy  rac  »  m,  and  m<  »  mj  suggests  a  corresponding  hi- 
erarchy of  vevs  v-2  »  vi,  i.e.  tan/9  »  1[24]. 

(d)  SUSY-GUT  models  generally  find  solutions  with  1  ^  tan/5  ^  mt/m&.  For  ex- 
ample, Ref.[25]  obtains  solutions  with  2  <  tan/?  <  40  while  Ref.[26]  finds  solutions 
in  the  range  0.6  <  tan/?  <  60. 

(e)  SUSY-GUT  models  where  electroweak  symmetry  is  broken  radiatively  (a  Higgs 
mass-square  goes  negative  during  evolution  downward  from  the  GUT  scale)  require 
tan£  >  1  (see  e.g.  Ref.[25]). 

The  consensus  on  tan  /?  is  therefore 

0£  tan/?<60. 

What  now  about  the  other  parameter,  m^  ? 

(f)  Higgs  boson  searches  at  LEP1  give  20  GeV  ^  m^  (assuming  tan  /?  >  1). 

(g)  In  SUSY-GUT  models,  mA  <  MSUSY  S  1  -  2  TeV. 

(h)  Global  electroweak  analysis  in  the  MSSM  finds  a  preferred  value  for  m^  below 
100  GeV  or  so,  but  much  larger  values  are  not  excluded  [27]. 
The  consensus  on  m^  is  therefore 

20  GeV  <  mA  <  1  -  2  TeV. 

Various  other  indirect  constraints  should  also  be  mentioned. 

(i)  LEP1  searches  for  e+e~  —  »•  H+H~  exclude  a  parameter  region  with  tan  ft  <  0 

[28]. 

(j)  Virtual  /f±  can  contribute  to  B  —  »  rvX,  but  essentially  not  to  B  —  »•  evX, 

T>»nmnno_     T       T>Viira        Si  i  Tvnl  Am  on  tr   TaaiiA      1  QQK 


limit  on  this  contribution  and  hence  a  bound  tan/9  <  0.54(m/r/l GeV)  [2yJ.  Inis 
is  not  very  stringent,  because  the  virtual  H*  and  standard  W^  contributions  add 
incoherently,  i.e.  a  10%  amplitude  gives  only  a  1%  correction, 
(k)  Virtual  H~t  loop  contributions  to  b  —>•  57  decays  apparently  offer  a  much 
more  stringent  constraint;  they  add  coherently  to  the  SM  W~t  loop  amplitudes. 
Neglecting  possible  additional  SUSY  loops,  the  present  CLEO  upper  bound  on 
B(B  —»•  yX)  ~  B(b  -+  s*y)  gives  rather  strong  lower  bounds  on  the  charged-Higgs 
mass  and  hence  on  regions  of  the  (m^tan/?)  plane  [30].  However,  in  the  present 
MSSM  context  we  cannot  ignore  SUSY  loops,  especially  chargino-stop  loops,  that 
can  interfere  strongly  and  destructively  with  the  SM  amplitude[31].  To  apply  this 
constraint  we  need  more  information  about  the  sparticle  spectrum,  e.g.  a  complete 
SUSY-GUT  model  with  all  masses  and  mixings  prescribed. 

4.4  MSSM  Higgs  production  and  decay  at  LHC. 

Production  and  decay  of  neutral  Higgses  can  proceed  by  the  SM  processes,  modulo 
the  additional  factors  discussed  above.  However,  there  are  new  non-SM  decay 
modes  like  h  — +  AA,  H  — *  AZ,  H  -*•  hh,  that  tend  to  dominate  when  kinematically 
allowed,  blotting  out  the  familiar  SM  signals  in  various  regions  of  parameter  space; 
see  Fig.  16.  Note  that  H  — *  hh  is  allowed  almost  everywhere  (except  in  the  shaded 
region  and  along  a  dashed  line  where  the  H  hh  coupling  vanishes),  making  H  usually 
hard  to  detect  at  LHC. 
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Figure  16.  Regions  wherenon-SM  neutral  Higgs  decays  compete  [21]. 
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>  mt  mere  can  sun  DC  signals  irom  gg  — >•  ton  production  with  tt~  -+  bt 
decay,  leading  to  a  bbbbW+W~  final  state;  multiple  6-tagging  is  essential  here[33j. 
There  are  regions  of  SUSY  parameter  space  where  SUSY  decay  modes  like 
h  —*  XiXi  to  lightest  neutralino  pairs  are  allowed;  this  is  potentially  dangerous 
since  x°  is  usually  the  lightest  sparticle  (LSP),  stable,  weakly  interacting  and  es- 
sentially invisible.  However,  after  LEP1  constraints  there  remains  only  a  very  small 
parameter  range  where  this  decay  dominates;  unless  we  are  very  unlucky,  this  may 
dilute  the  standard  signals  a  bit  but  will  not  wash  them  out.  Other  decays  like 
A  -+  xlxl  -+  (t+t~X°i)(t+t~X°i)  might  provide  interesting  new  signals[34]. 

4.5  Expected  MSSM  Higgs  signals. 

For  neutral  Higgses,  the  strategy  has  been  first  to  think  about  SM-Sike  signals  in 
conventional  77  and  4-lepton  channels,  and  also  in  66  and  rr  channels;  h,  H  and  A 
can  all  in  principle  give  such  signals.  For  charged  Higgses,  the  emphasis  has  been 
mostly  on  t  — >  bH+  — »•  brv  signals. 

Four  groups  have  made  extensive  calculations  for  LHC  and  SSC,  covering  the 
range  mA  >  20  GeV  and  0  <  tan/?  <  50  [21,  35,  36,  37].  All  get  broadly  similar 
results,  including  radiative  corrections  but  mostly  ignoring  possible  SUSY  decay 
modes,  and  estimate  in  what  regions  of  parameter  space  the  various  MSSM  Higgs 
signals  would  be  detectable.  Typical  results  from  Ref.[21]  for  mt  =  150  GeV  are 
shown  in  Fig.  17.  Notice  that  H  and  A  have  only  small  detectability  regions  for 
these  standard  signals,  and  even  the  h  regions  are  restricted,  although  the  MSSM 
— *  SM  limiting  region  (mA  —+  large)  is  covered  as  we  would  expect.  In  fact,  the 
conventional  h,H,A  — *  77,  ZZ* ,  ZZ  and  t  -*  H+b  searches  do  not  cover  the 
whole  (rnA,tan/?)  parameter  plane.  Even  when  we  include  LEP1  and  LEP2  search 
potential,  there  is  an  intermediate  region,  approximately  110  <  mA  <  160  GeV 
and  tan/?  >  3  (depending  on  your  choice  of  m<  and  authors),  where  none  of  the 
usual  signals  would  be  detectable. 

There  have  been  various  responses  to  this  inaccessible  area  (IA). 

(a)  Additional  signals  from  H  — >•  rr  and  A  — *  rr  could  possibly  cover  the  large- 
tan/?  part  of  the  I A  [37],  but  identifying  and  measuring  tau  pairs  is  very  difficult. 

(b)  Additional  signals  from  A  — »  Zh  — »  iirr  would  not  reduce  the  IA;  neither 
would  new  A  — »•  ^2X2  ~~*  ^^-r-missing-pj'  signals[34]. 

(c)  SUSY-GUT  models  have  attractive  solutions  where  the  top  Yukawa  coupling  At 
approaches  a  fixed  point  (see  e.g.  Ref.[26])  and  gives  a  constraint  m<  ~  (200  GeV} 
sin/?.    With  mt  ~  150  —  170  GeV,  this  defines  a  small-tan/?  region  outside  the 
IA,  and  incidentally  predicts  that  h  is  light  enough  to  be  discovered  at  LEP2  [38]. 
However,  even  in  this  optimistic  scenario,  not  all  5  Higgses  can  be  simultaneously 
detectable. 

(d)  With  efficient  multiple-6-tagging,  the  signals  from  tih(H,A)  — >•  6666 WW  or 
bbh(H,A)  — >•  6666  could  be  visible  at  LHC.  For  tan/?^5,  where  the  Higgs  couplings 
to  6  are  enhanced,  Ref.[39]  argues  that  it  will  be  possible  to  detect  either  A  and  /i, 
or  A  and  J7,  and  essentially  remove  the  IA.  Fig.  18  shows  discovery  regions  at  the 
two-sigma  level  for  luminosity  100  fb~1   =  one  LHC  year  (i.e.  the  two-sigma  level 
after  four  years)  with  mt  =  150  GeV  and  mt-  =  1  TeV;  the  boundary  curves  marked 
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x  (GeV) 


Figure  17.  (a)  Combined  Potential  discovery  regions  for  at  least  one  MSSM  Higgs  boson 
at  SSC  are  shown  as  the  unshaded  area,  (b)  Expanded  view  of  the  region  of  no  coverage 
shown  as  the  shaded  area.  Contours  of  m*  (solid),  m#  (dashed)  and  mH+  (dotted)  are 
superimposed. 
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tiiggs  Searches  ana  WW  scattering 

h,ij,k,l,m  correspond  to  UH,  tth,  it  A,  bbH,  bbh,  bbA  signals  respectively,  and 
the  label  lies  on  the  detectable  side[39]. 
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Figure  18.  Expected  LHC  discovery  regions  for  tth(H,A)  and  bbh(ff,A)  signals  [38]. 

(e)  Alternatively,  e+e~  colliders  with  energy  above  LEP2  (where  ^/s  ~  170  — 
190  GeV  at  most  is  planned)  would  be  able  to  find  at  least  one  of  the  neutral 
Higgses  via  e+e~~  — >•  hZ,hA,HZ,HA  searches,  since  there  is  an  upper  bound  on 
rrih-  Fig. 19  shows  the  necessary  conditions  on  ^/s  and  luminosity  C  to  cover  the 
whole  (mA ,  tan  (3)  half-plane  with  tan  /?  >  1 ,  using  Zh  — *  rrjj,  ttjj,  vvjj  and  jjjj 
final  states  with  b-tagging  [40].  The  condition  depends  on  mt  because  of  radiative 
corrections  (mf  =  1  TeV  is  assumed  here). 

4.6  Summary. 

(a)  The  five  MSSM  Higgses  have  a  richer  spectrum  than  the  SM,  but  are  in  some 
ways  more  elusive. 

(b)  The  lightest  scalar  mass  is  bounded  much  more  stringently  than  in  the  SM. 

(c)  There  will  be  evidence  for  physics  beyond  the  SM  if  we  find  any  neutral  scalar 
with  non-SM  coupling,  or  more  than  one  neutral  scalar,  or  a  charged  scalar. 

(d)  The  principal  Higgs-sector  parameters  are  niA  and  tan  (3,  with  radiative  correc- 
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Figure  19.  Conditions  for  covering  the  (m^,  tan  /?)  parameter  space  with  an  e+e     collider 
[39]. 


ana  nence  mggs  signals  -  vary  strongly  across  me  ^m^ ,  tan  p )  plane. 

(e)  LEP1  plus  LEP2  coverage  in  this  plane  is  limited  (approximately  to  the  m/,  <  90 
GeV  region  in  Fig.l5(a). 

(f )  LHC  coverage  (earlier  SSC  studies  are  similar)  of  the  remaining  parameter  space 
was  thought  to  be  incomplete;  but  recent  studies  of  tth(H,  A)  — *  bbbbWW  and 
bbh(H,  A)  — »•  6666  signals  assuming  efficient  multiple-6-tagging  remove  this  weak- 
ness. 

(g)  An  e+e~  collider  with  energy  just  a  little  way  above  LEP200  could  also  guar- 
antee to  see  at  least  h. 

(h)  Future  useful  signals  could  include  A  —*•  Zh  — »•  l£rr,  A  — +  XiXi  ~+  4^-J-missing- 

py,  heavy  H^  — • »•  tb. 

(i)  Future  theory  constraints  could  come  from  SUSY-GUT  refinements  and  6  — *  sf. 
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Abstract.  In  this  talk,  we  review  the  QCD  calculations  of  the  lepton  spectrum  from 
inclusive  semileptonic  B  decay.  We  compare  this  prediction  to  that  of  the  ACCMM  model. 
This  latter  work  was  done  in  collaboration  with  Csaba  CsakL 


In  this  talk  we  discuss  the  calculation  of  the  inclusive  semileptonic  decay  lepton 
spectrum  from  B  — »  Xev.  We  first  discuss  the  motivation  for  renewed  interest 
in  this  calculation,  and  discuss  the  use  of  the  OPE  and  HQET  to  determine  this 
spectrum.  We  then  compare  this  prediction  to  that  of  heavy  quark  models,  in 
particular  that  of  Altarelli,  Cabbibo,  Corbo,  Maiani,  and  Martinelli  [1]  (hereafter 
referred  to  as  ACCMM).  I  will  concentrate  on  B  — »•  Xcev  and  not  the  detailed 
questions  of  the  endpoint  of  the  spectrum  which  are  important  for  B  — >•  Xuev. 

There  are  several  reasons  for  renewed  interest  in  this  prediction.  First  of  all  is 
the  copious  production  of  6  quarks  at  LEP  and  CLEO.  A  good  understanding  of 
the  decay  spectrum  is  necessary  for  understanding  fundamental  6  quark  parameters, 
which  are  being  precisely  studied,  such  as  the  6  width  and  the  forward-backward 
asymmetry  at  LEP  and  those  couplings  relevant  to  the  rare  decays  studied  by 
CLEO.  Furthermore,  with  a  good  understanding  of  the  spectrum  one  can  do  de- 
tailed measurements  of  heavy  quark  fragmentation,  which  could  give  interesting 
tests  of  heavy  quark  theory  and  QCD  predictions  [2,  3].  Furthermore,  in  order  to 
do  high  statistics  measurements,  one  needs  to  do  inclusive  measurements.  On  the 
theoretical  side,  it  is  of  interest  to  study  the  inclusive  decays  because  they  are  under 
much  better  control  theoretically  than  predictions  for  exclusive  modes,  where  one 
needs  to  know  hadronic  matrix  elements.  An  understanding  of  the  lepton  spectrum 
is  therefore  crucial  to  extracting  fundamental  parameters. 

In  the  seminal  paper  of  Chay,  Georgi,  and  Grinstein  [4]  it  was  shown  that  one 
can  treat  the  decay  spectrum  with  the  operator  product  expansion  and  heavy  quark 
methods.  They  showed  the  leading  A/m  corrections  to  the  free  quark  result  vanish 
in  the  matrix  elements.  Subsequently,  Bigi,  Blok,  Koryakh,  Shifman,  Uraltsev, 
and  Vainshtein  [5]  studied  (A/m)2  corrections.  They  compared  the  results  to  a  free 
quark,  ACCMM  model  where  they  did  not  see  the  vanishing  of  the  A/m  corrections 
as  manifest.  Following,  there  were  further  calculations  of  (A/m)2  suppressed  effects 
by  Falk,  Luke,  Savage  [7]  in  the  context  of  b  — *  sj  and  by  Manohar  and  Wise  [6]  in 
the  context  of  semileptonic  decays.  We  will  borrow  heavily  from  these  latter  two 
papers  in  our  review  of  the  QCD  calculation. 
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,  E)  _  G*Fm*  x\xm  -  a;)2 
dz        ~    96*3      (i_x)3     K1  ~  *X3  ~  2*)  +  (!  -  *«)(3  ~  *)J  C1) 

The  idea  is  then  to  justify  and  improve  on  this  free  quark  result.  For  the  study 
of  the  inclusive  decay,  we  sum  over  all  final  states  with  given  quantum  numbers. 
The  spectrum  and  rate  of  the  inclusive  decay  is  then  governed  by  short  distance 
physics.  Intuitively,the  justification  is  that  in  the  m  -*  oo  limit,  the  decay  time 
is  much  shorter  than  hadronization  time  scale.  With  the  OPE  and  HQET,  this 
can  be  made  precise.  The  crucial  observation  is  that  there  is  a  large  range  of  mass 
scales  in  the  final  state  m|j  <  P%  <  nig  so  that  the  energy  flowing  through  the 
hadron  system  scales  with  the  mass  of  the  decaying  heavy  quark.  In  the  m  —  *•  oo 
limit,  this  is  much  larger  than  the  QCD  scale.  Away  from  P%  «  m^,,  the  internal 
quark  is  far  from  mass  shell.  In  this  case,  we  expect  the  OPE  to  be  useful;  it  will 
prove  valid  except  near  the  endpoint. 

The  general  idea  is  to  relate  the  square  of  the  matrix  element  of  interest  to  the 
imaginary  part  of  the  time  ordered  product  of  currents.  One  can  then  perturb  atively 
compute  the  time  ordered  product  with  the  operator  product  expansion  (in  the 
region  of  phase  space  where  it  is  valid).  However  unlike  a  standard  OPE  it  is  useful 
to  expand  in  the  heavy  quark  mass  rather  than  q2  in  order  to  apply  HQET.  The 
coefficients  of  the  heavy  quark  operators  are  then  obtained  by  evaluating  the  time 
ordered  product  between  quark  and  gluon  states.  The  time  ordered  product  is 
evaluated  in  the  end  by  taking  matrix  elements  of  the  operators  appearing  in  the 
OPE  between  meson  states.  One  can  use  the  HQET  to  learn  about  these  matrix 
elements. 

Semileptonic  decay  is  determined  from  the  weak  hamiltonian  density: 

~  " 


The  inclusive  differential  decay  rate  for  HI  —  >•  XUtCeI^  (here  we  will  be  restricting 
attention  to  meson  decay)  is  governed  by 

Wfv  =  (2T)3  £  6*  (Plfb  -q-Px)  (Hb(v,  8)\J^  \X}  (X\  J/  \H>(v,  «))  (3) 

x 

where 

PH.  =  MH.V^  =  mbv»  +  k»  (4) 

The  W*v  can  be  expanded  in  terms  of  five  form  factors 

IV**  =  -g^Wi  +  v^vTWi  -  i^v^vaq^W3  +  q»qvW4  +  (q»vv  +  g'w")  Ws,   (5) 
One  then  relates  this  to  a  time  ordered  product  via  Im  T»v  =  -TrW*"  where 

T"*     =     -i 


q"qvT4  +  (q"vv  -f-  q"^)  T5. 
Let  us  consider  the  analytic  structure.    There  are  cuts  corresponding  to  the 


As  an  example,we  find  the  leading  order  result.  This  is  done  explicitly  in  [6].  The 
matrix  element  of  the  time  Fourier  transformed,  time  ordered  product  of  currents 

-i  fd^xe-iq-xT(J^Jv)  (6) 

is 

V)  V  ft  «,  (7) 


Using 

7"7V  =  0"V  +  f  "V  -  ^V  +  «V""*7/,7s.  (8) 

one  finds  the  order  Jb°  term  is 

~         u{  (m|t»  -  ?)"  7"  +  (m6t,  -  g)"  7*  -  (m^  -  ^)  g*» 

L\Q  \- 

-*•""•"  (miv-q)^  }PLU,  (9) 

where 

A0  =  (mbv  -  q)2  -  mj  +  ie  (10) 

By  replacing  the  matrix  element  with  the  6  quark  operator  which  yields  this 
amplitude  when  evaluated  between  6  quark  states,  one  obtains 

•—         [(mbv  -  q?  gvx  +  (mbv  -  q}v  g^  -  (mbv  -  q)X  g»" 


Because  it  is  a  current,  one  has 

(Hb(v,s)\bixb\Hb(v,s))=v\  (11) 

since  6  quark  number  is  an  exact  symmetry. 

So  we  can  evaluate  the  leading  order  contribution  exactly.  By  taking  the  matrix 
element  of  the  OPE  between  H  states,  we  get 


Z?     =     X-"»> 

^0 

rpQ         _  1 

3     ~     2A0 

So  to  get  the  amplitude  W^v  which  determines  the  spectrum,  we  need  the  imaginary 
part  of  T^v  .  This  is  readily  obtained  from 


2-m2)  (12) 

=     -6'((mbv-q)2-m2}  (13) 

qf-m2)  (14) 
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The  mass  sneu  condition  above  wnen  integrated  over  phase  space  leads  to  just 
the  free  quark  decay  distribution! 

However,  with  these  methods,  one  can  also  consider  higher  order  corrections. 
The  important  result  will  be  that  AQCD/TO  corrections  vanish  and  only  two  matrix 
elements,  one  of  which  is  known,  are  required  to  obtain  (Aqcn/m)2  corrections 

We  first  introduce  Heavy  quark  fields  and  the  heavy  quark  effective  theory. 
Recall  that  in  the  heavy  quark  limit,  the  heavy  quark  mass  and  spin  decouple  from 
the  soft  degrees  of  freedom.  The  heavy  quark  state  looks  like  a  static  point  source 
of  charge.  The  b  quark  is  almost  on  mass  shell  p  =  mbv  +  k. 

Define  the  heavy  quark  field 

i   ,  ,    _N 

*+•-.,  (15) 

with  leading  order  lagrangian 


Now  expand  in  terms  of  HQET  operators.  We  follow  [7]  who  give  the  leading 
term  in  terms  of  the  standard  quark  field,  but  the  mass  suppressed  operators  in 
terms  of  heavy  quark  fields. 


As  before,  at  leading  order,  we  have 

(fl|b/b|B)=2P5  (18) 

Now  consider  the  first  order  mass  suppressed  terms. 

(M\  01  \M)  =  (MlhTiD^h  \M)  =  (M\hTv^  •  itoh  \M)  (19) 


This  vanishes  by  the  equation  of  motion!    Notice  this  critically  depends  on  our 
phase  choice  in  defining  heavy  quark  field. 
An  arbitrary  counterterm 

Smbvbv  (20) 

was  taken  to  vanish.  With  this  choice  of  quark  mass,  all  linear  corrections  to  the 
free  quark  result  vanish! 

At  higher  order  in  1/m,  one  needs  the  two  matrix  elements 

Kb    =    -W«,«)l*«^  6.^(1;,  «)) 

<7,    =    Z>(ffM\t*£j^bv\H>(v,s)},  (21) 

b 
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Ine  second  matrix  element  is  known  because  it  is  related  to  the  known  spin 
dependent  mass  splittings.  The  first  is  not  known,  and  is  a  parameter  to  be  deter- 
mined. 

So,  to  summarize,  the  leading  order  result  for  the  rate  and  spectrum  is  the 
free  quark  result.  The  first  order  corrections  vanish.  At  second  order,  there  is  one 
unknown  coefficient  and  one  which  has  been  determined.  This  analysis  has  assumed 
we  are  far  from  endpoint,  where  the  OPE  is  valid.  Had  we  applied  this  near  the 
endpoint,  higher  dimension  operators  would  not  be  suppressed  and  it  would  be 
necessary  to  average  the  spectrum  over  a  range  of  energies. 

One  can  compare  this  result  to  the  predictions  of  models.  It  is  interesting  to 
see  how  models  reflect  and  differ  from  these  general  QCD  predictions.  Consider 
for  example  the  ACCMM  model.  Here  I  review  work  of  Csaba  Csaki  and  myself 
[8],  but  other  interesting  references  are  that  by  Grant  Baillie  [9]  and  Ref.  [10]. 
One  finds  that  one  can  always  define  a  quark  mass  so  that  1/m  corrections  vanish. 
However,  at  1/m2,  the  model  would  differ  from  the  QCD  result. 

In  the  ACCMM  model,  one  models  a  B  meson  decay  as  disintegration.  There 
is  a  spectator  quark  with  mass  mip  and  momentum  distribution  </>(|p|). 

(22) 

The  b  quark  momentum  is  determined  by  kinematic  constraints.  The  lepton 
spectrum  is  determined  from  the  decaying  off  shell  6  quark.  Define 


+  rn}p  (23) 

The  invariant  mass  of  the  6  quark  will  then  be 


W2  =  ml  +  m]p  -  2mB  ,/p2  +  m^  (24) 

Define 


Xm      =      l-€ 

Recall  the  formula  for  free  quark  decay. 


Then  the  lepton  spectrum  in  the  ACCMM  model  is 


dE 

dcosO 


f  ±^L6(E  -  yE'  -  f/3E'  cos  Bp 


>na        T      T>V,,,a        C i , rvnl om cm t   TaaiiA      1  QQK 


dTB 

~dE  =  I  UPV  9U™2*?  /  dE'dcosd"™0-]?  (26) 


Using  /?72  =  pEw/W(p)2  yields 

dTB        f       o             W2      y^ma*  fifii  dY(W  £j'\ 
•••  =   /  dpp  <f>\\p\) /          —jr — -  (e)'7\ 

where 

E*  =  'EW~±~P  (28) 

•f-ffnax      ~~      mtn^xS^.,         II  ~~  Xfnjj 

2 
mB       nit 

Now  do  a  heavy  meson  mass  expansion. 

"'•dE1 


/ 

= yo 


777  I      I  3- 

0 

We  can  explicitly  evaluate  the  E1  integral,  to  get 
dTB       G2FE2   fpm"  dp   f  ,8 

TrT  =     nt    q      /  -=. —      6EwW    —  SEEfir V*E  1  n2(h(\n\\  (1*)} 

dE         24;r3  Jn         Ew  \  ^      I  ^  ™\\P\)  \A^) 

Take  pj  small. 


dE        dE  +    12^^  (SmB      LZ)mB+P^^'JE  ^ 

It  looks  like  there  are  nonvanishing  linear  corrections,  but  that  is  because  we  ex- 
panded in  the  meson  mass  mBt  rather  than  the  quark  mass  mb.  To  do  a  heavy 
quark  expansion,  one  needs  to  define  a  quark  mass.  This  can  be  done  so  that  the 
linear  correction  vanishes. 

dTB  __  G2FE2 


dE 
If  we  define 

mb  = 
Up  to  quadratic  terms  in  p,we  have 

(W)  =  (Ew)  =mb  =  mB-  (p} 

(f(W,Ew))  =  f(mB)mB)  -  (p}f'(mB,mB)  +  O(p2)  =  /(m6,m6)  +  O(p2)   (37) 

We  see  that  it  was  possible  to  define  a  6  quark  mass  so  that  linear  corrections 
vanished  because  the  p  dependence  was  all  through  Ew.  So  we  can  eliminate  linear 
corrections  through  a  proper  choice  of  mj.  It  is  easy  to  extend  this  to  nonzero 
spectator  mass. 


(Ew)  =  mB~  Um2 


The  inclusive  semi-leptonic  B  decay  lepton  spectrum  ... 
Now  consider  the  total  rate  and  the  full  spectrum. 

(39) 
However,  the  spectra  themselves  are  very  different  near  the  endpoint. 


B 


where  £,  =  W(l  -  f)/2  ,  pi  =  mfl/2(l  -  ^f  -  £  for  m.f  =  0.  This  spectrum 
extends  up  to 


We  see  the  ACCMM  spectrum  and  the  free  quark  spectrum  deviate  due  to  two 
effects  near  the  endpoint.  The  first  term  is  not  integrated  up  to  p  w  oo  and  the 
second  term  integrates  over  a  rapidly  falling  spectrum  so  it  is  no  longer  a  good 
approximation  to  replace  (f(W)}  by  /((W)).  The  spectrum  extends  beyond  naive 
quark  mass  endpoint. 

Since  we  have  a  specific  model,  we  can  also  investigate  the  question  of  how  large 
an  energy  interval  must  be  averaged  over  to  get  good  agreement  between  the  model 
and  the  free  quark  prediction.  In  ref.  [8]  we  used  the  averaging  function 

(40) 

We  expect  to  require  an  average  of  approximately  2p/  to  get  (p//ra)2  agreement. 
This  was  appproximately  correct.  We  also  checked  that  the  best  fit  quark  mass  is 
generally  very  close  to  the  exact  quark  mass,  so  the  deviation  of  the  model  from 
the  free  quark  prediction  is  probably  not  very  important  for  practical  purposes. 

At  higher  order,  the  predictions  of  this  model  will  not  agree  with  general  QCD 
predictions.  This  is  because  gauge  invariance  is  not  incorporated.  There  are  cor- 
rections to  the  free  quark  result  due  to  the  fact  that  terms  proportional  to  {p2}  and 
(p2}  both  yield  1/m2  corrections.  Recall  that  in  the  heavy  quark  theory  the  op- 
erator (DQ)  contributions  only  at  higher  order  in  the  heavy  quark  mass  expansion 
by  the  equation  of  motion.  In  the  ACCMM  model,  both  contribute  at  the  same 
order.  Obviously,  the  ACCMM  model  also  doesn't  give  you  spin  dependent  gluon 
operator.  We  conclude  that  although  the  model  is  probably  adequate  in  practice 
(as  would  be  free  quark  decay)  it  does  not  properly  incorporate  QCD  dynamics. 

We  conclude  that  there  has  been  much  advancement  in  our  understanding  of  the 
lepton  decay  spectrum.  It  is  unclear  at  what  level  this  will  be  tested.  Most  of  the 
large  deviations  from  the  free  quark  prediction  are  in  the  endpoint  region,  where 
one  must  smear  to  get  reliable  predictions.  Furthermore,  the  higher  order  correction 
involves  an  unknown  parameter  and  requires  very  accurate  measurements.  How- 
ever, from  a  practical  point  of  view,  when  using  the  spectrum  to  extract  6  quark 
couplings,  we  see  that  the  free  quark  decay  models  6  quark  decay  very  well.  It  might 
be  better  to  fit  the  spectrum  to  a  free  quark  spectrum  that  that  of  a  model,  such 
as  the  ACCMM  model.  The  vanishing  of  AQCD/TO  corrections  is  very  important  to 
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CP  violation  at  colliders 


SAURABH  D  RINDANI 
Theory  Group,  Physical  Research  Laboratory, 
Navrangpura,  Ahmedabad  380  009,  INDIA 

Abstract.  The  prospects  of  experimental  detection  oiCP  violation  at  e+e~  and  pp/pp 
colliders  are  reviewed.  After  a  general  discussion  on  the  quantities  which  can  measure  CP 
violation  and  on  the  implications  of  the  CPT  theorem,  various  possibilities  of  measuring 
CP  violation  arising  outside  the  standard  model  are  taken  up.  CP  violation  in  leptonic 
processes,  especially  polarization  effects  in  e+e~  —*  l+l~  are  discussed  next.  CP  violation 
in  ft  and  W*  W~  production  and  decay  is  also  described. 


1.  Introduction 


1.1  CP  violation  in  the  standard  model 

CP  violation  in  the  standard  model  (SM),  as  is  well  known,  arises  due  to  complex 
Yukawa  couplings,  and  finally  shows  up  through  quark  mixing  in  the  Cabibbo- 
Kobayashi-Maskawa  matrix  as  a  single  phase.  The  reparametrization  invariant 
quantity 

J  =  sin2  &i  sin  #2  sin  #3  cos  Q\  cos  02  cos  63  sin  6  (1) 

is  a  measure  of  CP  violation  in  SM,  and  though  small,  produces  measurable  effects 
in  the  K  meson  system,  and  hopefully  the  effects  in  the  6-quark  states  will  be 
measurable.  The  effects  in  other  sectors  (as  for  example  the  neutron  electric  dipole 
moment)  and  in  high-energy  processes  is  generally  predicted  to  be  too  small  to  be 
observed. 

In  the  leptonic  sector  of  SM  the  prospects  of  CP  violation  being  observable 
are  worse,  since  there  is  no  analogue  of  the  CKM  phase  in  the  absence  of  neutrino 
masses.  CP  violation  in  leptonic  systems  has  to  feed  in  from  the  ha«ironic  sector 
through  loops.  For  example,  electric  dipole  moments  (EDM)  of  charged  leptons  are 
induced  due  to  the  W  EDM,  and  are  found  to  vanish  up  to  three-loop  order.  The 
electron  EDM  is  then  estimated  to  be  around  10~41  e  cm.  [1] 

Thus,  any  new  observable  CP  violation  would  be  a  signal  of  non-standard 
physics.  It  might  be  mentioned  that  perhaps  there  is  already  a  hint  towards  non- 
standard  CP  violation  in  current  ideas  on  electroweak  baryogenesis. 

1.2  Scenarios  for  CP  violation  beyond  SM 

CP  violation  beyond  SM  can  arise  due  to  almost  any  extra  Yukawa  couplings  which 
can  be  complex,  and  possibly  also  due  to  new  Higgs  self-interactions  and  complex 


scalar  vacuum  expectation  values.  Thus,  introduction  of  extra  fermions  or  scalars 
could  give  rise  to  new  sources  of  CP  violation. 

Retaining  the  gauge  group  to  be  5T/(2)£,  x  ^(1)»  CP  violation  can  arise  due  to 
extra  fermion  or  Higgs  doublets  or  singlets.  Since  the  SM  measure  of  CP  violation 
J  (eq.(l))  is  small  owing  to  the  small  mixing  angles  among  quark  generations  as 
experimentally  observed,  larger  CP  violating  effects  would  arise  if  there  are  extra 
generations  of  quarks,  whose  mixing  angles  may  be  less  constrained.  If  there  are  new 
fermions  (quarks  or  leptons)  in  exotic  representations  (left-handed  singlets  and/or 
right-handed  doublets  of  5^(2)^)  there  are  further  complex  flavour-changing  cou- 
plings to  Z  which  violate  CP. 

Supersymmetry  requires  the  addition  of  extra  scalars  and  fermions,  whose  cou- 
plings violate  CP.  In  left-right  symmetric  models,  again,  there  further  sources  of 
CP  violation. 

1.3  Use  of  effective  Lagraugians  for  model- independent  analysis 

There  is  a  large  variety  of  sources  of  CP  violation  beyond  SM,  and  rather  than 
discuss  predictions  of  each  model  for  each  observable  quantity,  it  is  more  econom- 
ical to  analyze  CP- violating  quantities  in  terms  of  the  parameters  of  an  effective 
Lagrangian.  Examples  of  CP-violating  terms  in  an  effective  Lagrangian  with  which 
we  will  be  concerned  here  are  given  below: 


2  i 

\v 


+ 


(2) 


Wpv  =  dpWv-dvWn).  These  terms  are  of 
dimension  <  6.  In  eq.(2),  V>i  refers  to  various  fermionic  fields  (quarks  and  leptons), 
whose  electric  and  "weak"  dipole  moment  is  given  by  d^i  and  d^^  respectively. 
It  should  be  noted  that  all  the  parameters  are  in  reality  scale  dependent  "form 
factors",  and  can  be  complex. 

2.  Some  general  considerations 


2.1  Observable  quantities  which  measure  CP  violation 

There  are  basically  two  types  of  observables  which  can  be  used  to  characterize  CP 
violation:  asymmetries  and  correlations. 

An  example  of  an  asymmetry  is  the  partial-width  asymmetry  for  decay  of  par- 
ticles i  and  t  which  are  CP  conjugates  of  each  other: 


(  j 


If  CP  is  a  symmetry  of  the  theory,  A  =  0.   Non-vanishing  A  implies  violation  of 
CP.  A  is  a  convenient  parameter  because  it  is  dimensionless  and  lies  between  —1 


and  1.  In  particular,  if  »  is  an  eigenstate  of  CP,  i  =  i,  and  A  in  (3)  simplifies  and 
measures  the  fractional  difference  in  the  decay  rates  of  i  to  two  CP-conjugate  states 
/  and  /.  As  we  shall  see  later,  the  CPT  theorem  implies  that  A  is  zero  even  if  CP 
is  violated  unless  the  amplitude  has  an  absorptive  part  which  can  arise  because  of 
final-state  interactions  or  loop  effects  in  perturbation  theory. 

Another  type  of  asymmetry  is  an  asymmetry  in  a  final-state  variable  like  energy 
or  angle.  It  is  defined  in  general  (for  i  =  »)  as 

,_ 

Here  S  and  5  are  sets  of  states  with  CP-conjugate  kinematic  ranges,  and  /  is  a 
final  state  assumed  to  have  particles  conjugate  to  one  another  in  pairs.  An  example 
is  energy  asymmetry  between  the  energies  £"+.  and  £_  of  CP-conjugate  particles 
in/: 

A  - 

The  other  category  of  quantities  consists  of  CP-odd  correlations  which  are  ex- 
pectations values  of  CP-odd  operators  in.  a  process  with  initial  as  well  final  states 
described  by  CP-even  density  matrices.  Thus  for  an  observable  O({PA,  ;,  s^}) 
which  is  a  function  of  momenta  p^.  and  spins  sAi  of  particles  A{,  and  which  trans- 
forms under  CP  as 


,BAi}),  (6) 

the  CP-odd  correlation  is 
(0)  =  / 


. 

A  non-zero  value  of  such  a  correlation  signals  CP  violation. 

It  may  be  noted  that  an  asymmetry  in  the  variable  C?may  be  described  as  a 
special  case  of  a  correlation  of  e  ((!?),  where  6  is  the  antisymmetric  step  function: 

.  (8) 


2.2  Statistical  significance 

Whether  or  not  a  measured  asymmetry  or  correlation  can  really  signal  CP  violation 
naturally  depends  on  its  statistical  significance  decided  by  the  statistical  fluctuation 
expected  in  the  event  sample. 

For  a  rate  asymmetry  A,  the  number  of  asymmetric  events  AN  is 

=  AN,  (9) 


where  N  is  the  total  number  of  events  in  the  channel  considered.  The  statistical 
fluctuation  in  these  ./V  events  is  -\/N.  Hence  for  discrimination  of  the  signal,  at  the 
one  standard  deviation  level,  we  require 


AJV  >  VN,  (10) 


VN  v    ' 

Thus,  it  would  be  possible  to  measure  an  asymmetry  if  its  predicted  value  is  larger 
than  l/x/AT.  To  put  it  differently,  the  number  of  events  should  be  larger  than  I/A2. 
In  the  case  of  a  CP-odd  correlation  (0),  CP-invariant  interactions  can  give  in- 
dividual events  with  0  ^  0,  averaging  out  to  zero.  Thus  the  mean  square  deviation 


-  (O?  (12) 

is  a  measure  of  the  background  coming  from  CP-invariant  interactions.  For  N 
events  in  the  channel,  the  CP-even  events  give  rise  to  a  fluctuation  AC?/\/77.  The 
signal  (O)  should  be  larger  than  this  to  be  measurable  at  the  one  standard  deviation 
level: 


(0}  >         ^F{0p.  (13) 


There  is  a  further  experimental  requirement  for  measuring  CP  violation.  All  ex- 
perimental cuts  must  respect  CP  invariance.  If  not,  they  would  introduce  artificial 
asymmetries,  diluting  or  obliterating  the  genuine  signal  of  CP  violation. 

2.3  CPT  theorem  and  all  that 

Since  a  combined  CPT  transformation  is  a  good  symmetry  according  to  the  CPT 
theorem,  CP  invariance  (or  violation)  implies  T  invariance  (or  violation),  and  vice 
versa.  However,  it  should  be  borne  in  mind  that  observation  of  a  T-odd  asymmetry 
or  correlation  is  not  necessarily  an  indication  of  CP  (or  even  T)  violation.  The 
reason  for  this  is  the  anti-unitary  nature  of  the  time-reversal  operator  in  quantum 
mechanics.  As  a  consequence  of  this,  a  T  operation  not  only  reverses  spin  and 
three-momenta  of  all  particles,  but  also  interchanges  initial  and  final  states.  This 
last  interchange  is  difficult  to  meet  with  in  practice,  and  one  usually  has  a  situation 
where  only  momenta  and  spins  are  reversed,  with  the  initial  and  final  states  kept 
as  such.  In  that  case,  non-zero  T-odd  observables  do  not  necessarily  signal  genuine 
T  violation. 

There  is,  however,  a  case  when  T-odd  observables  imply  T  violation,  and  that 
is  when  final-state  interactions  and  loop  effects  can  be  neglected.  In  that  case  the 
transition  operator  T  obeys  T  =  T',  since  the  right-hand  side  in  the  unitarity 
relation 

T-T*=iT*T  (14) 

can  be  neglected.  Then 

(/|T|i)«(/|Tt|,-)  =  (i|r|/>*.  (15) 

Now  if  T  invariance  holds,  then 

(f\T\i)  =  (MTI/T),  (16) 


where  [ty),    |/T)  represent  states  with  all  momenta  and  spins  inverted  in  sign. 
Combining  eq.(16)  with  (15)  for  time-reversed  states,  we  have 


I  =  l{/r|T|ir>[.  (17) 
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in  WHS  case,  u  a  JL  -oaa  ooservaoie  is  non-zero,  it  implies  i    violation,    Tnus,  j. 
invariance  (and  CP  invariance  through  the  CPT  theorem)  implies  equality  of  am- 
plitudes with  all  momenta  and  spins  reversed  if  and  only  if  final-state  interaction 
(absorptive  part  for  the  amplitude)  vanishes. 

Put  differently,  this  means  that  final-state  interactions  can  mimic  T  violation, 
but  not  genuine  CP  violation.  One  should  therefore  use,  as  far  as  possible,  CP-odd 
observables  to  test  CP  invariance,  not  T-odd  observables  (unless  they  are  also  CP 
odd). 

For  a  genuine  CP-odd  quantity,  there  are  two  possibilities, 

A.  it  is  T  odd,  and  therefore  CPT  even,  or 

B.  it  is  T  even,  and  therefore  CPT  odd. 

In  case  B,  there  is  no  violation  of  the  CPT  theorem  provided  the  amplitude  has  an 
absorptive  part.  (This  is  again  due  to  the  fact  the  CPT  operator  is  antiunitary,  and 
interchanges  initial  and  final  states).  Thus  non-vanishing  of  CPT-odd  operators 
necessarily  requires  an  absorptive  part  of  the  amplitude  to  exist. 

The  absorptive  part  of  CP-odd  CPT-odd  quantities  in  perturbation  theory 
usually  comes  from  loop  contributions  where  the  intermediate  state  c?»i  be  on  shell. 
An  interesting  way  of  realizing  this  possibility  in  the  case  of  an  intermediate  state 
of  an  unstable  particle  is  through  the  Breit-Wigner  form  of  its  propagator.  In  this 
case  the  absorptive  part  is  proportional  to  its  width.  This  trick  has  been  used  in 
the  case  of  the  top,  Z  and  Higgs  propagators  [2,  3,  4].  One  must  however  be  careful 
to  subtract  out  the  part  of  the  width  corresponding  to  decay  into  the  initial  or 
final  state  for  consistency  with  the  CPT  theorem  [5].  It  has  also  been  pointed  out 
recently  [6]  that  off-diagonal  contributions  to  the  self-energy  of  the  virtual  particles 
are  also  needed  for  consistency  with  the  CPT  theorem. 

3.  CP  violation  in  the  leptonic  sector 


3.1  Scenarios  for  leptonic  CP  violation 

In  the  standard  model,  no  right-handed  neutrinos  are  introduced.  As  a  result, 
there  is  no  mass  matrix  to  diagonalize  for  the  neutrinos.  Hence  the  CKM  matrix 
is  the  unit  matrix,  and  no  CP-violating  phases  can  arise.  However,  in  extensions 
of  the  standard  model,  CP  violation  can  arise  either  because  of  the  presence  of 
neutrino  masses  or  because  of  extra  leptons  introduced  (even  though  neutrinos 
may  be  massless),  or  both. 

A.  Massive  neutrinos.    Neutrinos  can  have  Dirac  or  Majorana  masses.    CP 
violation  in  the  Dirac  case  is  exactly  analogous  to  that  in  the  quark  sector  of  the 
standard  model.  In  case  of  Majorana  masses,  the  freedom  of  phase  redefinition  of 
the  neutral  lepton  fields  is  reduced  because  Majorana  mass  terms  are  not  invariant 
under  phase  transformations.   As  a  result  there  are  more  CP-violating  phases  in 
the  CKM  matrix  than  the  corresponding  Dirac  case.    It  is  thus  possible  to  have 
CP  violation  with  even  two  generations  of  Majorana  neutrinos. 

B.  Massless  neutrinos  with  exotic  leptons.  It  is  possible  to  have  CP  violation 
because  of  either  charged  or  neutral  leptons  in  exotic  representations  of  SU(2)  x 
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The  leptons  then  have  flavour-  violating  couplings  to  Z  or  Higgses,  which  can 
be  complex  and  hence  CP  violating. 

We  consider  below  some  leptonic  CP-violating  processes  at  high-energy  collid- 
ers which  make  use  of  the  above  mechanisms  of  CP  violation.  The  importance 
of  leptonic  processes  s£ems  from  the  fact  that  they  are  relatively  clean  from  the 
experimental  point  of  view. 

3.2  Leptonic  flavour  violating  Z  decays 

Leptons  can  have  flavour-violating  couplings  to  Z  giving  rise  to  flavour  violating  Z 
decays  into  charged  leptons  either  at  tree  level  or  at  one-loop  level: 


The  corresponding  (7P-violating  asymmetry  is 


T(Z  ->/,-/,•)  +  T(Z  -*/,-/,)' 

This  is  T  even  and  therefore  CPT  odd.  It  therefore  needs  an  absorptive  part  to  be 
present. 

Flavour-violating  tree-level  couplings  of  charged  leptons  to  Z  arise  in  mod- 
els with  exotic  charged  leptons  transforming  as  either  left-handed  singlets  and/or 
right-handed  doublets.  In  such  a  case  the  Glashow-Weinberg  condition  for  flavour- 
diagonal  couplings  is  not  satisfied,  and  (18)  occurs  at  tree  level.  For  A  of  (19)  to  be 
non-zero,  one-loop  correction  to  (18)  is  also  needed,  and  only  the  absorptive  part 
of  that  amplitude  contributes.  The  asymmetry  is  then  0(a)[7]. 

On  the  other  hand,  models  with  exotic  neutral  leptons  have  flavour-  violating 
couplings  of  neutral  leptons  at  the  tree-level  giving  rise  to  (18)  at  the  loop  level 
[8]  .  The  absorptive  part  now  comes  from  one  of  these  loop  diagrams.  A  is  now 
O(l).  However,  unlike  in  the  previous  case,  the  rate  of  the  flavour-violating  process 
(18)  is  O(cn3}.  Thus,  the  minimum  total  number  NZ  of  Z  events  for  an  observable 
asymmetry  is  in  both  cases  O(l/a3).  However,  constraints  on  leptonic  mixing 
angles  and  masses  of  exotic  leptons  make  this  process  too  rare  to  observe  at  LEP. 

3.3  CP  violation  in  e+e~  -»  -y*,  Z*  -»  l+T 

In  case  of  CP  violation  in  e+e~  —  »•  7,  Z  —  »•  /+/~  there  are  two  general  results  [9]: 
(i)  No  CP  violation  can  be  seen  without  measuring  initial  or  final  spins.  This 
follows  basically  because  no  CP-odd  observable  can  be  constructed  without  spins. 
(ii)  The  only  CP-violating  couplings  for  the  on-shell  process  are  the  dipole  moment 
type  couplings  of  e  or  /  (electric  or  "weak"  dipole  moments).  Since  there  are  strong 
experimental  limits  on  the  electron  and  muon  electric  dipole  moments  (de  ;$  10~27e 
cm,  dp  £  10~19e  cm),  r  may  be  a  good  candidate  for  /.  In  fact,  the  weak  moment 
of  T  has  been  constrained  using  r  polarization  in  this  reaction  (see  below). 

Since  it  is  clear  from  (i)  that  either  initial  or  final  spins  have  to  be  observed  to 
detect  CP  violation,  we  consider  below  both  these  cases  for  e+e~  —+  Z  —+  r+r~  . 

3.3.1  CP  tests  using  T  polarization  in  e+e~  —  +  T+T~:  This  possibility  has 
been  discussed  by  several  authors  [2,10-12].  For  the  process 

e-(p_)  +  e+(p+)-*r-(fc_,S_)  +  r+(fc+,s+),  (20) 


possible  CP-odd  quantities  that  can  be  constructed  out  of  the  momenta  and  spins 
in  the  centre-of-mass  (cm)  frame  are  (p_  —  p+)  •  (s_  —  s+)  and  (p_  —  p+)  -s_  x  s+. 
To  measure  these  quantities,  one  must,  of  course,  be  able  to  measure  s±.  This 
can  be  done  by  looking  at  decay  distributions  of  T±.  In  the  rest  frame  of  r,  the 
angular  distribution  of  an  observed  decay  particle  is 


where  q*  is  the  unit  vector  along  the  momentum  of  the  observed  particle,  and  a  is 
a  constant  called  the  analyzing  power  of  the  channel.  For  example, 

or     =       =pl       for      r^  —  >•  Tr*i/T, 
=     ±1/3    for     r±  ->/*!/,!/,., 

as  deduced  from  the  theory  of  weak  r  decay.  Using  (21),  spin  correlations  can  be 
translated  to  momentum  correlations. 

In  terms  of  the  observed  momenta,  possible  CP-odd  variables  are  p  •  (q+  x  q_) 


p  •  (q+  +  q_)p  •  (q+  —  q_)  (CPT  =  —  1).  Expectation  values  of  the  last  two  were 
suggested  by  Bernreuther  et  al.  [11,  12]  for  measuring  respectively  the  real  and 
imaginary  parts  of  the  r  weak  dipole  form  factor  dT(m^).  The  suggestion  has  been 
carried  out  at  LEP  for  Re  dT(m?z}  by  the  OPAL  [13]  and  ALEPH  [14]  groups.  OPAL 
looked  at  inclusive  leptonic  and  hadronic  decays  of  r,  whereas  ALEPH  analyzed 
all  channels  exclusively.  The  results  obtained  are  the  95%  confidence-level  upper 
limits  Rei(m|)  <  7.0  x  10~17e  cm  (OPAL  [13])  and  Redr(m|)  <  3.7  x  10~17e 
cm  (ALEPH  [14]). 

The  theoretical  prediction  for  the  1  s.d.  limit  obtainable  in  the  measurement  of 
Imdr(m|)  is  10~16  using  the  correlation  (p  •  (q+  +  q_)p  -(q+  —  q_)}  and  a  sample 
of  107Z's  [12]. 

3.3.2  Longitudinal  beam  polarization:  The  Stanford  Linear  Collider  (SLC), 
operating  presently  at  the  Z  resonance,  has  an  e~  polarization  of  about  62%,  and 
is  likely  to  reach  75%  in  the  future.  The  present  sample  collected  is  of  50,000  Z's, 
and  the  hope  is  to  reach  5xl05,  or  even  106  Z's. 

Can  this  longitudinal  e~  polarization  help  in  measuring  the  r  weak  dipole  mo- 
ment? The  answer  is  "yes"  [15].  In  fact,  as  we  shall  see,  SLC  can  do  better  than 
LEP  so  far  as  ImefT  is  concerned. 

The  essential  point  is  that  the  vector  polarization  of  Z  gets  enhanced  in  the 
presence  of  e+e~  longitudinal  polarization.  For  vanishing  beam  polarization,  Pe-  — 
Pe+  =  0,  the  Z  vector  polarization  is 

(o)_ 


where  gvt->  9Ae  *re  vector  and  axial-  vector  couplings  of  e+e~  to  Z.   For  non-zero 
polarization, 


_ 
PZ=     z          ''-,  (23) 


(24) 


Thus,  Pz  «  0.71  for  Pc-  =  -.62  and  Pe+  =  0,  which  is  an  enhancement  by  a  factor 
of  about  4.5. 

It  is  therefore  profitable  to  look  for  CP-odd  observables  involving  the  Z  spin 
sz  =  Pzp,  where  p  is  the  unit  vector  along  p+  =  —  p_  in  the  c.m.  frame.  Examples 
of  such  observables  are  p  •  (q+  x  q_)  and  p  •  (q+  +  q_).  While  both  are  CP  odd, 
the  former  is  CPT  even  and  the  latter  is  CPT  odd. 

The  above  is  not  entirely  correct  in  principle.  CP-odd  correlations  give  a  mea- 
sure of  underlying  CP  violation  only  if  the  initial  state  is  CP  even.  Otherwise  there 
maybe  contributions  to  correlations  which  arise  from  C  P-invariant  interactions  due 
to  the  CP-odd  part  of  the  initial  state.  In  the  case  when  only  the  electron  beam 
is  polarized,  the  initial  state  is  not  CP  even.  In  practice,  however,  this  CP-even 
background  is  small  because  for  me  —+  0,  only  the  CP-even  helicity  combinations 
e^ejk  and  e^e£  survive,  making  the  corrections  proportional  to  me/mz  «  5  x  10~6. 
If  one  includes  order  a  collinear  photon  emission  from  the  initial  state,  which  could 
flip  the  helicity  of  the  e^ ,  then  like-helicity  e+e~  states  could  also  survive  for  van- 
ishing electron  mass  [16].  However,  it  turns  out  that  this  being  a  non-resonant 
effect,  the  corresponding  cross  section  at  the  Z  peak  is  small.  It  is  therefore  ex- 
pected that  the  correlations  coming  from  CP-invariant  SM  interactions  in  such  a 
case  will  be  negligible. 

The  correlations  (Oi)  ~  {p  •  (q+  x  q_))  and  (O2)  ~  (p  •  (q+  +  Q-)}  have 
been  calculated  analytically  for  the  single-pion  and  p  decay  mode  of  each  r.  Also 
calculated  analytically  are  (Oj)  and  {O2,}  needed  for  obtaining  the  1  s.d.  limit  on 
the  measurability  of  dr  obtained  using  eq.(13)  [15]. 

The  results  for  only  the  single-pion  channel  are  summarized  in  tables  la  and  Ib, 
which  give,  respectively  for  0\  and  O-i  and  for  electron  polarizations  Pe-  =  0,  ±0.62, 
the  correlations  in  units  ofdrm.z/e,  the  square  root  of  the  variance,  and  the  Icr  limit 
on  RedT  and  Imdr  obtainable  with  106  Z's.  The  enhancement  of  (01,2}  and  hence 
the  sensitivity  of  dT  measurement  with  polarization  is  evident  from  the  tables. 


Table  la  Correlation  of  Oi,  the  standard  deviation  from  SM  interactions,  and  the  1 
s.d.  limit  on  the  real  and  imaginary  parts  of  the  weak  dipole  moment,  for  different 
polarizations  Pe. 


Pe 

(Oi)(GeV2)for 
Re  dT  =  e/mz 

vW 

(GeV2) 

1  s.d.  limit  on  RedT 
for  106  Z's  (in  e  cm) 

0 

+0.62 
-0.62 

0.90 
-2.89 
4.01 

12.86 
12.86 
12.86 

1.5  x  10~16 
4.6  x  10-17 
3.3  x  10-17 

270 


Pramana-  J.  Phys.,  Supplement  Issue,  1995 


CP  violation  at  colliders 


Table  Ib  Quantities  as  in  table  la,  but  for 


Pe 

(OaHGeV)  for 
Im  dr  —  e/mz 

vW 

(GeV) 

1  s.d.  limit  on  lmdr 
for  106  Z's  (in  e  cm) 

0 

+0.62 
-0.62 

-0.16 
0.51 
-0.70 

9.57 
9.57 
9.57 

6.2  x  10~16 
1.9  x  10~16 
1.4  x  10~16 

The  sensitivity  can  be  further  improved  by  looking  at  only  the  Pc-  dependent 
part  of  the  cross  section.  This  can  be  done  by  looking  at  a  sample  corresponding  to 
the  difference  in  the  number  of  events  for  a  polarization  Pe-  and  polarization  —  Pe- . 
The  correlations  are  evaluated  with  respect  to  d<r(Pe-}  —  dcr(—Pe-).  This  reduces 
the  number  of  events.  However,  correlations  are  enhanced  by  a  larger  factor  giving 
a  net  gain  in  the  sensitivity.  The  result  for  Pt-  =  .62  and  106  Z's  is  given  in  table 
2. 


Table  2  Quantities  as  in  table  la,  but  for  a  distribution  asymmetrized  between  polariza- 
tions +0.62  and  -0.62. 


Observable 

_  (0)  for 
\dT\  -  e/mz 

V(o2) 

1  s.d.  limit  on  \dT\ 
for  106  Z's  (in  e  cm) 

0i 
02 

35.55  GeV2 
-6.22  GeV 

12.86  GeV2 
9.57  GeV 

1.2  x  10~17 
5.0  x  10-17 

The  1  s.d.  limit  for  Im<fr  should  be  compared  with  the  LEP  expectation  of 
10~16  e  cm  for  a  larger  sample  of  107  Z's  [12]. 

The  use  of  correlations  for  measuring  the  r  edm  at  the  proposed  r-charm  fac- 
tories employing  longitudinal  polarization  of  electron  and  positron  beams  has  also 
been  studied  in  detail  in  [17]. 

3.3.3  Transverse  beam  polarization:  Use  of  transversely  polarized  e+e~  beams 
for  the  study  of  CP  violation  has  been  studied  by  several  people  (see  for  example 
[2,  18]).  For  a  reaction 


e~(p. , «_)  +  e+(p+,  *+)  ->  /(*+)  +  /(*_) 


(25) 


in  the  c.rn.  frame,  where  /  denotes  a  fermion,  possible  T-odd  triple  products  are 
s_  •  (p  x  k),  s+  •  (p  x  k),  p  •  (s_  x  s+),  k  •  (s_  x  s+)  ,  where  p+  =  — p_  =  p  and 
k+  =  — k_  =  k.  Of  these  triple  products,  the  last  two  are  purely  CP  odd,  whereas 
only  the  difference  of  the  first  two  is  CP  odd.  Burgess  and  Robinson  [18]  have  done 
an  analysis  of  e+e~  — »•  T+T~  ,  cc,  66  in  terms  of  operators 


(27) 


Their  results  for  Xw  =  (400  GeV)'2  and  L  =  4.8  x  105pb~1  at  LEP  are  given 
in  table  3,  where  A  is  the  asymmetry  for  s_  •  (p  x  k)  given  by 


=\j 


t,  -Si)l  s-  •  (p  x  k). 


(28) 


Though  this  is  an  interesting  effect,  a  theoretical  estimate  for  \w  is  needed  before 
concluding  whether  the  effect  would  be  observable.  Assuming  a  systematic  error 
of  0.1%,  the  2-0-  limits  possible  on  \w  are  estimated  to  be  (570  GeV)~2  and  (660 
GeV)~2  respectively  for  up-  and  down-type  quarks. 

Table  3  The  transverse  polarization  asymmetry  A  (defined  in  the  text)  compared  with 
the  standard  model  events  for  e*e~  -+  r+  r~,cc,  66. 


r+r- 

cc 

66 

N  (SM  events)  xlO"8 
A  x  10-5 
A/N  x  103 

6.5 
-6.1 
0.9 

24 
97 
4.0 

31 
-170 
5.5 

4.  CP  violation  in  top  pair  production 


Evidence  for  a  top  quark  of  mass  of  about  174  GeV  at  the  Tevatron  has  been  re- 
ported by  the  CDF  collaboration  [19].  Even  though  the  data  is  not  conclusive,  it 
is  generally  believed  that  the  top  quark  will  eventually  be  found  with  a  mass  of  a 
similar  magnitude.  Top-antitop  pairs  can  then  be  produced  at  large  rates  at  future 
colliders  and  used  for  various  studies.  In  particular,  since  a  heavy  top  (mt  >  120 
GeV)  decays  before  it  hadronizes  [20],  information  about  its  polarization  is  pre- 
served in  its  decay  products.  Schmidt  and  Peskin  [21]  have  suggested  (elaborating 
on  an  old  suggestion  of  Donoghue  and  Valencia  [22])  looking  for  the  asymmetry 
between  t^ti  and  t^tp.  as  a  signal  for  CP  violation  (see  also  [23]).  Note  that  this  is 
possible  only  for  a  heavy  particle  like  the  top  quark  because  for  a  light  particle,  the 
dominant  helicity  combination  would  be  tiJijt  or  <«?£,,  each  being  self-conjugate. 

The  asymmetry  N(tLti)  -  N(ipiR)  can  be  probed  through  the  energy  spectra 
of  prompt  leptons  from  t  —>  Wb;  W  — »•  \v.  This  is  understood  as  follows. 

For  a  heavy  top,  the  dominant  W  helicity  in  t  -*  Wb  is  0.  Now,  due  to  V  —  A 
interaction,  6  is  produced  with  left-handed  helicity  (neglecting  the  6  mass).  Hence 
in  the  t  rest  frame,  W+  momentum  is  dominantly  along  the  t  spin  direction.  It 
follows  that  /+  in  W+  decay  is  produced  preferentially  in  the  direction  of  the  t 
spin.  In  fact,  the  distribution  is  1  -f  cosV>,  where  V>  is  the  angle  between  the  /+ 
momentum  direction  and  the  t  spin  direction.  In  going  to  the  laboratory  frame,  the 
t  gets  boosted.  Thus  J+  from  IR  is  more  energetic  than  /+  from  IL,  and  /""  from  IL 
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has  more  energy  than  /~  from  tR.  Therefore,  in  the  decay  of  <£,?£,  /"  from  ti  has 
higher  energy  than  /+  from  fjr ,  and  the  reverse  is  true  for  tRtR.  Thus  the  energy 
asymmetry  of  leptons  measures  N(titi,}  —  N(tRtR). 

Schmidt  and  Peskin  [21]  looked  at  this  asymmetry  in  hadron  collisions  in  a  CP- 
violating  multi-Higgs  model  where  the  CP  violation  is  described  by  the  Lagrangian 
terms 

6C  =  ~^t[APL  +  A*PR]t,  (29) 

where  only  the  effect  of  the  dominant  lightest  Higgs  field  (j>  is  kept,  v  is  the  SM 
Higgs  vacuum  expectation  value,  and  A  is  a  complex  combination  of  mixing  angles 
and  phases. 

Since  #(<£?*,)  -  N(tRiR)  is  CP  odd  and  T  even,  the  CPT  theorem  requires 
the  existence  of  an  absorptive  part  for  it  to  be  non-zero.  Looking  at  the  tree  and 
one-loop  diagrams  for  qq  — t-  tt,  they  conclude  that 

A=N(tLtL)-N(tRiR) 

A      N(tLiL)  +  N(tRiR)~W  (30) 

for  mt  =  150  GeV,  m^  =  100-400  GeV  and  Im[^2]  =  \/2.  Thus  the  asymmetry  A 
would  be  observable  with  10*8  it  a  year. 

For  pp  collisions,  since  the  pp  state  is  not  a  CP  eigenstate,  there  is  also  a 
CP-invariant  contribution  present,  but  this  is  shown  to  be  small  [21]. 

As  in  the  case  of  pp  or  pp  collisions,  lepton  energy  asymmetry  can  be  used  to 
measure  CP  violation  in  e+e~~  — *  ft  [24].  The  authors  of  ref.[24]  also  consider  an- 
other asymmetry,  viz.,  the  up-down  asymmetry  of  the  charged  leptons  with  respect 
to  the  it  production  plane  in  the  laboratory  frame.  It  is  also  possible  to  construct 
a  "left-right"  asymmetry  of  leptons  with  respect  to  a  plane  perpendicular  to  the 
it  production  plane,  but  containing  the  it  momentum  direction  [25].  Certain  com- 
binations of  up-down  and  left-right  symmetry  with  forward-backward  asymmetry 
can  also  be  considered.  All  these  probe  different  combinations  of  CP-violating 
couplings  [25]. 

The  result  of  ref.[24]  is  that  asymmetry  is  at  the  few  per  cent  level  for  the  top- 
quark  electric  and  weak  dipole  moments  dt,dt  ~  e/m*.  For  ^/s  =  300  GeV  and 
m£  =  120  GeV,  cr  w  1400  fb,  which  corresponds  to  20,000  prompt  leptons  a  year 
for  a  luminosity  of  1033  cm"^"1.  In  such  a  case,  dt,dt  can  be  determined  to  a  few 
percent  level.  This  should  be  compared  with  the  prediction  of  10~2  —  10~3  from 
the  Higgs  model  for  CP  violation. 

Apart  from  the  above  asymmetries,  (7P-odd  correlations  could  provide  a  mea- 
sure of  CP  violation  in  it  production  [26].  Certain  correlations  are  more  sensitive 
to  CP  violation  in  t  decay,  rather  than  production  [26]. 

Another  process  which  has  been  suggested  is  e+e~  — +  iti/V  through  W+  W~ 
—>•  it  [4].  Two  diagrams  contribute  to  W^W""  — *  <t,  one  corresponding  to  t-channel 
6  exchange  and  the  other  with  an  s-channel  heavy  Higgs  <j>,  which  can  be  on  shell  for 
rrty  >  2mt.  Then,  the  absorptive  part  needed  for  a  CP-odd  CPT-odd  asymmetry 
is  provided  by  the  width  of  the  Higgs.  A  sizable  asymmetry  can  be  obtained  thus 
[41. 


dipole  moments  independently  [25,  27] 


5.  CP  violation  in  other  processes 


The  process  e+e~  -*  W+W~  ,  which  will  be  studied  in  the  near  future  at  LEP200, 
will  be  the  first  one  to  be  able  to  test  the  SM  couplings  of  the  electroweak  gauge 
bosons.  The  process  is  expected  to  put  bounds  on  non-standard  7  and  Z  couplings 
to  W+W~.  The  non-standard  couplings  could  be  CP  violating  ones,  as  in  (2). 
These  can  be  studied  in  a  way  similar  to  the  one  used  for  probing  CP  violation  in 
<s+e~  —  »  r+r~  and  e+  e~  —  >  it  considered  earlier.  As  in  the  case  of  e*e~~  —  *•  it  with 
leptonic  t,t  decays,  energy  or  up-down  asymmetry  of  leptons  may  be  used  [28,  29], 
Whrereas  Chang  ei  al  [28]  consider  asymmetries,  Mani  et  al  [29]  gave  estimated 
the  energy  correlation  ratio 

,_(£-) 

(E-) 

and  suggest  the  angular  correlation  ratio 

<*-)-(*+) 


where  E±  are  the  energies  of  the  leptons  /±  produced  in  the  decay  of  W±  ,  and  6± 
are  the  angles  of  /±  momenta  with  respect  to  the  e+  beam  direction.  The  general 
expectation  is  .4  fa  10~3  for  K  or  A  «  0.1. 

Some  other  processes  considered  in  the  literature  are  t,t  decay  asymmetries  in 
(j)  —  »•  «,  where  ^  is  a  heavy  Higgs  [30],  decay  lepton  asymmetries  in  e+e~  -*  XX  > 
where  £  is  a  neutralino  in  the  minimal  supersymmetric  standard  model  [31],  decay 
correlations  in  77  -+  W+W~  [32],  and  forward-backward  asymmetry  in  e+e~  -»  jZ 
[33]. 


6.  Conclusions 

We  have  seen  above  the  various  points  to  be  kept  in  mind  when  selecting  processes 
and  variables  for  detecting  and  measuring  CP  violation,  a  number  of  processes 
and  scenarios  of  CP-violating  signatures  which  could  be  looked  for.  The  above 
discussion  is  mainly  aimed  at  arriving  at  an  idea  of  the>, sensitivities  possible  in 
different  measurements.  In  general,  the  results  in  most  popular  models  of  CP 
violation  beyond  SM  indicate  that  CP  violation  in  the  most  optimistic  theoretical 
scenario  would  be  measurable  only  with  some  difficulty  in  the  existing  or  presently 
envisaged  experiments.  Nevertheless,  it  would  be  good  to  keep  one's  eyes  open  to 
these  possibilities. 
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Structure  functions  and  parton  distributions 
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Abstract.  We  review  recent  developments  in  the  determination  of  parton  densities 
from  deep  inelastic  and  related  data.  We  show  how  the  asymmetries  observed  in  the  W± 
rapidity  distributions  and  in  pp/pn  Drell-Yan  production  further  constrain  the  partons  at 
moderate  i. 


1.  Introduction 

An  idea  of  the  rapid  improvement  in  our  knowledge  of  the  structure  function, 
F-z(x,Q2),  of  deep  inelastic  electron  (or  muon)-proton  scattering  can  be  glimpsed 
from  the  data  shown  in  Fig.  1.  In  fact  it  should  not  be  too  long  before  we  have 
precision  measurements  (perhaps  as  good  as  ±5%)  of  FI  at  HERA  almost  down 
to  x  ~  10~4.  The  data  show  a  dramatic  rise  of  FI  with  decreasing  x,  which  some 
optimists  had  anticipated  might  occur  from  the  precocious  onset  of  the  leading 
cv,log(l/x)  resummation  of  soft  gluon  emissions. 

The  curves  marked  D'_  and  DQ  are  simply  two  extrapolations  obtained  from 
parton  fits  to  the  fixed  target  data,  which  at  the  time  it  was  thought  might  span 
the  subsequent  measurements  of  F^  at  HERA  -  one  represents  the  upper  limit 
expected  for  the  precocious  resummation  behaviour,  the  other  the  lower  limit  of 
conventional  (Regge-motivated)  expectations.  Such  extrapolations  are  notoriously 
unreliable. 

Section  2  is  mainly  devoted  to  recent  developments  in  the  determination  of  par- 
tons  in  the  x  >  10~2  region.  For  the  purposes  of  illustration  we  mainly  concentrate 
on  the  MRS  analyses.  In  Section  3  we  discuss  the  description  of  F2ep  at  small  x 
and  find  that  both  GLAP-based  extrapolations  and  BFKL-based  predictions  have 
sufficient  freedom  to  describe  the  HERA  data. 

2.  Recent  developments  in  the  determination  of  parton  densities 

There  is  a  long  history  of  determining  parton  distributions  from  deep  inelastic  scat- 
tering and  related  data.  The  increased  precision  in  the  experimental  measurements 
over  the  last  couple  of  years  has  led  to  a  considerable  improvement  in  our  knowledge 
of  the  partons,  at  least  for  x  >  0.01.  The  recent  experiments  are  listed  in  Table 
1,  together  with  the  leading  order  partonic  subprocess,  which  help  us  to  see  which 
partons  are  constrained  by  the  various  processes.  In  particular  it  is  useful  to  note 
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Figure  1.  The  x  dependence  of  the  structure  function  F2(x,Q2)  at  Q2  =  15  GeV2.  The 
data  are  from  the  BCDMS  [1],  NMC  [2],  Hi  [3]  and  ZEUS  [4]  collaborations.  The  curves 
are  calculated  from  the  MRS(Do,D'_)  sets  of  partons  [5]. 

that  the  four  deep-inelastic  structure  functions  can,  to  leading  order,  be  expressed 
in  terms  of  parton  densities  in  the  form 

=     ^-x(u  +  u  +  d  +  d  +  -s}  (2) 

±x(F£N  +  FlN)    =    x(u-u  +  d-d)  (4) 

where  N  is  an  isoscalar  nuclear  target.  We  have  included  the  s  quark  (and  assumed 
[5]  that  s  =  s),  but  neglected  the  small  c  quark  contribution.  These  four  observ- 
ables,  Fi(x,<Q2),  measured  in  the  fixed-target  (pN  -»  pX  and  vN  -*•  f*X)  deep- 
inelastic  experiments,  do  not  on  their  own  determine  the  five  distributions  « ,  of,  u,  d_ 
and  5,  but  only  four  combinations,  which  we  may  take  to  be  u  +  u,  d  +  d,  u  +  d 
and  s.  The  difference  d—  u  is  not  determined,  and,  moreover,  at  leading  order  it 
appears  that  the  only  constraint  on  the  gluon  is  through  the  momentum  sum  rule. 
In  practice  the  situation  is  worse.  From  (2)  and  (3)  we  see  that  the  strange  quark 
distribution  is  essentially  determined  by  the  structure  function  difference 


**(*)  ~  f 


-3F2*D 


(*) 


(5) 


which  is  sensitive  to  the  relative  normalization  of  the  two  data  sets  and  to  heavy 
target  corrections  to  the  neutrino  measurements.  However,  we  see  from  Table  1 
that  valuable  independent  information  comes  from  other  processes:  (i)  the  gluon  is 
constrained  by  prompt  photon  production,  where  g(x)  enters  to  leading  order,  (ii) 
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the  strange  sea  is  measured  by  neutrino-induced  dimuon  production  (see  Fig.  2), 
and  (iii)  there  is  a  measurement  of  the  difference  d  —  u,  which  was  released  for  the 
first  time  at  this  conference,  from  the  observed  asymmetry  of  Drell-Yan  production 
in  pp  and  pn  collisions  (see  subsection  (c)). 

(a)  MRS  global  analysis 

Here  we  describe  the  MRS  determination  [6]  of  the  parton  densities,  /i(x,  Q2),  from 
a  global  analysis  of  the  data,  which  was  performed  before  the  HERA  measurements 
of  F%p  became  available.  Only  deep  inelastic  data  with  Q2  >  5  GeV2  (and  W2  >  10 
GeV2)  were  used.  The  input  distributions  at  Q%  =  4  GeV2  were  parametrized  in 
the  form 

*/<(*,  Qo)  =  ^z-A<(l-*)*(l  +  Tl-x*+*,-z)  (6) 

for  i  —  the  valence  quarks  wv,  dv,  the  sea  S,  and  the  gluon  g.  Three  of  the  four  A^s 
are  determined  by  the  momentum  and  flavour  sum  rules.  Since,  pre-HERA,  there 
were  no  data  for  x  <  10~2,  we  imposed  two  different  extrapolations  of  the  gluon 
and  sea  quark  distributions  to  small  x  (with  the  expectation  that  they  should  span 
the  forthcoming  HERA  measurements).  To  be  precise  for  the  gluon  and  sea  quarks 
we  choose 

At-  =  0    for  the  D'O  set 

\i  =  0.5    for  the  D'_  set  (7) 

where,  since  cr(j*p)  ~  sAi,  the  first  choice  corresponds  to  conventional  Pomeron 
exchange  leading  to  constant  total  cross  sections,  and  the  second  is  motivated  by 
the  small  x  behaviour  expected  from  the  BFKL  (Balitsky,  Fadin,  Kuraev,  Lipatov) 
equation  [7].  The  /?,•  and  the  valence  At-  are  left  as  free  parameters  although  we  have 
some  idea  what  values  will  be  physically  reasonable.  First,  the  small  x  behaviour 
anticipated  from  meson  Regge  exchanges  for  the  valence  quark  distributions  would 
correspond  to  Av  ~  —0.5.  Secondly,  naive  counting  rule  arguments  imply  that 
xfi  —  (1  —  x)2""1  as  x  — »•  1,  where  n  is  the  minimum  number  of  spectator  quarks 
which  accompany  the  struck  parton.  This  rule  would  suggest  /?,-  ~  5,  3  and  7  for, 
respectively,  the  gluon,  valence  and  sea  quark  distributions. 

In  the  MRS  analysis  the  flavour  structure  of  the  quark  sea  S  at  Q2,  =  4  GeV2 
is  taken  to  be 

2u  =  0.45 -A 

2J  =  0.45  4- A 

2s  =  0.25 

xA  =  x(d-*)  =  A*x-**(l-x)'ls.  (8) 

As  mentioned  before,  the  most  reliable  method  to  estimate  the  strength  of  the 
strange  sea  distribution  s(x,Q2)  is  to  observe  deep-inelastic  dimuon  production, 
t/N  — » (ji~n+X,  for  which  the  dominant  subprocess  is  vs  — *  fji~  +  c(— +  /*+).  Early 

^lirmii^n      Jntn      nTA»y>      4- V.  r->      m  nl  I  «r»»  f  i  nr\     svf    t  Vl  n.      t^HQ/i.     all  r\r\1»O<!ai/\n       t~\f    t.VlO     1  Tl  1M 1 1.      Q 


Table  1  The  experimental  data  used  to  determine  the  MRS  parton  distributions.  1 
last  column  gives  an  indication  of  the  main  type  of  constraint  imposed  by  a  par! 
ular  set  of  data. 
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excellent  agreement  with  their  results  (see  Fig.  2)  and  moreover  CCFR  find  that 
the  ratio  s/(ti  -f  d)  is  essentially  independent  of  x.  It  has  been  emphasized  [9] 
that  the  strange  sea  as  measured  in  neutrino  scattering  should  be  different  from 
that  in  muon  scattering  on  account  of  the  different  mass  thresholds  in  W*g  —*•  sc 
versus  7*0  — *•  ss.  In  practice  the  neutrino  data  have  been  corrected  by  CCFR  to 
approximately  take  into  account  the  mc  ^  0  effects  and  allow  for  this  difference. 

The  first  evidence  that  d^  ti,  and  hence  the  need  to  introduce  A  in  (8),  came 
from  the  NMC  study  of  the  Gottfried  sum  rule  [10].  We  return  to  discuss  this  later. 
Finally,  the  heavy  quark  contributions  are  determined  by  assuming  that 


Q  -  0      for      Q2  <  4m2Q  with      Q  =  c,  6, ..., 


(9) 


and  by  generating  non-zero  distributions  at  higher  Q2,  via  g  — >•  QQ,  using  QCD 
evolution  with  mq  =  0.  Near  threshold  clearly  this  prescription  is  unreliable,  but 
it  has  been  shown  to  give  reasonable  results  at  higher  Q2  [11].  The  charm  quark 
distribution  in  the  proton  has  attracted  much  attention  but  the  phenomenological 
situation  is  far  from  clear. 

The  input  distributions  (6)  and  (8)  are  evolved  up  in  Q2  using  next-to-leading 
order  GLAP  or  Altarelli-Parisi  equations  [13]  and  fitted  to  deep-inelastic  structure 
functions  (with  Q2  >  5  GeV^_and  W2  >  10  GeV2)  and  related  data.  The  dis- 
tributions are  defined  in  the  MS  renormalization  and  mass  factorization  scheme. 
The  fitted  value  of  the  QCD  scale  parameter  is  ATJ^(H/  =  4)  =  230  MeV  (that  is 
o,(M|)  =  0.1125),  which  is  consistent  with  the  values  a,  =  0.113  and  0.111  found 
by  analyses  of  BCDMS  [14]  and  CCFR  [15]  subsets  of  the  overall  data. 
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Figure  2.   The  shaded  band  shows  the  strange  quark  distribution  extracted  by  CCFR 
from  their  dimuon  data  [7],  together  with  the  MRS  distributions  of  ref.  [5]. 
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Figure  3.  The  description  of  the  NMC  data  [2]  for  the  structure  function  ratio  F£n / F£p 
given  by  the  MRS  partons  of  ref.  [5]. 
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Figure  5.  The  continuous  curves  show  the  description  of  the  CCFR  [14]  measurements 
of  .F^z.Q2)  and  xF?N(x,  Q2)  by  the  MRS(D^)  set  of  partons  [5].  The  data  are  shown 
after  correction  for  the  heavy  target  effects  and  after  the  overall  renormalization  of  0.95 
required  by  the  global  fit. 


Figs.  3-5  show  the  high  precision  of  recent  deep-inelastic  structure  function 
measurements,  as  well  as  the  quality  of  the  fit.  Accurate  data  exist  also  for  related 
processes  and  are  equally  well  described.  This  should  mean  that  the  parton  dis- 
tributions are  well  determined,  at  least  in  the  x  range  0.02  <  x  <  0.7  where  data 
are  available  for  a  full  range  of  different  processes.  At  smaller  x,  x  <  10~3,  the 
measurements  of  F£P  at  HERA  give  information  on  the  x  behaviour  of  the  quark 
sea.  We  will  discuss  the  expectations  of  the  small  x  behaviour  in  some  detail  later. 
For  the  moment  we  simply  note  that  it  is  possible  to  'tune'  the  exponent  A,-  of 
the  input  sea  and  gluon  distributions  of  (6)  to  obtain  the  best  global  fit  to  a  data 
set  which  now  includes  the  new  HERA  measurements  of  F%  [3,  4].  The  result,  set 
MRS(H)  of  partons  [16],  has  A,-  =  0.3. 

Before  turning  to  a  theoretical  discussion  of  the  small  x  region,  we  must  mention 
two  recent  measurements  which  impose  valuable  new  constraints  on  the  partons  in 
the  x  >  0.02  region.  The  first  is  the  measurement  of  the  rapidity  asymmetry  of 
W*  bosons  produced  in  pp  collisions  and  the  second  is  the  Drell-Yan  asymmetry 
observed  in  pp  and  pn  collisions. 

(b)    W±  rapidity  asymmetry 

W*  bosons  produced  in  pp  collisions  have  different  rapidity  (yw)  distributions.  A 
comparison  of  parton  distributions  shows  that  the  «  quark  in  the  proton  tends  to 
go  more  forward  than  the  d  quark,  so  that  the  W+  is  produced  preferentially  in 
the  direction  of  the  incident  proton,  while  the  W~  tends  to  go  in  the  direction  of 
the  antiproton.  Thus  we  anticipate  that  the  asymmetry 

-  tr(W-) 


will  be  positive  and  increase  with  the  rapidity  yw  of  the  produced  W±  bosons. 
It  turns  out  that  the  asymmetry  AW  is  correlated  with  the  slope  of  F^/F^  in 
the  x  ~  My/l^/8  region  [17]  -  larger  slopes  tend  to  imply  greater  asymmetry.  In 
practice  it  is  the  rapidity  asymmetry  of  the  f^  decay  leptons  which  is  observed 


Folding  in  the  W  —  >•  tv  decay  dilutes  the  asymmetry,  but  the  measurements  shown 
in  Fig.  6  indicate  that  it  is  still  sizeable.  The  present  data  favour  the  recent  MRS 
rather  than  the  CTEQ  parton  sets.  As  the  experimental  precision  increases  we  see 
that  there  is  a  great  potential  for  the  asymmetry  to  further  discriminate  between 
parton  sets  -  indeed  these  data  should  be  included  in  future  global  analyses  from 
the  beginning. 

(c)  d—  u  and  the  Drell-Yan  asymmetry 

Inspection  of  the  leading  order  expressions  (l)-(4)  shows  that  measurement  of  the 
four  independent  deep-inelastic  structure  functions  do  not  determine  d  —  «  on  a 
(x,  Q2)  point-by-point  basis.  The  difference  d  —  u  is  only  constrained  if  the  form  of 
the  x  dependence  of  partons  is  assjumed.  Indeed  prior  to  1992  all  global  analyses 
set  u  =  j.  The  first  hint  that  u^  d  came  from  an  NMC  study  [21]  of  the  Gottfried 


sum  rule.  NMC  found  that  their  data  gave 

/°8  *L(F2W  -  *?")  =  0.227  ±0.007(Stat.)±0.014(sys.) 

JQ.004    x 

at  Q2  =  4  GeV2,  as  compared  to  the  Gottfried  sum 

IGSR  =    I'  ^(F?  -  Ff)  =  J  /'  dx(uv  -  dv)  +  j  /'  dx(u  -  d) 
Jo    *  «/o  Jo 


if     u  =  d. 
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Figure  6.  A  plot  of  the  CDF  data  for  the  lepton  charge  asymmetry,  (11),  compared  to 
the  predictions  of  various  sets  of  partons  [5,  18,  19].  The  plot  is  taken  from  ref.  [16].  These 
data  were  fitted  in  the  MRS(A)  analysis  [20]  which  gives  a  description  essentially  equal 
to  that  of  MRS(D'_). 

Using  their  data  to  extrapolate  over  the  unmeasured  regions  of  z,  NMC  obtained 
JGSR  =  0.240  ±0.016  [21]  [22].  The  most  plausible  explanation  of  the  difference 
between  the  NMC  value  and  the  Gottfried  sum  of  |  is  that  d  >  u.  In  the  MRS 
'D'  fits  d  -  n  is  parametrized  according  to  (8)  with  the  parameter  A&  chosen  to 
reproduce  the  measured  value  of  /GSR-  It  is  interesting  to  note,  however,  that 
even  including  the  NMC  data,  it  is  still  possible  to  maintain  w  =  d  and  obtain  an 
equally  good  global  description  of  the  data,  but  at  the  expense  of  a  contrived  small 
x  behaviour  of  the  valence  distributions,  as  in  set  S'0  of  ref.  [6].  The  comparison  of 
the  (F%  -  F$]  NMC  data  with  SQ  and  D'Q  is  shown  in  Fig.  7  -  the  area  under  the 
curves  gives  /GSR  =  0-333  and  0.256  respectively,  the  difference  coming  primarily 
from  the  unmeasured  x  £  10~2  region.  The  most  physical  assumption  is  to  allow 
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Figure  7.  The  structure  function  difference  F£p 
with  the  description  by  partons  from  refs.  [5,  18]. 
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u  ^  d,  as  in  the  'D'-type  fits.    Nevertheless  the  conclusion  is  that  the  detailed 
structure  of  d  —  u  is  not  determined  by  the  available  deep-inelastic  data. 

As  the  W*  rapidity  asymmetry  measurements  improve  they  become  increas- 
ingly sensitive  to  the  behaviour  of  u  and  d  in  the  region  x  ~  M^/^/s.  However  a 
more  direct  method  [24]  to  obtain  information  on  d  —  «  is  to  compare  (Drell-Yan) 
lepton-pair  production  in  pp  and  pn  collisions,  via  the  asymmetry 


'pn 


(13) 


'PP 


'pn 


Because  u  >  d  in  the  proton,  the  asymmetry  ADY  is  positive  for  parton  sets  with 
d  —  u  zero  or  small,  but  ^DY  is  reduced  and  can  even  be  negative  for  parton  sets 
with  larger  d—  u  [25].  An  asymmetry  measurement  by  the  NA51  collaboration  [26], 


=  -0.09  ±0.02  ±0.02 


(14) 


at  x  =  0.18,  was  announced  at  this  conference.  This  result,  which  implies  u/d  ~  0.5 
at  x  =  0.18,  corresponds  to  a  breaking  of  flavour  symmetry  considerably  beyond 
that  associated  with  the  MRS(D0,D'_)  partons  and  almost  as  large  as  that  calcu- 
lated using  CTEQ2M  partons,  see  Fig.  8. 

We  may  conclude  that,  as  far  as  the  asymmetries  are  concerned,  the  MRS  par- 
tons  accurately  reproduce  the  W*  rapidity  asymmetry  (Fig.  6)  but  do  not  describe 
the  new  Drell-Yan  asymmetry  measurement.  On  the  other  hand  the  CTEQ  partons 
describe  the  W±  asymmetry  rather  poorly,  but  offer  a  much  better  description  of 
the  Drell-Yan  asymmetry. 

(d)  Present  status  of  parton  distributions 

We  noticed  that  the  high  precision  deep-inelastic  data_  are  well  described  by  the 
parton  distributions  but  that  they  do  not  pin  down  the  d  —  u  combination.  However 
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Figure  8.  The  pp/pn  Drell-Yan  asymmetry  of  (13)  measured  by  NA51  [25]  compared  to 
values  obtained  by  various  sets  of  partons  [5,  15,  20,  18].  The  data  point  was  included  in. 
the  MRS(A)  analysis  [20]. 
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tuc  pttnuuEt.  in  particular  tue 
new  asymmetry  measurements  probe  fine  details  of  the  quark  distributions. 

Can  we  use  the  residual  freedom  of  d  —  u  to  modify  the  MRS(H)  set  of  partons 
so  as  to  describe  the  new  Drell-Yan  asymmetry  measurement  [26]  while  maintaining 
the  quality  of  the  global  fit  to  the  data?  In  particular  can  we  maintain  the  quality 
of  the  description  of  the  W*  asymmetry  (Fig.  6)  and  of  the  accurate  NMC  mea- 
surements of  F%lF%t  It  turns  out  that  such  a  fit  is  possible  -  we  denote  the  new  set 
of  partons  by  MRS(A)  [20].  The  asymmetries  resulting  from  this  set  of  partons  are 
shown  in  Figs.  6  and  8.  We  would  expect  the  MRS(A)  partons  to  be  very  similar  to 
those  of  MRS(H),  except  that  d  —  u  would  be  much  larger  while  at  the  same  time 
approximately  conserving_u  -f  d,  d+d  and  «  +  it  (as  required  by  (l)-(4)).  Thus  we 
anticipate  an  increase  of  d  which  is  compensated  by  a  corresponding  decrease  in  u 
and  d,  which  in  turn  requires  a  similar  increase  in  it.  The  comparison  of  MRS(H) 
and  MRS(A)  parton  sets  is  shown  in  Fig.  9,  and  displays  these  trends  in  the  region 
0.02  <  x  <  0.7  where  a  full  set  of  deep  inelastic  data  exist. 

Although  the  quark  distributions  are  well-constrained  by  the  data,  the  same 
is  not  so  true  for  the  gluon.  Within  the  global  analyses,  we  can  identify  three 
types  of  constraint  on  the  gluon:  (i)  the  momentum  sum  rule,  (ii)  the  prompt 
photon  production  measurements  and  (iii)  the  observed  overall  pattern  of  scale 
violations,  dFi/dlogQ2,  for  x  <  0.1.  The  measurements  of  F2ep  at  HERA  probe 
primarily  the  behaviour  of  the  sea  quark  distributions  at  small  x,  although  accurate 
measurements  of  dF%p/d\ogQ2  will  further  constrain  the  gluon.  If  the  observed 
•steep  rise  of  F2  with  decreasing  x  is  attributed  to  the  Lipatov  or  BFKL  perturbative 
QCD  mechanism  via  g  — >•  qq,  then  the  same  behaviour  should  be  seen  also  in  the 
gluon  distribution.  A  direct  measurement  of  the  gluon  in  the  x  <  10~2  region  would 
be  invaluable. 

In  addition  to  the  potential  constraints  coming  from  accurate  measurements 
of  dFi/dlogQ2,  there  are  several  other  possible  ways  of  measuring  the  gluon  at 
HERA,  including  F£,  J/^,  QQ---  production.  In  principle  the  observation  of  the 
longitudinal  structure  function  is  a  gold-plated  measurement,  but  in  practice  it 
will  be  difficult.  To  extract  FL  as  a  function  of  x,Q2,  requires  observation  of  the 
y  —  Q2/xs  dependence  of  deep  inelastic  scattering,  which  means  having  electron- 
proton  collisions  at  different  c.m.  energies  >/s. 

There  are  also  possibilities  to  obtain  information  on  the  gluon  from  hadron 
colliders,  in  particular  the  Fermilab  pp  collider.  At  v^  =1.8  TeV,  x  values  com- 
parable to  those  currently  measured  at  HERA  can  be  probed  either  by  observing 
small  mass  systems  produced  centrally  (e.g.  66  or  Drell-Yan  production  or  by  more 
massive  final  states  at  large  rapidity  y  (e.g.  same-side  W-fjet  or  dijet  produc- 
tion). As  an  illustration  Fig.  10  shows  the  ratio  of  same-side  to  opposite-side  jet 
cross  sections  from  uncorrected  CDF  data,  as  a  function  of  the  equal  and  oppo- 
site rapidities  y.  At  large  y  the  same-side  cross  section  originates  dominantly  from 
subprocesses  of  g  (small  x)<7vai(large  x)  origin.  The  curves  are  the  MRS(D'_,Do,H) 
predictions  evaluated  at  leading  order  with  a  renormalization/factorization  scale 
chosen  to  mimic  the  next-to-leading  order  corrections.  Further  details  of  this  anal- 
ysis can  be  found  in  ref.  [27].  We  see  that  at  large  rapidity  the  ratio  is  sensitive 
to  the  small  x  behaviour  of  the  gluon  and  that  the  preliminary  CDF  data  favour 
the  singular  distributions.  Dijet  production  from  'direct'  photons  at  HERA  offers 
similar  possibilities  of  probing  the  gluon  at  small  x  via  751  —*•  qq  [28]. 
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rigure  10    The  same-side/opposite-side  dijet  ratio  predicted  by  parton  sets  of  refe.  [5, 
5J  compared  with  prehminary,  uncorrected  CDF  data. 
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3.  Conclusions 

The  parton  densities  of  the  proton  are  well  determined  in  the  region  0.02  <  x  <  0.7 
where  high  precision  data  for  a  wide  range  of  deep-inelastic  ant?  related  processes  are 
available.  The  one  combination,  d—  w,  that  is  not  constrained  by  the  deep-inelastic 
data,  has  for  the  first  time  been  determined  at  x  =  0.18  by  a  measurement  of  the 
Drell-Yan  asymmetry  in  pp  and  pn  collisions  [26].  This  measurement,  together  with 
the  W*  rapidity  asymmetry  measurements  probe  fine  details  of  the  quark  densities. 
It  is  therefore  not  surprising  that  the  existing  MRS  (and  CTEQ)  partons  require 
minor  modification.  The  result  is  a  new  set  of  partons  [20]  -  MRS(A)  -  which  is 
essentially  identical  to  MRS(H)  at  small  x]  recall  that  the  latter  parton  set  [16] 
was  obtained  by  incorporating  the  1992  HERA  measurements  of  F2  [3,  4]  into  the 
analysis.  It  is  found  that  MRS(A,H)  also  give  an  excellent  description  of  the  new 
preliminary  1993  higher  statistics  HERA  data  for  F%, 
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Abstract.  I  start  with  a  brief  introduction  to  the  elementary  particles  and  their  in- 
teractions, Higgs  mechanism  and  supersymmetry.  The  major  physics  objectives  of  the 
Tevatron  and  LHC  colliders  are  identified.  The  status  and  prospects  of  the  top  quark, 
charged  Higgs  boson  and  superparticle  searches  are  discussed  in  detail,  while  those  of  the 
neutral  Higgs  boson(s)  are  covered  in  a  parallel  talk  by  R.J.N.  Phillips  at  this  workshop. 


The  hadron  colliders  are  best  suited  to  explore  a  new  domain  of  energy  for  a  wide 
variety  of  new  physics  signals  because  of  their  higher  energy  reach  and  greater 
versatility.  With  the  loss  of  SSC,  one  looks  forward  to  the  Tevatron  upgrade  and 
especially  the  large  hadron  collider  (LHC)  to  survey  the  energy  range  of  100  — 
1000  GeV.  I  plan  to  give  an  overview  of  the  main  physics  issues  to  be  probed 
by  these  colliders.  To  put  them  in  perspective,  let  me  briefly  recall  our  current 
understanding  of  the  elementary  particles  and  their  interactions. 

Basic  Constituents  of  Matter  and  Their  Interactions  -  As  per  the  standard  model, 
the  bask  constituents  of  matter  are  the  3  pairs  of  leptons  and  quarks  shown  below. 
Each  pair  represents  2  charge  states  differing  by  1  unit  -  charge  0,  -1  for  the  leptons 
and  2/3,  -1/3  for  the  quarks  -  which  is  relevant  for  their  weak  interaction.  Apart 
from  this  electric  charge  of  course  the  quarks  possess  the  so-called  colour  charge, 
which  is  relevant  for  their  strong  interaction. 

Table  1,  Basic  constituents  of  matter 


leptons 

charge 

quarks 

charge 

Vt  Vp  V-r 

e  n   r 

0 
-1 

u   c    t 
d   s   b 

2/3 
-1/3 

Almost  half  of  these  elementary  particles  have  been  observed  during  the  last  two 
decades,  thanks  to  the  advent  of  the  colliders.  The  discovery  of  the  T  lepton  and 
the  charm  quark  and  study  of  their  detailed  properties  were  the  highlights  of  the 
SPEAR  (e+e~)  collider.  Although  the  bottom  quark  was  first  discovered  in  a  fixed 
target  hadron  machine  (the  Fermilab  SPS),  its  detailed  study  could  be  made  only  at 


tne  jjumo  ana  ^t/ort,  {e T  e  )  comaers.  I  ne  nrst  experimental  evidence  ^tnougn 
still  indirect)  of  vr  has  come  from  the  observation  of  W  — »•  r  vr  events  at  the 
CERN  pp  collider.  These  are  the  (in)famous  monojet  plus  missing-pr  events  of  the 
UA1  experiment  which  were  originally  thought  to  signal  supersymmetric  particle 
production.  As  regards  the  top  quark  -  the  last  and  the  most  massive  member  of 
the  above  table  -  the  CDF  experiment  has  recently  reported  a  tentative  signal  in 
the  mass  range  of  ~  175  GeV  [1,  2]  from  the  Tevatron  pp  collider  [3].  One  expects 
a  more  definitive  signal  to  emerge  from  the  ongoing  CDF  and  D0  experiments  at 
the  Tevatron  collider.  For  there  is  sufficient  indirect  evidence  for  the  existence  of 
top  quark  in  the  above  mass  range,  as  we  shall  see  below. 

Next  consider  the  interactions  between  these  elementary  particles.  Apart  from 
gravitation,  which  is  too  weak  to  be  of  interest  to  particle  physics  phenomenology, 
there  are  3  basic  interactions  -  strong,  electromagnetic  and  weak.  They  are  all 
gauge  interactions,  mediated  by  vector  particles.  The  strong  interaction  (QCD)  is 
mediated  by  the  exchange  of  massless  vector  gluons  with  couplings  proportional  to 
the  colour  charge  C  (Fig.  la).  This  is  analogous  to  the  electromagnetic  interaction 
(QED),  mediated  by  the  massless  vector  photon  with  couplings  proportional  to  the 
electric  charge  e  (Fig.  Ib).  The  gluon  was  detected  at  the  PETRA  (e+e~)  collider 
via  the  three-jet  events  i  coming  from  gluon  radiation  [4] 

e+ e~  -*  qqg  . 

Unlike  the  photons  which  possess  no  electric  charge  the  gluons  possess  colour  charge 
and  can  therefore  interact  among  themselves.  The  gluon  self  interaction,  the  most 
distinctive  feature  of  strong  interaction,  has  been  observed  at  the  CERN  pp  collider, 
via  the  two-jet  events  coming  from  [4] 
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Figure  1.  The  basic  amplitudes  of  (a)  strong,  (b)  electromagnetic  and  (c)  charged  current 
weak  interaction. 

The  weak  interactions  are  mediated  by  the  massive  charged  and  neutral  vector 
bosons  W±  and  Z°.  The  charged  W  boson  couples  to  each  of  the  above  pairs  of 
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leptons  and  quarks  with  the  same  universal  coupling  (Fig.  Ic),  while  the  Z  boson 
couplings  are  given  by  the  standard  SU(2)  x  U(l)  electro-weak  model  of  Glashow, 
Weinberg  and  Salam  in  terms  of  the  mixing  angle  QW-  It  also  relates  the  weak  and 
electromagnetic  couplings 

aw=a/sin:tOw  ,  (1) 

where  sin2  9w  is  known  from  the  neutrino  scattering  (and  more  recently  the  LEP 
data)  [1],  i.e. 

sin2  8W  ^  .23  .  (2) 

Thus  the  W  mass  is  predicted  by  estimating  the  Fermi  coupling  from  p.  decay  (Fig. 
Ic);  i.e 

GF  =  -TK  Trf-  =  4=    •  2.Q    „,    =  1-1663  x  1(T5  GeV~2  .  (3) 

\/2  MJ,       v/2  sin2  Bw  •  M%,  ^  ' 

Substituting  eq.  (2)  and  the  fine  structure  constant  at  the  appropriate  mass  scale 
a(M&)  ~  1/128  (4) 

one  gets 

Mw  ^  80  GeV,  Mz  =  Mw/cos9w  c±  91  GeV  .  (5) 

The  W  and  Z  bosons  were  seen  at  these  masses  at  the  CERN  pp  collider  and  more 
recently  at  the  Tevatron  collider.  Finally,  it  has  been  possible  to  produce  millions 
of  Z  bosons  and  study  its  detailed  properties  at  the  large  electron  positron  collider 
(LEP). 

It  would  appear  from  the  above  discussion  that  all  that  remains  to  be  seen  at 
the  future  colliders  is  a  definitive  signal  of  the  top  quark!  This  is  of  course  not  the 
whole  story,  as  we  see  below. 

The  Mass  Problem  (Higgs  Mech.)  -  It  arises  from  the  fact  that  for  massive  vec- 
tor bosons,  the  mass  term  in  the  Lagrangian  breaks  gauge  invariance  and  hence 
the  renormalisability  of  the  theory.  Let  us  consider  the  SU(2)  gauge  interaction, 
mediated  by  the  Isotriplet  of  vector  bosons 


in  the  presence  of  charged  scalar  particles  represented  by  the  complex  field  <j>.  We 
have 

C     • 


4 
x  Wv  .  (6) 

The  three  terms  in  the  Lagrangian  from  right  to  left  represent  the  vector  boson 


and  the  scalar  kinetic  energy  and  gauge-interaction  term.  Each  of  them  is  invariant 
under  the  gauge  transformations 

<t>  -+  eis'$<j>,  WP-+WH-  -d^S  -  a  x  W^  .  (7) 

y 

But  adding  a  mass  term  for  the  vector  boson 


will  clearly  break  the  gauge  invariance  [5].  In  contrast,  the  scalar  mass  term  is  gauge 
invariant.  This  fact  is  used  to  give  mass  to  the  vector  bosons  through  back-door. 
One  starts  with  an  Isodoublet  of  complex  scalar  field  with  imaginary  mass, 


and  turns  on  the  Higgs  mechanism  [6].  Three  of  the  scalar  fields  are  absorbed  as 
Goldstone  bosons  to  give  mass  and  hence  longitudinal  components  to  the  3  vector 
bosons,  while  the  remaining  one  becomes  a  physical  scalar  particle  (the  famous 
Higgs  particle)  with  real  mass 


=  MW  .  (9) 

v     ff     ' 

Although  the  mass  is  related  to  the  unknown  scalar  self-coupling  A,  the  validity  of 
the  perturbation  theory  implies 

A  <  1  =*  MH.  <  ITeV  .  (10) 

Finally,  the  Higgs  coupling  to  the  vector  bosons  as  well  as  the  quarks  and  leptons 
(not  mentioned  so  far)  are  predicted  to  be 

—     9 


Thus  the  Higgs  particle  has  appreciable  couplings  only  to  the  heavy  particles  like 
W,  Z  bosons  or  the  t  quark;  and  the  best  place  to  look  for  it  is  in  the  decay  of 
Z  or  the  toponium  (it)  state  [7].  Even  then  the  branching  ratio  is  tiny,  so  that 
one  needs  copious  production  of  Z  AS  provided  by  the  LEP  (e+e~)  collider.  It  has 
yeilded  a  lower  mass  bound  of  about  60  GeV  [1]  for  Higgs  particle  which  is  close 
to  its  discovery  limit.  The  search  can  be  pushed  up  further  to  about  80  GeV  at 
LEP-II.  If  we  do  not  find  a  Higgs  particle  in  this  mass  range,  then  the  search  must 
be  extended  over  the  hundreds  of  GeV  range  since  the  theoretical  mass  limit  is  1 
TeV.  This  is  the  main  physics  goal  of  LHC,  which  is  proposed  to  be  a  14  TeV  pp 
collider.  The  reason  for  the  pp  option  instead  of  pp  is  its  high  luminosity  and  hence 
a  higher  mass  reach  for  the  Higgs  search. 

Hierarchy  Problem  (SUSY  Soln.)  -  Solving  the  mass  problem  by  the  Higgs  scalars 
leads  to  the  so-called  hierarchy  problem.  The  property  that  scalar  particle  mass 
is  not  protected  by  any  gauge  symmetry,  which  was  in  fact  used  to  solve  the  mass 
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problem,  implies  that  they  are  quadratically  divergent  under  radiative  correction 
(Fig.  2).  Consequently  the  output  Higgs  mass  would  become  as  large  as  the  cutoff 
scale  of  the  electroweak  theory,  i.e. 

MH  —  MGUT(W™  GeV)  or  MPlanck(1019  GeV). 
How  to  restrict  the  Higgs  mass  to 

MH  ~  Mw  ~  100  GeV? 

JH 
/         N 

/"~~"x  /  \ 

i  \  f^J 

1  I      \         '  H 


1234 

Figure  2.  The  quadratically  divergent  contributions  to  the  Higgs  mass  from  the  radiative 
Higgs  loops  (1,2)  and  the  cancelling  SUSY  contribution  from  the  Higgsino  loops  (3,4). 

By  far  the  most  attractive  solution  to  this  problem  is  provided  by  supersym- 
metry (SUSY)  [8].  It  provides  canceling  contributions  from  radiative  loops  with 
Higgsino,  fermionic  superpartner  of  the  Higgs  scalar  (Fig.  2).  For  the  cancellation 
to  be  exact  up  to  a  mass  scale  of  ~  100  GeV,  one  requires 

1)  Exact  supersymmetry  in  the  couplings 

2)  A  bound  on  supersymmetry  breaking  in  masses 

"  100  GeV. 


Thus  one  expects  to  see  superpartners  of  standard  particles  -  scalar  partners 
of  quarks  and  leptons  (q,  £}  and  fermionic  partners  of  gauge  and  Higgs  bosons 
(g,1,W,Z,H)  -  within  the  mass  scale  of  several  hundred  GeV.  Search  for  such 
particles  is  an  important  programme  of  present  and  proposed  colliders.  By  far  the 
strongest  mass  bound  on  super  particles  comes  from  the  CDF  experiment  [9]  at  the 
Tevatron  (pp)  collider,  i.e. 

Mq-g  >  140  GeV.  (12) 

The  ongoing  CDF  and  D0  experiments  at  Tevatron  are  expected  to  extend  this 
search  up  to  a  mass  range  of  ~  250  GeV.  The  search  can  be  extended  to  over  1 
TeV  at  LHC. 

It  is  clear  from  the  above  discussions  that  the  searches  for  top  quark,  Higgs 
boson(s)  and  possible  superparticles  are  the  most  important  physics  objectives  of 


w    /         \     w 


Figure  3.  Radiative  correction  to  the  W  boson  mass  arising  from  the  tb  loop. 

the  forthcoming  high  energy  collider  experiments.  While  it  will  take  a  21st  century 
machine  like  LHC  to  carry  on  the  Higgs  boson  and  superparticle  searches  upto 
their  predicted  mass  bounds  of  ~  1  TeV,  there  is  a  strong  indirect  evidence  for  the 
top  quark  to  lie  in  the  mass  range  of  ~  170  GeV.  This  comes  from  the  radiative 
correction  to  eq.  (3)  coming  from  the  tb  loop  contribution  to  W  self  energy  (Fig. 
3)  and  the  analogous  it  contribution  to  the  Z  self  energy.  Since  these  vector  bosons 
acquire  longitudinal  components  by  swallowing  Higgs  scalars  their  •fermionic  cou- 
plings are  propertional  to  the  fermion  mass  (eq.  11),  which  can  be  sizeable  for  a 
large  Mt.  The  resulting  radiative  correction  is  quadratic  in  Mt-  More  precisely  eq. 
(3)  has  a  radiative  correction  factor  (1  +  Ar),  where 

(13) 

for  Mt  >  Mw>  Thus  for  Mt  >  200  GeV,  one  would  get  untenably  low  values 
of  sin2  QW  or  MW  (Mz).  Consistency  with  the  precission  measurements  of  these 
quantities  (particularly  at  LEP)  requires  [1] 

Mt- 170  ±25  GeV.  (14) 

Therefore  one  expects  a  definitive  top  quark  signal  to  emerge  from  the  ongoing 
experiments  at  the  Tevatron  pp  collider. 


Figure  4.  Top  quark  production  in  pp  (pp)  collision  via  gluon-gluon  fusion. 

Top  Quark  Search  -  The  hadron  colliders  are  the  most  promising  machines  for  top 
quark  search  because  of  their  high  energy  reach.    But  the  signal  is  messy;   and 
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one  has  to  use  special  tricks  to  separate  it  from  the  background.  The  dominant 
mechanism  for  top  quark  production  is  the  so-called  flavour  creation  process  via 
gluon-gluon  fusion  (Fig.  4)  and  quark-antiquark  fusion  (Fig.  la),  i.e. 


*t-  (15) 

One  looks  for  a  prompt  charged  lepton  t  (i.e.  e  or  fi)  coming  from  its  leptonic  decay 
t  -»•  bvt  (16) 

which  eliminates  the  background  from  gluon  and  ordinary  stable  quarks  (u,d,  s). 
Of  course  the  charged  lepton  could  come  from  the  unstable  quarks  b  and  c,  i.e. 

99(w)     -*     bb,  cc; 
b-*cvt,  c    -*•     svt.  (17) 

These  background  can  be  effectively  suppressed  by  requiring  the  charged  lepton 
to  be  isolated  from  the  other  particles.  Because  of  the  large  energy  release  in  the 
decay  of  the  massive  top  quark,  the  decay  products  come  wide  apart.  In  contrast 
the  energy  release  in  the  light  6  or  c  quark  decay  is  small,  so  that  the  decay  products 
come  together  in  a  narrow  cone  -  i.e.  the  charged  lepton  appears  as  a  part  of  the 
decay  quark  jet.  The  isolated  lepton  signature  provides  a  simple  but  very  powerful 
signature  for  top  quark,  first  suggested  in  [10].  Using  this  signature  the  top  quark 
search  was  carried  out  at  the  CERN  pp  collider  and  then  at  the  Tevatron  collider 
-  the  latter  giving  a  mass  limit  of  Mt  >  89  GeV  [11]. 

With  the  luminosity  upgrade  of  the  Tevatron  collider  it  is  possible  now  to  extend 
the  search  to  the  mass  range  of  100  —  200  GeV.  A  top  quark  in  this  mass  range 
decays  into  a  real  W  boson,  so  that  one  has  a  1W  final  state,  i.e. 

it  ->  bbWW  ->  Uqqlv.  (18) 

The  most  serious  background  in  this  case  is  direct  W  production  with  additional 
QCD  jets, 

qq  ->  ggW  -*  j\htv.  (19) 

Since  the  QCD  jets  are  largely  soft  and/or  collinear,  they  can  be  suppressed  to  a 
large  extent  by  transverse  momentum  and  invariant  mass  cuts  -  e.g.  pj  >  60  GeV, 
•Mj'ija  ~  80  GeV  -  without  affecting  the  signal. 

Fig.  5  shows  an  early  prediction  of  the  it  signal  and  the  W  +  2  jets  background 
with  these  cuts  for  the  Tevatron  upgrade  [12];  the  right  hand  scale  corresponds  to 
a  luminosity  of  100  pb"1.  One  sees  firstly  that  for  the  relatively  clean  dilepton 
channel,  corresponding  to  leptonic  decay  of  both  the  W  bosons  in  (18),  one  has 
a  measurable  signal  upto  M*  ~  150  GeV.  For  the  isolated  lepton  plus  multijet 
channel  the  signal  remains  measurable  right  upto  Mt  ~  200  GeV,  although  one 
has  to  contend  with  a  formidable  background.  Here  one  hopes  to  be  helped  by 
the  fact  the  signal  is  dominated  by  3  jets  accompanying  the  isolated  lepton,  for 
which  the  QCD  background  would  be  further  suppressed  by  an  order  of  magnitude 
[12,  13].  Finally  the  presence  of  a  pair  of  6  quark  jets  in  the  signal  (18)  would  help 
to  enhance  the  signal  to  background  ratio  further,  if  one  has  a  reasonably  efficient 
b  identification. 
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Figure  5.  Top  quark  contribution  to  the  isolated  lepton  plus  n-jet  events  and  a\so  dilepton 
events  (dotted  line)  shown  for  the  typical  energy  (2  TeV)  and  luminosity  (100  pb"1 )  of 
the  Tevatron  upgrade.  The  background  to  the  2-jet  events  from  W  plus  2-jet  and  W  pair 
production  processes  are  also  shown  [12]. 
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Figure  6.  Top  mass  distribution  for  26  events  in  the  W  +  3  or  more  jet  sample  (solid 
histogram)  and  the  background  of  13  events  (dots)  obtained  from  W+  multijets  Monte 
Carlo  simulation.  The  dashed  histogram  represnts  the  sum  of  13  if  Monte  Carlo  events 
with  Mt  =  175  GeV  plus  the  13  background  events  [2]. 
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of  26  events  in  the  W  (-+  tv)  +  3  or  more  jets  channel  against  the  expected  back- 
ground of  ~  13  events  from  W+  QCD  jets.  The  excess  is  consistent  with  a  top 
quark  signal  of  mass  ~  175  GeV  (Fig.  6).  They  also  have  a  reasonable  efficiency  of 
6  identification  (--  30%)  by  combining  informations  on  its  decay  vertex  in  the  mi- 
crovertex  detector  and  on  the  lepton  coming  from  its  semileptonic  decay.  Requiring 
at  least  one  identified  b  jet  leaves  a  sample  7  events  against  an  expected  background 
of  1.4  (Fig.  7).  The  excess  of  5.6  events  has  been  tentatively  interpreted  as  a  top 
quark  signal  of  mass  ~  175  GeV  [2]. 
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Figure  7.  Top  mass  distribution  for  the  data  (solid  histogram)  and  the  background  of  1.4 
W+  multijets  Monte  Carlo  events  (dots)  having  a  tagged  b.  The  dashed  histogram  repre- 
sents the  sum  of  5.6  tf  Monte  Carlo  events  with  Mt  =  175  GeV  plus  the  1.4  background 
events  [2]. 

But  of  course  the  data  sample  is  too  small  to  draw  any  definitive  conclusion.  It 
may  be  noted  here  that  this  data  sample  was  based  on  a  luminosity  of  ~  20  pb-1 , 
while  the  ongoing  CDF  and  D0  runs  at  the  Tevatron  are  expected  to  accumulate 
a  luminosity  of  -  100  ph"1.  Therefore  a  more  definitive  picture  is  expected  to 
emerge  soon.  Even  then  one  would  have  of  course  no  more  than  a  dozen  or  two  of 
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top  candidate  events.  With  an  accumulated  luminosity  of  ~  1000  pb"1,  expected 
for  the  next  phase  of  the  Tevatron  upgrade,  one  expects  to  see  ~  100  top  quark 
events.  This  will  be  sufficient  to  establish  a  definitive  top  signal,  but  still  inadequate 
to  study  its  decay  properties. 


(D 


CM 
Q. 


4-1 


•H 

b" 


10 


10 


10 


10 


0.1 


.01 


.,         r-_ 

No  Ey  Cut 

AC 

E*  <  10  GeV 


4O 


6O 


8O 


IOO 


Figure  8.  The  expected  it  signal  at  LHC  in  the  cleanest  (e/i)  channel  shown  against  the 
PT  of  the  2nd  (softer)  lepton.  The  66  background  with  and  without  the  isolation  cut  are 
also  shown  [14]. 

In  contrast  one  expects  copious  production  of  top  at  the  LHC.  In  the  cleanest 
(ep.)  channel,  shown  in  Fig.  8  [14],  one  expects  a  top  cross-section  of  ~  104  fb.  This 
corresponds  to  a  cross-section  of  ~  105  fb  in  the  lepton  -f-  multijet  channel  (18) 
discussed  above.  Even  with  the  low  luminosity  option  of  LHC  (~  10  fb~1/year)> 
this  would  imply  an  annual  rate  of  ~  1  million  top  quark  events  -  i.e.  similar  to  the 
rate  of  Z  events  at  LEP.  This  will  enable  one  to  study  its  decay  properties  in  detail 
and  in  particular  to  search  for  new  particles  in  the  decay  of  top.  In  particular  there 
is  a  good  deal  of  recent  interest  in  the  search  of  one  such  new  particle  in  top  quark 
decay,  i.e.  the  charged  Higgs  boson  H^  of  the  supersymmetric  standard  model. 


D  P  Roy 

Charged  Higgs  Boson  Search  -  The  minimal  supersymmetric  standard  model 
(MSSM)  has  two  Higgs  isospin  doublets  with  opposite  hypercharge  V  =  ±1  to 
ensure  anomaly  cancellation  between  their  fermionic  partners  [6,  8].  The  two  dou- 
blets of  complex  scalar  fields  correspond  to  8  independent  scalars,  3  of  which  are 
absorbed  as  Goldstone  bosons  to  give  mass  to  the  W±  and  Z.  So  there  are  5 
physical  Higgs  bosons  -  3  neutral  (h°,H°,A0)  and  2  charged  ones  (#*).  We  have 
the  following  fermionic  couplings  of  the  charged  Higgs  boson  in  the  diagonal  KM 
matrix  approximation, 

£  =     .„..      -  H+   cot  pMtfbi     4-     tan  fiMbtb/i  4-  cot  /?Mcc«£ 
y2Mw        i 

4-     tan/?MTi/rfi  \  +  he,  (20) 

where  we  have  neglected  the  couplings  propertional  to  the  light  quark  and  lepton 
masses.  The  subscript  L(R]  stands  for  the  left  (right)  handed  spinor  state  and 
tan/?  represents  the  ratio  of  the  two  Higgs  vacuum  expectation  values.  In  the 
supergravity  models  1  <  tan  /?  <  Mt/Mb. 

As  we  see  from  (20),  the  t  couplings  to  bH+  and  bW*  are  comparable  over  a 
large  range  of  tan  /?  and  so  are  the  H+  couplings  to  cs  and  VT+  .  Thus  for  MH  <  Mt , 
one  expects  significant  branching  fractions  for  the  decays 


H+  ->  T+IA  (21) 

In  contrast  to  the  preferencial  H+  decay  into  r,  there  is  a  universal  branching 
fraction  (=  1/9)  for  W  boson  decay  into  the  3  lepton  species,  i.e. 


W+  -+e+v,}ji+v,T+v.  (22) 

Thus  an  excess  of  top  decay  into  r  vis  a  vis  the  e,  ft  channels 


constitutes  a  distinctive  signature  for  charged  Higgs  boson  [17]. 

A  second  signature  for  charged  Higgs  boson  is  provided  by  the  r  polarisation  — 
the  T  leptons  coming  from  the  decay  of  a  scalar  (#*)  and  vector  (W*)  boson  have 
exactly  opposite  polarisations  [18].  Using  the  two  signatures  one  can  carry  on  the 
H*  search  close  to  the  top  quark  mass  at  LHC  over  the  whole  range  of  tan  /?  [19]. 

For  MH  >  Mt  one  still  expects  a  sizeable  rate  of  H*  production  at  LHC  via 
the  gluon  —6  quark  fusion 

gb-+tH~  -»«&;  (24) 

but  the  dominant  decay  mode  into  fb  has  aa  enormous  QCD  background.  With  a 
good  6  quark  identification,  however,  one  expects  to  have  a  viable  signal  if  tan  j3  —  •  1 
or  very  large  (~  Mt/Mb)  [20,  21]. 

Fig.  9  shows  the  expected  signal  against  the  QCD  background  for  tan/?  =  1  and  50 
assuming  a  ^-identification  efficiency  of  30%  F^ffl.  Inter  pst.incrlv  t.Viesft  two  reirions 
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Figure  9.  The  expected  charged  Higgs  signals  at  LHS  shown  against  the  reconstructed 
H±  mass  along  with  the  background.  The  cases  MH±  =  200,  300, 400,  500  GeV  are  shown 
for  (a)  tan/?  =  1  and  (b)  tan/?  =  50  [20]. 


couplings  at  the  GUT  scale  [22].  The  underlying  reason  for  favouring  these  two 
regions  of  tan/?  is  of  course  the  same  in  both  the  cases  -  i.e.  a  large  Htb  Yukawa 
coupling  a  la  eq.  (20). 
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Figure  10.  The  proton  decay  process  arising  from  squark  exchange. 

Search  for  Superparticles  -  Let  me  start  with  a  brief  discussion  of  jR-parity,  which 
underlies  the  canonical  missing-pr  signature  for  superparticle  search.  The  presence 
of  scalar  quarks  and  leptons  (<j,  1}  in  SUSY  imply  baryon  and  lepton  number  vio- 
lating interactions  shown  in  Fig.  10.  Moreover  this  diagram  would  imply  proton 
decay  with  a  typical  time  scale  of  weak  interaction  (TP  ~  10~8  sec!),  since  the  mass 
of  the  exchanged  particle  Mq  is  comparable  to  MW  •  To  forbid  this  catastrophic 
proton  decay  one  assumes  J?-parity  conservation,  where 

R=(-lfB+L+™  (25) 

so  that  it  is  +1  for  all  the  standard  particles  and  —1  for  their  superpartners  differing 
by  half  a  unit  of  spin(S).  It  automatically  forbids  single  emission/absorption  of  a 
superparticle.  It  implies  that  (i)  the  superparticles  are  produced  in  pair;  and  (ii) 
the  lightest  superparticle  (LSP)  resulting  from  their  decay  is  stable.  The  LSP  is 
also  expected  to  be  colour  and  charge  neutral  for  cosmological  reasons;  and  in  most 
SUSY  models  it  turns  out  to  be  the  photino  7.  Finally  the  LSP  is  expected  to 
interact  very  weakly  with  matter  like  the  neutrino  (Fig.  11);  and  hence  escape 
the  detector  without  a  trace.  The  apparent  imbalance  of  transverse  momentum 
(missing-pr)  resulting  from  this  serves  as  a  signature  for  superparticle  production 
[23]. 

The  superparticles  having  the  largest  production  rates  at  the  hadron  colliders 
are  the  strongly  interacting  ones,  i.e.  squark  q  and  gluino  g.  They  are  produced 
via  gluon-gluon  fusion  (Fig.  12) 

99  -*  99  or  qq  (26) 

and  decay  via 

9  -*  «T,  9  ->•  m-  (27) 


The  decay  of  one  of  the  squarks  (gluinos)  into  a  leading  photino  carrying  the  bulk 
of  its  momentum  results  in  a  large  missing-py  event  accompanied  by  one  or  more 
jets.  This  is  illustrated  in  Fig.  13;  the  number  of  visible  jets  depend  on  the  jet 
detection  algorithm.  The  rate  of  such  large  missing-py  events  can  be  predicted  as 


elq) 


elq) 


r   e(q)- 


w 


Figure  11.  Comparison  of  the  rates  of  LSP  (7)  and  V  interaction  with  ordinary  matter 
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Figure  12.  Event  configuration  in  the  transverse  plane  for  a  pair  of  (a)  squark  and  (b) 
gluino  production  at  a  hadron  collider. 
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Figure  13.  Pair  production  of  (a)  gluino  and  (b)  squark  via  gluon-gluon  fusion. 

a  function  of  q  or  g  mass  by  convoluting  their  pair  production  cross-section  with 
the  probability  of  one  of  them  decaying  into  a  leading  7  [24]. 

The  SM  background  for  large  missing-pr  events  come  from  prompt  neutrino 
production  processes,  notably  W  -+  TV  (Z  -*  vv]  accompanied  by  QCD  jets.  The 
size  of  this  background  can  be  estimated  from  the  observed  rate  of  W  -»  tv  (Z  — »•  11} 
accompanied  by  QCD  jets.  Observation  of  no  clear  excess  over  this  background  has 
led  to  lower  mass  limits  of  q  and  g  from  the  CERN  pp  and  Tevatron  colliders.  The 
strongest  mass  limit  of  (12)  is  based  on  the  early  Tevatron  data  with  an  integrated 
luminosity  of  ~  4  pb"1  [9].  With  luminosity  upgradation  it  is  possible  to  extend  to 
search  to  ~  250  GeV.  Finally  the  search  can  be  carried  up  to  the  theoretical  mass 
bound  of  ~  1  TeV  at  the  LHC.  Fig.  14  shows  the  expected  gluino  signal  against 
the  SM  background  at  LHC  [25]. 
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Figure  14.  ISA  JET  Monte  Carlo  prediction  for  missing  —  ET  distribution  after  selection 
cuts  for  (a)  M-g  =  300  GeV  and  (b)  m-g  =  100  GeV  at  LHC.  The  solid  (dashed)  line 
corresponds  to  the  gluino  signal  for  tan/3  =  2(10),  while  the  points  represent  the  total  SM 
background  [25]. 

There  is  a  good  deal  of  recent  interest  in  a  second  type  of  superparticle  signature 
-  i.e.  the  multilepton  and  in  particular  the  like  sign  dilepton  signature.  It  arises 
from  i)  the  leptonic  decay  of  LSP  in  the  fl-parity  violating  SUSY  model  and  ii)  from 
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SUSY  model  as  well. 

i)  It  is  clear  from  Fig.  10  that  proton  stability  requires  B  or  L  conservation,  but 
not  necessarily  both.  Hence  one  can  have  two  types  of  /^-violating  SUSY  models, 
corresponding  to  B  and  L  violation  [25].  The  former  implies  LSP  decay  into  a 
multiquark  channel  which  are  hard  to  distinguish  from  the  QCD  background;  but 
the  latter  implies  a  distinctive  leptonic  decay  of  LSP 

7  -+  %'  (or  tfv).  (28) 

Eqs.  26-28  imply  at  least  2  leptons  in  the  final  state;  and  they  are  expected  to 
have  like  sign  half  the  time,  thanks  to  the  Majorana  nature  of  7.  This  results  in  a 
distinctive  like  sign  dilepton  (LSD)  signature  for  superparticle  production  in  the  R- 
violating  SUSY  model,  analogous  to  the  missing-pr  signature  for  the  H-conserving 
model.  Th6  CDF  dilepton  data  from  the  Tevatron  collider  [11]  has  been  analysed 
in  [26]  to  give  a  mass  limit  of 

MM  >  100  GeV  (29) 

in  the  ^-violating  SUSY  model.  This  is  comparable  to  the  corresponding  mass 
limit  (12)  for  the  72-conserving  model.  Using  the  LSD  signature  one  can  extend 
the  g  search  in  the  J?-violating  SUSY  model  upto  a  mass  range  of  ~-  1  TeV  at  the 
LHC  [27].  Moreover  the  LSD  signature  is  also  relevant  for  g  search  at  LHC  in  the 
/2-conserving  SUSY  model  as  we  see  below. 

ii)  In  the  mass  range  of  a  few  hundred  GeV,  the  gluino  undergoes  cascade  decay 
into  photino  via  W  and  Z.  In  particular 

g^qq'W  (30) 

followed  by 

%  6/7.  (31) 


This  means  a  leptonic  branching  fraction  of  ~  10%  for  g  decay,  i.e.  a  branching 
fraction  of  ~  1%  for  a  dilepton  final  state  resulting  from  the  gg  pair.  Finally  the 
Majorana  nature  of  g  implies  a  LSD  final  state  half  the  time.  Despite  its  small 
branching  fraction  (~  1/2%)  the  LSD  channel  provides  a  viable  gluino  signature 
upto  M-g  ~  1  TeV  at  LHC  because  of  the  small  SM  background. 
Fig.  15  shows  the  expected  gluino  signal  along  with  the  background  in  the  LSD 
channel  as  functions  of  the*  ^Accompanying  missing-pp  [27].  It  may  be  mentioned 
here  that  the  neutral  and  charged  gauginos  (7,  Z  and  W)  mix  with  their  Higgsino 
counterparts;  and  one  has  to  diagonalise  the  resulting  neutralino  and  chargino 
mass  matrices  for  a  quantitative  estimate  of  the  gluino  signal  [27].  Nonetheless  the 
simplified  description  of  the  signal  outlined  above  is  valid  to  a  good  accuracy. 

Conclusion  -  In  summary,  the  searches  for  top  quark,  Higgs  boson(s)  and  possible 
superparticles  are  the  three  main  physics  objectives  of  Tevatron  and  the  LHC.  There 
is  good  reason  to  expect  a  definitive  top  quark  signal  in  the  mass  range  of  —  175 
GeV  to  emerge  from  the  forthcoming  Tevatron  data.  But  one  needs  the  LHC  to 
carry  the  Higgs  and  superparticle  searches  over  the  predicted  mass  range  going  upto 
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Figure  15.  The  expected  LSD  signals  for  different  gluino  masses  (300,600,1000  GeV) 
shown  against  the  accompanying  missing-pr  along  with  the  dominant  background  (dashed 
line)  at  LHC.  The  signal  curves  are  for  the  J?-conserving  SUSY  model  with  tan  ft  —  2  and 
the  Higgsino  mass  parameter  /j  =  4Mw  [27]. 


~  1  TeV.  It  should  be  noted  here  that  the  Higgs  and  supersymmetric  particles  are 
the  minimal  set  of  missing  pieces  which  will  complete  the  picture  of  elementary 
particles  and  their  interactions.  But  of  course  this  is  not  the  only  set.  It  may  very 
well  be  that  the  nature  has  chosen  an  alternative  way  of  completing  this  picture 
with  a  different  (and  larger)  set  of  missing  pieces.  In  that  case  one  expects  to 
see  experimental  signals  of  this  new  physics  alternative  in  lieu  of  the  Higgs  and 
superparticles,  but  still  in  the  energy  range  of  ^  1  TeV.  Thus  one  hopes  that  the 
LHC  data  will  help  to  complete  the  picture  of  elementary  particle  physics  along 
the  lines  outlined  above  (i.e.  the  MSSM),  or  else  provide  crucial  experimental  clue 
pointing  to  the  alternative  route. 
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Abstract.  We  survey  the  different  stages  in  the  evolution  of  the  quark-gluon  plasma 
expected  to  be  produced  in  high  energy  nuclear  collisions,  and  we  discuss  some  specific 
experimental  probes  for  each. 


1.  Introduction 

When  we  study  strongly  interacting  matter  in  the  framework  of  statistical  quantum 
chromodynamics,  we  deal  with  ensembles  of  time-independent  systems.  Adiabat- 
ically,  quark  matter  becomes  hadronic  matter,  or  vice  versa.  Our  systems  are  in 
equilibrium  to  begin  with,  and  they  are  transformed,  very  slowly  and  by  slight  of 
changes  in  external  parameters,  into  other  equilibrium  systems.  Nevertheless,  we 
hope  to  test  the  essential  predictions  of  these  studies,  quark  deconfinement  and 
chiral  symmetry  restoration,  through  nuclear  collisions,  in  which  our  initial  system, 
consisting  of  two  colliding  nuclei,  is  certainly  not  in  any  way  thermal,  and  in  which 
whatever  matter  is  eventually  produced  very  rapidly  expands  and  freezes  out  into 
non-interacting  hadrons.  While  the  basis  of  what  we  try  to  study  indeed  is  given 
by  statistical  QCD,  our  tools  must  clearly  be  adopted  to  the  rapidly  evolving  world 
accessible  to  us  in  high  energy  nuclear  collisions.  Whatever  we  measure  will  reflect 
the  properties  of  the  system  in  some  stage  of  its  evolution,  and  we  have  to  learn 
to  associate  specific  observables  and  specific  stages  of  the  system  we  are  studying. 
In  this  survey,  I  will  try  to  give  some  examples  of  how  to  probe  the  evolution  of 
the  quark-gluon  plasma,  from  two  colliding  parton  beams  to  the  observation  of 
identified  hadrons.  - 

Over  the  past  years,  three  stages  in  high  energy  nuclear  collisions  have  attracted 
particular  attention.  In  the  period  immediately  after  the  collision,  multiple  scatter- 
ing between  the  primary  and  secondary  constituents  is  supposed  to  lead  to  a  rapid 
generation  of  entropy  and  thus  eventually  to  thermalisation.  What  happens  at  this 
stage  is  of  crucial  relevance  for  the  subsequent  properties  of  the  system,  and  much 
important  work  on  this  topic  has  appeared  during  the  past  year,  based  in  particular 
on  event  generators  using  parton  dynamics  [1].  The  only  tools  we  have  to  check  this 
stage  directly  are  hard  early  phenomena  suffering  little  or  no  effect  by  the  further 


at  the  time  they  were  formed  -  i.e.,  they  will  tell  us  about  the  hadronisation  of  the 
quark-gluon  plasma  [2].  If  nuclear  collisions  do  indeed  lead  to  a  thermalised  sys- 
tem of  quarks  and  gluons,  this  hadronisation  stage,  the  final  state  in  the  evolution, 
will  be  independent  of  the  initial  thermal  state,  the  primordial  plasma.  In  other 
words,  the  local  hadronisation  properties  of  a  plasma  initially  at  a  temperature  of 
one  GeV  should  be  no  different  from  those  of  a  plasma  of  two  hundred  MeV  [3]. 
How  can  we  then  probe  the  primordial  plasma,  the  system  we  expect  to  be  hotter 
at  RHIC  than  at  the  SPS,  and  much  hotter  still  at  the  LHC?  The  tools  proposed 
for  this  purpose  include  quarkonium  spectra  [4],  high  mass  thermal  dileptons  and 
hard  thermal  photons  [5],  thermal  charm  [6],  and  hard  jets  [7].  Of  these,  up  to  now 
only  J/^j  suppression  -  predicted  as  a  consequence  of  a  hot  primordial  quark-gluon 
plasma  -  was  actually  observed;  the  analysis  of  charmonium  spectra  is  therefore  of 
particular  interest. 

In  the  following,  I  will  address  some  specific  aspects  of  probing  each  of  the  three 
evolution  stages.  Section  2  will  be  devoted  to  pre-equilibrium  and  thermalisation. 
In  Section  3,  quarkonium  production  will  be  considered  as  a  probe  of  the  early  hot 
and  dense  thermal  stages  in  nuclear  collisions.  Section  4  will  then  deal  with  what 
spectra  of  thermal  hadrons  can  tell  us  about  the  quark-hadron  transition  and  the 
freeze-out  into  free  hadrons. 


2.  Pre-equilibrium  and  thermalisation 

Thermal  equilibrium  can  be  attained  by  multiple  scattering  among  the  constituents 
of  the  incident  nuclei  and  the  secondaries  produced  in  these  interactions.  The  pri- 
mary interactions  must  trigger  a  cascade  of  more  and  more  interacting  secondaries, 
tertiaries,  and  so  on,  leading  to  a  rapid  entropy  increase  and  with  it  a  (local)  loss 
of  memory  of  the  initial  state.  Computer  simulations  of  such  cascades  provide  es- 
timates of  the  time  to  needed  to  reach  local  equilibrium,  and  they  tell  us  what  the 
thermal  conditions  were  at  that  time.  They  specify  the  initial  energy  density  to,  the 
initial  entropy  density  s0,  and  they  can  tell  us  to  what  extent  there  is  "chemical" 
equilibrium  among  the  different  constituent  types,  whether  hadrons  or  partons. 

In  such  simulations,  a  considerable  number  of  input  aspects  has  to  be  specified 
in  order  to  arrive  at  definite  predictions. 

•  What  is  the  nature  of  the  constituents?   They  can  be  hadrons,  constituent 
quarks,  or  the  partons  (massless  quarks  and  gluons)  of  perturbative  QCD. 

®  What  is  the  collision  dynamics?  We  can  start  from  a  phenomenological  pic- 
ture of  hadronic  interactions  (ARC  [8],  dual  parton  model  [9]),  from  string 
interactions  (Venus  [10],  RQMD  [11]  Spacer  [12]),  or  from  perturbative  QCD 
for  partonic  interactions,  with  soft  additions  both  for  low  PT  processes  and 
for  the  hadronisation  of  the  partons  (Hijing  [13],  parton  cascade  model  [14]). 

«  What  is  the  evolution  pattern  for  the  collision?  Here  we  can  consider  inde- 
pendent successive  binary  collisions,  molecular  dynamics  schemes  or  a  more 
detailed  transport  theory  description. 

•  What  collective  interaction  effects  should  be  included?    Since  all  evolution 
patterns  considered  are  approximative,  we  have  to  make  sure  that  such  col- 


shadowing  and  jet  quenching  are  correctly  taken  into  account. 

It  is  thus  clear  that  the  results  of  such  microscopic  evolution  studies  are  necessarily 
quite  model  dependent.  Therefore  an  obvious  requirement  is  that  before  making 
predictions  for  nuclear  collisions,  one  has  to  tune  the  particular  generator  to  repro- 
duce the  available  data  from  hadron-hadron  and  hadron-nucleus  collisions.  Beyond 
this  one  may  hope  that  in  the  high  energy  limit  things  become  less  ambiguous: 
eventually  the  basic  input  should  be  an  internetted  parton  cascade,  based  on  an 
underlying  description  in  terms  of  perturbative  QCD.  The  past  year  has  brought  a 
number  of  interesting  studies  in  this  direction.  Here  we  can  only  summarize  their 
results;  for  details,  we  refer  to  [15]  [16]. 
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Figure  1.  Entropy  per  parton  as  function  of  time  after  the  collision,  for  different  incident 
CMS  energies  ,/s/A;  from  [16]. 

The  specific  entropy  (entropy  per  secondary)  increases  very  rapidly  at  high 
energies  [16],  as  seen  in  Fig.  1.  While  at  RHIC  energy  it  is  still  compatible  with 
the  one  fermi  generally  assumed  to  be  the  equilibration  time  at  present  energies, 
we  approach  the  expected  value  S/N  =  4  in  less  than  0.5  fm  for  LHC  energy.  This 
rapid  equilibration  is  also  reflected  in  increasingly  shorter  relaxation  times  (Fig.  2). 
As  a  consequence,  the  initial  temperatures  become  higher  (Fig.  3),  since  we  can 
now  start  considering  the  system  as  thermal  at  earlier  times. 

On  the  other  hand,  these  high  energy  considerations  do  not  lead  to  a  corre- 
sponding rapid  onset  of  chemical  equilibrium.  As  had  been  pointed  out  quite  some 
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Figure  2.  Relaxation  time  as  function  of  the  beam  energy  ^/s/A;  from  [16]. 
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Figure  3.  Central  temperature  as  function  of  time  after  the  collision,  for  different  incident 
CMS  energies  -\/s/A;  from  [16]. 


time  ago  [17],  the  gluon  interaction  cross  sections,  including  those  leading  to  fur- 
ther gluon  production,  are  considerably  larger  than  their  quark  counterparts.  This 
results  in  a  perturbative  QCD  cascade  with  a  very  rapid  gluon  thermalisation,  so 
that  the  entropy  of  the  gluons  soon  reaches  its  equilibrium  value.  The  entropy  of 
the  quarks,  although  it  also  increases  quickly,  appears  to  remain  at  about  60  -  70% 
of  its  equilibrium  value  for  quite  some  time  (Fig.  4).  This  behaviour  has  therefore 
been  called  the  "hot  gluon  scenario"  [6].  We  should  note,  however,  that  it  does 
not  make  the  early  thermal  system  a  gluon  plasma.  A  quark-gluon  plasma  (with 
2.5  light  quark  flavours)  in  both  thermal  and  chemical  equilibrium  would  contain 
quarks  and  gluons  in  a  ratio  of  about  1.6:1;  in  the  hot  gluon  scenario  we  have 
instead  a  ratio  of  about  1:1,  so  that  there  are  still  as  many  quarks  as  there  are 
gluons. 
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Figure  4.  Parton  densities,  normalised  to  equilibrium  values,  for  different  parton  species, 
as  functions  of  the  proper  time  after  the  collision;  from  [16]. 

The  basis  of  these  considerations  is  a  dominance  of  perturbative  QCD  processes 
for  the  overall  behaviour  of  the  system  at  RHIC  and  in  particular  at  LHC  energies. 


How  can  we  test  if  this  is  correct?  Two  basic  predictions  of  the  partonic  pre- 
equilibration  stage  are  enhanced  open  charm  [18]  and  enhanced  high  mass  dilepton 
production  [19].  The  former  arises  because  the  abundant  presence  of  hard  secondary 
(and  tertiary,  etc.)  gluons,  the  second  since  the  fewer  quarks  are  therefore  harder. 
Since  there  are  more  hard  gluons  than  quarks,  open  charm  production  will,  however, 
be  enhanced  more  than  that  of  high  mass  dileptons. 

Here  two  caveats  seem  appropriate.  First  of  all,  we  have  to  say  with  respect 
to  what  the  enhancement  is  to  be  measured.  With  decreasing  fractional  parton 
momentum  x  ~  (M /%/«),  the  gluon  distributions  in  hadrons  are  expected  to  grow 
more  strongly  than  those  for  quarks,  so  that  increasing  energies  will  presumably 
lead  to  an  increase  of  open  charm  relative  to  Drell-Yan  productions  even  in  p  —  p 
collisions.  Hence  we  will  always  need  hadron-hadron  collision  data  as  reference 
to  define  the  predicted  pre-equilibrium  enhancement  in  nucleus-nucleus  collisions. 
Secondly,  the  high  energy  predictions  for  nuclear  collisions  have  not  yet  been  ex- 
trapolated down  to  lower  energies  where  there  is  data  to  compare  them  to.  This 
is  obviously  not  trivial,  since  at  lower  energies  the  soft  components  are  crucial.  As 
long  as  their  role  is  not  checked  specifically  for  the  noted  processes*  -open  charm, 
Drell-Yan  production),  it  is  not  clear  how  reliable  the  high  energy  predictions  really 
are. 

We  summarize  this  section  by  noting  that  the  production  of  open  charm  and 
beauty  as  well  as  that  of  high  mass  dileptons  should  provide  good  tools  to  probe 
the  pre-equilibrium  stage  and  the  onset  of  thermalisation. 

3.  The  primordial  plasma 

We  now  turn  to  the  question  of  how  we  can  verify  if  the  systems  produced  in  high 
energy  nucleus-nucleus  collisions  were  in  their  early  stages  indeed  deconfined  states 
of  matter.  As  already  noted,  for  this  we  need  probes  which  are  formed  in  the 
primordial  stage,  and  only  then,  in  order  to  avoid  the  subsequent  "destruction  of 
memory"  in  a  thermal  evolution.  There  are  essentially  three  probes  of  this  type 
which  have  so  far  been  considered:  quarkonium  states  (J/ip,  $' ,  T,  T'),  real  or 
virtual  photons  (dileptons),  and  hard  jets.  We  shall  here  concentrate  on  the  first  of 
these  three,  since  it  constitutes  the  only  predicted  signature  which  has  in  fact  been 
observed  experimentally,  since  it  has  been  studied  most  extensively  (including,  in 
particular,  also  competing  alternative  mechanisms),  and  finally  since  it  illustrates 
the  kind  of  analysis  necessary  before  we  can  arrive  at  any  conclusions.  The  topic 
of  this  section  thus  is  quarkonium  production  as  probe  of  the  early  quark-gluon 
plasma;  to  have  a  definite  process,  we  shall  talk  about  charmonium  (J/t/>  and  V')> 
but  everything  is  applicable  to  bottonium  production  as  well. 

The  basic  physics  of  the  probe  is  quite  simple.  Charmonium  production  in 
hadronic  collisions  occurs  in  two  stages.  We  have  first  a  hard  process,  the  fusion  of 
two  gluons  to  form  a  (coloured)  cc  pair,  and  then  subsequently  soft  colour  neutral- 
isation and  resonance  binding.  The  first  process,  presumably  described  in  terms  of 
perturbative  QCD,  takes  only  about  M"1  ~  0.1  fm  or  less;  the  second  needs  about 
0.3  -  0  j>  fm  [20]  [21].  If  nuclear  collisions  lead  to  the  formation  of  a  deconfined 
medium,  then  the  second  step  is  inhibited  in  such  collisions,  the  c  and  the  c  fly 
apart  and  eventually  form  open  charm  D  mesons.  In  contrast,  the  overall  Drell- 


spectrum  from  high  energy  nuclear  collisions,  we  thus  expect  a  suppression  of  the 
J/t/>  and  t/>'  signals  relative  to  the  Drell-Yan  continuum,  if  there  was  quark-gluon 
plasma  production  [4].  This  suppression  should  manifest  itself  in  a  smaller  signal- 
to-continuum  ratio  in  nucleus-nucleus  collisions  when  compared  to  hadron-hadron 
and  hadron-nucleus  collisions,  and  it  should  become  stronger  with  increasing  energy 
density  in  nucleus-nucleus  collisions.  Both  these  effects  are  in  fact  observed  [22], 
and  we  thus  have  to  interpret  these  observations. 
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Figure  5.  Drell-Yan  rates  in  p  —  p  collisions  [45],  compared  to  calculations  using  quark 
structure  functions  (dashed  line)  [46]  and  to  a  fit  [47]. 

Our  analysis  of  experimental  results  will  proceed  in  several  steps.  To  start  with, 
we  show  that  the  overall  charmonium  and  Drell-Yan  production  rates  in  hadron- 
hadron  collisions  are  basically  understood,  the  first  in  terms  of  gluon  fusion,  the 
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Figure  6.    J/V*  production  rates  in  p  —  p  collisions,  as  obtained  for  MRS  and  scaling 
structure  functions  [48]  and  from  a  fit  [47],  compared  to  experimental  results  [49]. 


second  through  quark-antiquark  annihilation.  In  Figs.  5  and  6  we  show  p  —  p  data 
for  these  processes,  compared  to  perturbative  predictions  using  different  structure 
function  parametrisations.  The  data,  available  at  present  for  CMS  energies  up 
to  v/s  —  100  GeV,  are  quite  well  accounted  for.  Beyond  that  energy,  we  need  the 
structure  functions  at  very  small  x.  HERA  will  provided  this;  but  it  would  certainly 
be  good  to  have  independent  p  —  p  data  at  higher  energies.  We  note  further  that 
gluon  fusion  as  basic  mechanism  for  charmonium  production  is  nicely  supported  by 
recent  data  comparing  results  obtained  with  incident  p,  ir+,  TT~  and  p  beams  [23]. 
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Figure  7.  J/ij)  and  ij>'  production  in  p  —  A  vs.  p  —  p  collisions,  as  function  of  A  [25]. 

Next  we  turn  to  hadron-nucleus  collisions.  It  is  found  [24]  [25]  that  charmonium 
production  in  such  collisions  (and  that  of  bottonium  as  well)  is  suppressed  with 
increasing  A,  in  comparison  to  the  corresponding  production  on  hadrons  (Figs.  7 
and  8).  It  thus  seems  natural  to  ask  if  the  suppression  found  in  nucleus-nucleus 
collisions  could  be  due  to  the  same  mechanism  as  in  hadron-nucleus  reactions;  if 
this  were  possible,  the  suppression  in  nucleus-nucleus  collisions  could  of  course  not 
be  interpreted  in  terms  of  deconfinement.  Theoretically,  such  a  common  mechanism 
is  difficult  to  justify.  The  kinematically  accessible  region  in  h  —  A  collisions  implies 
that  the  produced  cc  pairs  leave  the  nucleus  as  a  small  colour  octet  state  and  reach 
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Figure  8.  J/V>  and  ^'  production  in  j>  —  W  vs.  ;>  —  p  collisions,  as  function  of  IF  [25]. 


physical  resonance  size  only  far  outside  the  target;  in  A  —  B  collisions,  on  the 
other  hand,  fully  formed  resonances  can  interact  with  partially  stopped  strongly 
interacting  matter.  A  specific  consequence  of  the  pre-hadronic  character  of  the  cc 
in  h  —  A  collisions  is  that  the  suppression  of  the  J/  V>  and  the  ^'  should  there  be 
the  same;  this  is  in  fact  observed  (see  Figs.  7  and  8).  The  relevant  mechanism  for 
the  suppression  in  h  —  A  collisions  is  thus  expected  to  be  a  combination  of  colour 
interactions  [21]  [26]  and  gluon  shadowing  [27],  and  this  does  in  fact  account  for  all 
data  in  the  region  0  <  XF  <  1,  as  seen  in  Figs.  9  and  10  [21]. 

We  now  return  to  the  suppression  in  nucleus-nucleus  collisions.  As  already 
noted,  the  accessible  kinematic  region  (0  <  XF  <  0.2)  allows  the  formation  of  phys- 
ical resonances  in  the  medium.  The  observed  suppression  could  thus  be  due  to  some 
type  of  absorption  in  confined  matter  at  very  high  density  [28]  as  well  to  deconfine- 
ment,  and  we  have  to  find  a  way  to  distinguish  between  the  two  mechanisms.  But 
whatever  the  origin  of  the  suppression,  the  existence  of  physical  resonance  states 
predicts  that  the  V>'  ,  because  of  its  larger  geometric  size,  will  suffer  a  stronger  sup- 
pression than  the  J/ij>.  Specifically,  this  means  that  the  ratio  V'VC^/V')  production 
in  A  —  B  collisions  should  be  smaller  than  in  h  —  A  collisions,  and  that  it  should 
decrease  with  increasing  energy  density  (or  equivalently,  with  ET)-  Both  predic- 
tions are  observed  [29]  [30],  as  shown  in  Fig.  11.  We  therefore  conclude  that  the 
A-dependent  charmonium  suppression  in  h  —  A  collisions  and  the  ^-dependent 
suppression  in  A  —  A  collisions  have  different  origins. 

The  last  step  of  our  analysis  should  now  tell  us  what  the  origin  of  the  char- 
monium suppression  in  A  -  B  collisions  is.  We  recall  that  the  data  [22]  shows  a 
J/t/i  suppression  increasing  with  ET,  up  to  a  reduction  of  signal-above-continuum 
of  about  50%  at  the  highest  ET  values.  While  the  continuum  was  up  to  now  mainly 
determined  by  the  mass  range  below  the  J/V>,  the  suppression  is  now  established 
[30]  also  through  the  high  mass  continuum  above  the  \l>'  ,  and  it  persists  when  the 
gluon  shadowing  effects  determined  in  h  —  A  collisions  are  removed  [31].  A  similar 
but  stronger  suppression  is  found  for  the  1/1';  moreover,  the  ratio  ^'/(J/^)  decreases 
with  ET.  As  is  known,  these  features  can  be  accounted  for  both  by  deconfinement 
(quark-gluon  plasma  formation)  and  by  absorption  in  a  hadronic  medium  of  ex- 
treme density  [31].  In  the  deconfinement  picture,  a  charmonium  state  x  survives 
if  the  energy  density  e  of  the  medium  is  less  than  the  value  ex  needed  to  dissolve 
it.  In  nuclear  collisions,  with  a  hot  inner  and  a  cooler  outer  interaction  region,  the 
survival  probability  thus  becomes 


s  = 


where  Vc*ld  is  that  part  of  the  interaction  volume  V  in  which  e  <  ex  .  With  t  propor- 
tional to  the  nucleon  density  and  with  standard  nuclear  geometry  (equidistribution 
of  nucleons  in  the  nucleus),  this  gives  us 

SdM  =  6(6,  -  e)  +          9/4  6(e  -  c.)  (2) 


for  the  survival  probability  of  charmonium  state  x.  In  the  case  of  absorption  in  an 
expanding  medium,  on  the  other  hand,  this  survival  probability  is  given  by 

S°(e)  =  exp{-(rxn(€)r0  Info/r,  )};  (3) 
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Figure  9.  J/V>  production  in  p  —  Pi  and  p  —  W  vs.  p  —  p  collisions  for  different  -^1 
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The  evolution  of  the  quark- gluon  plasma 
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Figure  11.  The  ratio  of  ^'  to  J/0  production  in  p  —  A  and  5  —  U  collisions  as  function 
of  the  initial  energy  density  fo  [30]. 


at  energy  density  e,  TO  the  equilibration  time  and  Tf  the  time  at  which  the  medium 
becomes  too  dilute  to  provide  absorption. 
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Figure  12.  J/tfr  suppression  as  function  of  the  initial  energy  density  eo  for  deconfinement 
and  absorption  in  dense  matter  [31]. 


The  two  Jf^)  suppression  patterns  resulting  from  eqs.  (2)  and  (3)  are  shown  in  Fig. 
12,  together  with  the  presently  available  data  [31].  Both  are  seen  to  account  for 
the  experimental  results.  We  should  note,  however,  that  in  both  approaches  the 
agreement  with  the  data  implies  an  energy  density  of  2  -  3  GeV/fm3  at  the  high  end. 
In  the  deconfinement  picture,  such  a  density  results  in  a  quark-gluon  plasma  and 
hence  charmonium  melting.  In  the  hadronic  absorption  scenario,  however,  it  means 
a  density  of  four  or  more  hadrons  per  normal  hadronic  volume  [32],  and  the  J/tp  or 
V1'  must  interact  independently  with  each  of  these  hadrons.  It  is  conceptually  not 
clear  how  this  can  take  place. 

What  qualitative  differences  are  there  between  the  two  suppression  mechanisms? 
We  note  two  possible  effects  to  look  for. 

•  The  pure  15  $  state  needs  a  higher  energy  density  to  "melt"  than  the  larger 
excited  Xc  state,  which  decays  into  a  $.  The  observed  J/^'s  are  about  70% 
directly  produced  and  about  30%  due  to  x-decays.  The  J/$  survival  proba- 
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bility  thus  becomes 
Sf(€)     =      0.3 

+  0.7 

where  the  first  term  corresponds  to  the  (earlier)  suppression  of  the  x  corr 
ponent  and  the  second  to  that  of  the  direct  V-  As  a  result,  there  is  th 
"bump"  in  the  deconfinement  suppression  pattern  seen  in  Fig.  12,  while  al 
sorption  leads  to  a  monotonic  decrease  in  f .  It  is  quite  possible  that  quantui 
effects  will  wash  out  the  two-component  structure  with  its  resulting  diffei 
ent  "melting  points";  but  if  observed,  if  would  certainly  provide  evidence  fc 
deconfinement. 

At  any  given  density,  larger  resonances  are  always  absorbed  more  than  smalle 
hence  in  the  absorption  scenario,  the  ratio  VJ'/(^/V))  must  vanish  with  ir 
creasing  density.  On  the  other  hand,  for  a  sufficiently  high  given  density 
deconfinement  will  melt  all  resonances;  any  survivors  must  come  from  th 
cool  and  hence  non-deconfining  outer  regions  of  the  interaction  zone,  whic 
are  determined  by  nuclear  geometry.  As  a  consequence,  the  ratio 
takes  on  the  constant  value 


for   €  >  €<p  (5 


in  the  case  of  deconfinement j  while  for  absorption  it  vanishes  exponential! 
in  e,  as  seen  in  Fig.  13  [31]. 

At  present,  the  statistics  and  the  energy  density  range  available  do  not  yet  allo1 
any  decision  on  the  origin  of  charmonium  suppression  in  A  —  B  collisions.  We  ca 
conclude,  however,  that  quarkonium  production  does  give  us  a  tool  to  probe  th 
primordial  state,  and  this  tool  has  so  far  indicated  the  existence  of  dense  primordi; 
matter. 

4.  The  quark-hadron  transition 

The  first  goal  of  high  energy  nuclear  collisions  is  to  produce  strongly  interactin 
matter.  We  thus  have  to  find  a  way  to  check  if  the  systems  produced  in  sue 
collisions  are  indeed  in  local  thermal  equilibrium.  We  hope  that  a  detailed  study  < 
thermal  hadrons  (i.e.,  those  produced  in  the  break-up  of  the  quark-gluon  plasm; 
will  provide  a  tool  for  this.  Such  hadrons  will  give  us  direct  information  about  tl 
system  at  the  time  they  were  formed:  about  its  equilibration,  size,  temperatur 
flow  structure,  and  more.  The  transition  from  a  quark-gluon  plasma  to  hadron 
matter  is  one  of  the  most  fascinating  features  of  strong  interaction  thermodynamic 
It  has  attracted  much  interest,  in  statistical  QCD  as  well  as  in  the  context 
phenomenological  models,  and  it  is  clearly  of  great  importance  to  find  ways 
study  it  experimentally.  If  the  quark-hadron  transition  and  the  final  freeze-oi 
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Figure  13.  The  ratio  of  V>'  to  J/V>  svippiession  as  function  of  the  initial  energy  density 
CD  for  deconfinement  and  absorption  in  dense  matter  [31]. 
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are  not  far  apart,  it  is  quite  likely  that  thermal  hadrons  will  retain  some  transitio 
features  and  thus  provide  us  with  information  about  the  critical  region.  In  additioi 
thermal  hadrons  can  of  course  also  give  us  indirect  information  about  prior  stages, 
the  evolution  pattern  of  the  system  is  known;  we  could  then  use  them  to  determin 
the  initial  entropy  and  energy  density  or  the  intial  temperature.  Here  we  will  no 
assume  the  evolution  to  be  known  and  thus  consider  thermal  hadron  production  a 
a  way  to  probe  the  quark-haflron  transition  and  the  freeze-out  era. 
The  basic  questions  here  seem  to  be  these: 

•  Are  the  finally  emitted  hadrons  in  thermal  equilibrium? 

•  Is  there  one  universal  freeze-out  point,  or  do  strong  interactions  stop  for  som 
hadrons  earlier  than  for  others? 

Both  questions  appearently  deal  only  with  hadrons  made  up  of  light  quarks;  char 
monium  and  bottonium  states,  for  example,  presumably  never  partake  in  any  ther 
mal  equilibrium.  Because  of  the  miniscule  Boltzmann  factor  (exp  —  (MJT)  ~  10"1 
for  T  ~  300  MeV),  the  thermal  production  of  these  states  is  ruled  out  at  all  mean 
ingful  temperatures. 

Thermal  equilibrium  can  appear  in  two  forms.  A  system  containing  differed 
particle  species  can  be  in  overall  thermal  equilibrium;  this  means  that  each  specie 
has  thermal  momentum  distributions,  and  that  the  relative  abundance  of  the  differ 
ent  species  is  governed  by  their  thermal  weights,  dependent  on  masses  and  interna 
degrees  of  freedom.  Thermal  equilibrium  for  a  given  species  is,  curiously  enough 
not  so  easy  to  check  in  high  energy  nuclear  collisions.  It  implies  that  the  distribu 
tions  of  the  emitted  mesons  and  baryons  should  be  of  Boltzmann  form, 


dmT 

with  WIT  —  (m2  -f  Pr)1^  denoting  the  transverse  energy  of  a  hadron  of  mass  m 
It  so  happens  that  already  the  distributions  of  secondaries  in  p  —  p  reactions  (an< 
with  some  caveats,  even  in  e+-e~  annihilation)  have  this  form.  This  either  in 
dicates  that  already  these  processes  lead  to  thermal  behaviour  -  a  point  of  viev 
explored  a  number  of  times  over  the  past  fourty  years  [33]  -  or  it  is  due  to  othe 
reasons  (geometry,  uncertainty  effects).  In  either  case  we  have  no  way  to  test  ther 
mal  behaviour  for  a  given  particle  species  specifically  for  nucleus-nucleus  collisions 
We  can,  however,  check  if  the  different  hadron  species  are  produced  according  t< 
some  thermal  weights,  and  this  study  has  led  to  very  interesting  results.  It  starte< 
with  the  prediction  of  strangeness  enhancement  in  nuclear  collisions  and  was  sub 
sequently  developed  into  a  rather  well-defined  method  to  probe  thermalisation  i; 
this  more  "chemically  equilibrated"  sense  [2]. 

If  at  freeze-out  we  do  indeed  have  an  ideal  gas  of  hadrons  and  hadronic  resc 
nances,  then  the  system  is  described  by  the  partition  function 


(7 
Here  W{  is  the  phase  space  factor  for  non-strange  mesons  of  species  t,  while  th 


in  question.  The  baryon-mimber  fugacity  is  defined  as  IB  —  eyp(f*B/T)  and  that 
for  the  strangeness  as  /5  =  ex.p(ns/T).  The  phase  space  factors  are  given  by 


with  di  denoting  the  degeneracy,  m;  the  mass  of  hadron  species  i;  V  is  the  volume 
of  the  system.  For  the  temperature  range  presently  of  interest  (T  <  250  MeV), 
it  is  sufficient  to  include  in  the  sums  in  eq.  (7)  all  known  hadrons  up  to  a  mass 
of  2  GeV.  The  partition  function  (7)  then  determines  all  thermal  properties  of  the 
system  in  terms  of  the  three  parameters  T,^B  and  ps-  One  of  these,  say  /is, 
can  be  eliminated  by  requiring  the  overall  strangeness  of  the  system  to  vanish.  If 
there  is  a  one-stage  freeze-out  of  all  thermal  hadrons,  then  all  production  ratios 
(K/ir,  ir/p,  TT/p,  K/Y,Y/Y,  S/Y,...)  are  given  in  terms  of  the  two  parameters  !Z^and 
HB-  In  other  words,  we  can  take  two  measured  ratios,  such  as  A/A  and  S/S,  to 
fix  the  values  of  T  and  HB,  and  all  other  ratios  are  then  predicted,  if  the  different 
particle  species  are  indeed  present  according  to  their  equilibrium  weights.  Although 
previous  analyses  had  been  in  accord  with  this  for  all  strange  baryon  and  meson 
ratios  at  T  ~  200  MeV  [34,  38],  HB  -  300  MeV  [39],  newer  and  more  precise 
data  [4Q]  no  longer  provide  a  common  equilibrium  freeze-out  point  even  for  strange 
hadrons  (Fig.  14). 

Bringing  in  non-strange  hadrons  had  always  led  to  disagreement,  with  lower  freeze- 
out  temperatures  (T  ~  100  -  140  MeV)  as  well  as  lower  baryochemical  potential 
((IB  —  200  MeV).  It  is  possible,  however,  to  retain  a  common  freeze-out  for  all 
strange  hadrons  by  introducing  partial  strangeness  saturation,  i.e.,  by  assuming 
that  strangeness  is  not  yet  fully  present  in  the  system  to  the  extent  it  should  be  if 
there  was  complete  chemical  equilibrium  [41]  With  this  assumption,  the  fugacities 
Is  and  Ig1  in  eq.  (7)  are  multiplied  by  a  strangeness  saturation  factor  7  <  1.  This 
results  in  T  ~  200  MeV  and  ps  —  250  MeV  as  freeze-out  parameters  for  the 
(partially  equilibrated)  strange  hadrons  [42],  as  seen  in  Fig.  15. 
The  unique  freeze-out  point  for  hyperons  and  cascade  particles  (and  their  antipar- 
ticles)  arises  by  construction,  of  course,  since  we  have  three  independent  ratios  and 
three  parameters.  The  freeze-out  point  is  also  in  accord  with  present  A./K°.  One 
basic  problem  in  this  type  of  analysis  is  the  availability  of  data  in  the  same  kine- 
matic regions  and  for  the  same  collision  partners.  It  will  be  interesting  to  follow 
the  further  development  of  the  freeze-out  study  for  both  strange  and  non-strange 
hadrons.  All  present  results  do  suggest,  however,  that  there  is  no  unique  freeze-out 
point  for  all  hadrons. 

It  therefore  seems  reasonable  to  consider  when  freeze-out  will  occur.  One  pos- 
sible definition  is  given  by  requiring  the  mean  free  path  of  a  hadron  in  the  medium 
to  reach  the  size  of  the  system.  This  leads  to 

1*  (9) 

for  the  freeze-out  radius  Rp\  here  cr  denotes  the  interaction  cross-section  for  the 
hadron  in  question.  A  hadron  species  with  a  bigger  cross-section  should  thus  un- 
dergo a  later  freeze-out  [43].  If  we  assume  strange  hadrons  to  have  an  interaction 
cross-section  half  the  size  of  non-strange  cross-sections  (this  is  the  case  for  the  Kp 
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Figure  14.    Strange  baryon  production  ratios  as  function  of  temperature  and  baryo- 
chemical  potential,  with  full  strangeness  saturation  [42]. 
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Figure  15.    Strange  baryon  production  ratios  as  function  of  temperature  and  baryo- 
chemical  potential,  with  partial  strangeness  saturation  [42]. 
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vs.  Tip  cross-secuons  ai  i  v^ev/c  iao  inomemuiny,  men  we  get  ftp  ~  u./  /!£  .  inis 
result  can  be  checked  independently  by  hadron  interferometry  studies,  which  deter- 
mine the  freeze-out  radii  for  pions  and  kaons;  new  data  [44]  give  at  least  qualitative 
agreement.  In  the  case  of  an  isentropic  longitudinal  expansion,  these  ./^-values 
imply  Tp  ~  190  MeV  if  7j*  ~  130  MeV,  in  reasonable  accord  with  what  we  found 
above.  Hadron  spectra  thus  seem  to  suggest  that  freeze-out  is  a  sequential  pro- 
cess, determined  by  the  different  hadronic  interaction  cross-sections;  however,  this 
conclusion  is  at  present  still  of  a  very  qualitative  nature. 

Nevertheless,  we  have  seen  that  hadron  spectra  and  correlations  provide  us  with 
a  good  probe  for  the  study  of  hadronisation  and  the  quark-hadron  transition  region. 

5.  Conclusions 

We  have  seen  that  different  signals  probe  different  stages  in  the  evolution  of  the 
quark-gluon  plasma.  It  is  therefore  necessary  to  clarify  what  we  can  learn  from 
any  given  signal,  and  from  what  stage  it  brings  us  information.  We  saw  that  high 
mass  Drell-Yan  pairs  and  open  charm  mesons  probe  the  pre-equilibrium  stage;  the 
suppression  of  charmonium  and  bottonium  production  provides  a  tool  for  the  study 
of  the  hot  primordial  QGP;  hadronic  spectra  and  correlations  give  us  information 
about  the  quark-hadron  transition  and  freeze-out.  Clearly  they  are  just  some  of  the 
possible  signals  and  probes;  there  are  many  others  -  thermal  dileptons,  direct  pho- 
tons, jet  production,  hadron  mass  modifications  in  media,  flow  effects,  strangelets, 
to  name  some  that  have  so  far  been  studied  quite  extensively.  We  will  certainly  need 
all  of  them  in  order  to  establish  the  complete  evolution  pattern  of  the  quark-gluon 
plasma. 
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The  rare  decays  of  the  K  meson  have  had  a  long  tradition  as  a  laboratory  for  testing 
the  symmetry  properties  of  the  weak  interactions,  and  the  manner  in  which  these 
symmetries  are  broken  by  higher  order  effects.  Present-day  interest  is  focussed  on 
decays  that  are  suppressed  by  CP-symmetry  or  GIM  symmetry.  Such  decays,  in 
the  standard  theory,  are  sensitive  to  effects  of  the  virtual  top  quark,  and  could 
also  reveal  new  interactions  transcending  the  standard  model.  In  addition,  the 
radiative  decays  of  the  K  meson  have  become  a  useful  testing-ground  for  effective 
Lagrangians  describing  the  low  energy  interactions  of  pions,  kaons  and  photons. 
This  talk  is  a  selective  review  of  some  rare  K  processes.  For  a  more  comprehensive 
discussion,  we  refer  to  the  reviews  listed  in  [1], 

1.  Charged  current  rarities 

An  example  of  a  decay  that  is  rare,  yet  allowed  in  the  lowest  order,  is  the  AS  =  0 
transition  K°  — >•  K+e~ve  (Fig.  la).  This  is  closely  analogous  to  pion  /?-decay, 
TT~  — *  7r°e~z7e.  Conservation  of  the  vector  current  uj^d  dictates  that  the  relevant 
matrix  element  is  (K°\I-\K+)  =  1  (in  analogy  to  (7r°|7_|7r+)  =  \/2)  and  the 
predicted  branching  ratio  is 

B(KL-*K+e-ve}  =  Zx  10"9.  (1) 

A  curious  analogue  of  the  AS  =  AQ  decay  K+  —»•  7r°e+i/e  is  the  hypothetical 
transition  K+  —»•  "r/"e+j/e-  The  latter  is  not  observable  as  a  real  decay  process, 
since  m^  >  m^.  The  matrix  elements  for  K+  -+  rj  and  K+  — *•  TT°  are  —  l/v/6  and 
l/v/2,  respectively.  A  possible  way  to  probe  the  K+  — +•  rj  coupling  is  via  the  decay 

A-+-*77e+i/e.  (2) 

Some  of  the  relevant  diagrams  are  shown  in  Fig.  Ib.  An  estimate  of  this  decay 
would  be  of  interest. 

In  the  second  order  of  weak  interactions,  it  is  possible  to  obtain  a  A5  =  —AQ 
transition  K°  -»  Tr+e~i7e  (Fig.  Ic).  A  model  based  on  the  parity-conserving 
vertices  KQ  -  TT°  and  K+  -  TT+  ,  combined  with  the  A5  =  AQ  transitions  TT°  — >• 
7T+r  vi  and  K°  -+  K+l~vi  gives  [2] 


the  transition  ir~  -+  \i   v^,  yields  [3] 


This  second  mechanism  produces  a  AS  =  — AQ  amplitude  K°  — »•  TT+/J~^  with 
a  helicity  structure  and  pion  energy  spectrum  that  is  quite  different  from  that  in 
the  AS  =  AQ  transition  K°  — *  Tr+fi~]7tl.  To  the  extent  that  both  mechanisms 
are  possible,  the  e/p  ratio  in  AS  =  -AQ  transitions  will  be  different  from  that  in 
AS  =  AQ  decays. 

2.  Neutral  current  decay  into  /+/"  pairs 

2.1  Decay  KL-—*  H+P~ 

The  decay  KL  — *  ^+A*~  has  a  well-known  unitarity  bound  [4] 


associated  with  the  27  intermediate  state  (Fig.  2a).  The  corresponding  branching 
ratio  is 


=  (6.8i0.3)xlO-9.  (6) 

If  one  includes  the  dispersive  part  of  the  27  amplitude,  as  well  as  the  real  amplitude 
induced  by  the  short-distance  interaction  sd  — >•  /^+/i~  (box  and  penguin  diagrams, 
Fig.  2b),  the  full  branching  ratio  is 

D/  If       ,.4"  ,,  —  \  D  i     |      /R .  I          /  D '.         |2  f'J\ 

•i-*  \  •**•  Lr          A*     /*     7  "~  -^flo*    i    I  Y  ^disp  -C  y  •'-'short  — dist,  |    •  \ '  / 

A  theoretical  estimate  of  Bdiap  requires  a  model  for  the  KL  —*  7*7*  form  factor. 
Vector-meson-dominance  models  [5],  similar  to  those  used  for  7r°,  r]  — +  7*7*,  tend 
to  give  Bdisp/ Baits  <  0.1  (to  be  compared  with  the  experimental  rate  of  r)  — *  /^+^~  > 
B(ij  ->•  /^+/i~)  =  (1.3  ±  0.1)  x  Babi(r)  -+  /i+/^-)).  A  model  for  tfL  -^  7*7  [6],  that 
describes  the  observed  spectrum  of  Dalitz  pairs  in  KL  — *  je+e~  [7],  yields  an  even 
smaller  value  of  B,aip. 

The  short-distance  branching  ratio  has  been  calculated  to  be  [8,  9] 

tf£ ->0+/T)  =  6.4  x  1(T3 

i(KL  -*  2^)  -I-  ReAty(xt)]  (8) 


where 

Xt   a<~ 


and 

A!  =  Kt; Vid  =  -A2\\l  -p-  i,,).  (10) 

The  term  PQ(KL  — >•  2/f)  denotes  a  residual  charm  quark  contribution,  estimated  to 
be  -0.75  x  10~4,  for  mc  =  1.4  GeV,  A  =  200  MeV. 


*v --^  =  1.1  x  10   "        (AMs  only). 

F(AS  =  +AQ) 

This  second  mechanism  produces  a  A5  =  -AQ  amplitude  JK°  — »•  ir+H~Vp.  w 
a  helicity  structure  and  pion  energy  spectrum  that  is  quite  different  from  that 
the  AS  =  AQ  transition  K°  -+  T+jT"!^.  To  the  extent  that  both  mechanis 
are  possible,  the  e/fi  ratio  in  AS  =  — AQ  transitions  will  be  different  from  that 
AS  =  AQ  decays. 

2.  Neutral  current  decay  into  /+/~  pairs 


2.1  Decay  KL  •-*  H*  H~ 

The  decay  KL  —  »•  ^+//~  has  a  well-known  unitarity  bound  [4] 


associated  with  the  27  intermediate  state  (Fig.  2a).  The  corresponding  branch  i 
ratio  is 

Bata  =  (6.8  ±  0.3)  x  1(T9.  ( 

If  one  includes  the  dispersive  part  of  the  2j  amplitude,  as  well  as  the  real  amplitu 
induced  by  the  short-distance  interaction  sd  —  »•  p.+n~  (box  and  penguin  diagran 
Fig.  2b),  the  full  branching  ratio  is 


A  theoretical  estimate  of  BA-,p  requires  a  model  for  the  A^  —  ».  7*7*  form  factt 
Vector-meson-dominance  models  [5],  similar  to  those  used  for  TT°,  -q  —  »•  7*7*,  tei 
to  give  Bdiap/Babs  <  0.1  (to  be  compared  with  the  experimental  rate  of  17  —  »•  //+  n 
B(T)  -H.  /i+^-)  =  (1.3  ±  0.1)  x  Bflj5(i7-H.  /!+/!-)).  A  model  for  ^  -.7*7  [6],  th 
describes  the  observed  spectrum  of  Dalitz  pairs  in  ATL  -+  je+e~:  [7],  yields  an  ev« 
smaller  value  of  Bdi,p. 

The  short-distance  branching  ratio  has  been  calculated  to  be  [8,  9] 

B  short-  di,t.(KL  -»•  n+V~)  =  6.4  x  10~3 


( 
where 


and 


The  term  PQ(KL  -*  2/i)  denotes  a  residual  charm  quark  contribution,  estimated 
be  -0.75  x  10~4,  for  mc  =  1.4  GeV,  A  =  200  MeV. 
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rhe  experimental  branching  ratio  Bexp(KL  -+  ^+A*~)  =  (7.4  ±0.4)  x  10~9  [10], 
ily  slightly  in  excess  of  the  unitarity  bound  (Eq.  (6)),  and  can  constrain  the 
imeter  ReAt  =  —  ,A2A5(1  —  /?)  appearing  in  the  short-distance  contribution.  At 
ent,  the  uncertainty  in  the  2j  dispersive  rate  B<«*p  limits  the  efficacy  of  this 
iy  in  providing  a  stringent  constraint  on  p  [11]. 

Decay  KL  -*  7r°/+/- 

le  one-photon  exchange  approximation,  OP-symmetry  forbids  the  decay  KI  —+• 
'/-  but  allows  the  decay  KI  ->•  ir°/+/~  [12],  A  CP-conserving  amplitude  for 
-»•  TT°/+/~  is  possible  through  a  2j  intermediate  state  (Fig.  2a).  The  27-induced 
litude  is  proportional  to  mi  if  the  two  photons  have  J  =  0;  however,  dynamical 
els  of  KI  —  >•  7r°77  permit  also  a  J  =  2  component  which  yields  an  amplitude 
^2  —  >•  7r°/+/~  unsuppressed  by  a  factor  m\  [13].  It  follows  that  the  amplitude 
%L  —  +  TT°/+/~  will  contain  three  components 

.  =  <x2A2y  +  aeAi^  +  rj\4Asd,  (11) 

re  the  second  and  third  pieces  denote  amplitudes  associated  with  indirect  and 
:t  (short-distance)  C.P-violation.  The  factors  a2,  ea  and  r?A4  express  the  orders 
agnitude  of  these  components,  and  underscore  the  fact  that  they  are,  a  priori, 
parable  in  size. 

?he  CP-conserving  amplitude  for  KI  —  *  TTO/+/~  involving  a  2j  intermediate 
5  has  the  form  [14] 

(K2(p)  -H, 


is  proportional  to  m/,  and  is  negligible  for  K%  —  +  7r°e+e~.  The  coefficient 
is  an  effective  coupling  constant  appearing  in  the  KI  —  *  7r°77  amplitude, 
metrized  as 


(13) 

e  the  ellipsis  denote  terms  that  are  ineffective  for  KL  —*  7r°e+e~.  An  analysis 
ie  data  on  KL  ->  7r°77  [15]  suggests  Ge//  ~  0.15  x  10~7m^2  [14].  The  form 
>r  FI  has  an  absorptive  part 

A  r2       2       ,\       .,,1.1  + 


e  A  =  -2p  •  (k  -  Jb;),  /?  =  v/l-4m2/s,  5  =  (it  +  Jb')2.  The  corresponding 
conserving  branching  ratio  is 


-^  =  1.7  x  10-.a  -I- 
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sp  being  the  dispersive  part,  estimated  in  [14]  to  be  1.5. 
The  result  (15)  has  to  be  compared  with  the  CP-violating  rate  [16] 


BCpv(KL  ->  *°e+O  =  [|0.76re"'4 


~ 

<r»  (16) 


where  Cy  and  CA  are  coupling  constants  describing  the  short-distance  interaction 
sd  -»•  e+e~  ,  with  numerical  values  Cy  -  -0.6,  CA  =  0.7  for  mt  =  170  GeV.  The 
parameter  r,  characterising  the  indirect  CP-violating  amplitude,  is  defined  as 


For  r)  =  0.4,  the  CP-violating  branching  ratio  BCpv  is  1.2  x  10~11(2.1  x  10"11)  for 
r  —  +!(—!)  .  Models  with  much  larger  and  much  smaller  values  of  r  are  possible. 
These  estimates  suggest  that  the  CP-violating  rate  BCPV  is  somewhat  higher  than 
the  CP-conserving  rate  BCPC  •  A  precise  measurement  of  the  rate  and  spectrum 
of  K%  -+  7r°77  would  help  to  sharpen  the  estimate  of  BCPC,  while  a  measurement 
of  r  is  needed  to  predict  BCPV-  Expectations  for  the  decay  KL  —  »•  7rVV~>  as 
well  as  an  estimate  of  the  CP-violating  /+/~  energy  asymmetry  are  given  in  [14]. 
Present  experimental  limits  (from  the  E799  experiment  at  Fermilab)  are  B(KL  -+ 
7r°e+e-)  <  4.3  x  10'9,  B(KL  -+  vrW)  <  5.1  x  10~9  [17]. 
2.3  Decay  KL  -*  *+Tr~e+e- 

A  study  of  the  photon  spectrum  in  the  decay  KL  -*  ir+x~y  shows  two  clear  compo- 
nents [18]:  (i)  bremsstrahlung  from  the  CP-violating  decay  KL  —  »•  TT+TT",  and  (ii) 
direct  photon  emission  of  magnetic  dipole  nature  from  the  CP-conserving  decay 
KI  —  *  7r+7r~7.  The  simultaneous  presence  of  bremsstrahlung  and  Ml  amplitudes 
implies  that  the  photon  in  the  decay  KL  ->  7r+7r~7  has  a  CP-violating  circular 
polarization.  The  conversion  process  KL  —  »•  7r+7r~e+e~  may  be  viewed  as  a  means 
of  probing  this  polarization,  by  studying  the  correlation  of  the  e+e~  plane  relative 
to  the  7r+7r~  plane. 

An  analysis  based  on  the  amplitude 

A(KL  - 


-k         _  •  k 


was  carried  out  in  [19].  Here  k  -  k+  +  jfe_,  fs  is  the  amplitude  of  /T5  -»•  TT+TT"  ,  and 
gsr  and  <7Mi  are  given  empirically  by  gBr  =  r?+_e''*°(mK),  pjvfi  =  i(0.76)c"1^*),  60,i 
being  the  s~  and  p-wave  TTTT  phase  shifts.  A  significant  CP-violating  asymmetry 
was  found  in  the  ^-distribution  of  the  process,  $  being  the  angle  between  the  e+e~ 
and  7r+7r~  planes: 


A    = 
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«     14%,  (19) 

where  S\  denotes  an  average  p-wave  phase  (~  10°).  This  analysis  was  extended  in 
[20]  to  include  short-distance  CP-violation,  contained  in  the  effective  Hamiltonian 
for  srf  —  *  e+e~.  These  direct  CP-violating  effects  were  found  to  be  very  small 
(<  10~3).  The  branching  ratio  of  KL  -*•  K+ir~e+e~  is  predicted  to  be  3  x  10~7, 
so  that  the  large  asymmetry  given  in  Eq.  (19)  may  well  be  accessible  in  the  next 
round  of  experiments. 

2.4  Decay  K+  -+  *+/+/- 

The  decays  K+  —  *  v+l+l~  are  dominated  by  one-photon  exchange,  and  are  prin- 
cipally of  interest  as  tests  of  the  Kirj*  vertex.  In  chiral  pertubation  theory,  this 
vertex  is  calculable  in  terms  of  the  effective  Lagrangian  £e//(7r,  K,  7),  and  the  ma- 
trix element  has  the  form  [21] 


A(K+(k)  ->  TT(P)  +  /+  +  r  ) 

(20) 


where  GS  =  Gr/V^VudV*3g8,  g&  =  5.1.  The  function  C+(z)  is  known,  up  to  an 
additive  constant  w+.  In  terms  of  this  constant,  the  branching  ratios  are  calculated 
to  be 

B(K+  -»7r+fi+e-)     =     (3.15  -  21.  lw+  +  36.1u>J)  x  10~8, 
B(K+  -*  *+n+n   }     =     (3.93  -  32.7u;+  +  70.5u;J.)  x  lO'9.  (21) 

The  recent  measurement  [22]  B(K+  ->  Tr+e+e~)  =  (2.99  ±  0.22)  x  10~7  implies 
w+  =  0.8918;  JS,  and  leads  to  the  prediction  B(K+  -*  7r+/i+/i-)  =  3.07  x  10~8. 

Refinements  to  the  matrix  element  (20)  occur  if  one  takes  into  account  the  short- 
distance  interaction  sd  —  »•  J+/~  [23].  One  interesting  aspect  of  these  corrections  is 
the  addition  to  the  matrix  element  (20)  of  a  parity-violating  term  of  the  form 


c(k-Py] 


(22) 
The  short-distance  interaction  gives  B  =  /+£  ,  C  =  /-£  ,  where 


e  =  -1.4  x  10-4  -  A*\4(l  -  p  -  ir,),  (23) 

2irsm  &w 

/+,  /_  being  the  two  form  factors  of  K13  decay  (/+  w  1,  /_  «  0).  Interference  with 
the  leading  one-photon  exchange  amplitude  (20)  gives  rise  to  a  parity-violating 
longitudinal  polarization  of  the  ^+  [23] 

(24) 
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(20)  (which  we  write  as  Fyu(tf  +  ty)v)  produces  a  T-odd,  P-odd  term  in  the  decay 
rate,  of  the  form  lm(FvF^)(s+  x  sL)  •  p,  where  s+  and  £_  are  spin  vectors  of  the 
H+  and  f*~  in  the  K  rest  frame,  and  p  is  the  /*+  momentum.  Such  a  term  can  probe 
the  CP-violating  parameter  TJ  of  the  short-distance  interaction.  Detection  of  this 
term,  however,  requires  measurement  of  a  correlation  between  the  spins  of  both  ^/+ 
and  /4~,  a  difficult  task. 

An  important  challenge  for  the  low  energy  effective  Lagrangian  £efj(K,  7r,7) 
is  to  produce  a  reliable  prediction  for  the  decay  K\  —*  v°l+l~ .  In  chiral  pertu- 
bation  theory,  the  matrix  element  is  determined  up  to  an  unknown  constant  w$ 
[21].  As  discussed  in  Section  2.2,  information  on  this  decay  mode  is  important  for 
determining  the  magnitude  of  indirect  <7P-violation  in  KL  —>  *0 l+l~  - 

3.  Neutral  current  decays  into  vv  pairs 

3.1  Decay  K+  -*  r+vv 

The  decay  K+  — >  it+vv  is  a  short-distance  dominated  reaction,  determined  by  the 
box  and  penguin  graphs  shown  in  Fig.  3a.  The  branching  ratio  is  predicted  to  be 
[25] 

nf  tf+       ,   _4- ,.  .-7  \  c  o  ..  \r\— 5|  o  (  if-\-          _  +,.  ,-:\i\v/'/-M2  /'OK^ 

jD(A       — >•  7T    V\V\)  —  O.y  X  1U       IJTO^A       — >•  7T    V\V\)  +  A|A  (X< )\    ,  (^) 

where 

-2.5  xlO~4     for        i/e,i/^  ,0^ 

-,  «        -iA-4      r  l^"J 

-1. 7  x  10  *     for        i/r  v     ' 


and 

Xt 


]n 

ln 


Summing  over  all  three  neutrino  flavours,  one  obtains  a  typical  value  B(K+  —  * 
TT+  vv]  =  1.3  x  10~10  for  p  =  0,  T?  =  0.4,  with  a  possible  range  (0.5  -  5)  x  10~10  for 
the  presently  allowed  domain  of  (p,  77).  The  present  experimental  limit  is  5.2  x  10~9 
(ACS  E787)  [26]  and  a  sensitivity  of  10~10/event  is  within  reach. 

The  long-distance  contributions  to  K+  —  »•  x+i>v,  symbolised  by  the  diagrams 
in  Fig.  3b,  were  calculated  in  [27],  In  particular,  the  hadronic  contribution  to  the 
K+TT+Z  vertex  was  obtained  using  current  algebra  arguments.  It  was  concluded 
that  these  effects  are  three  orders  of  magnitude  smaller  than  the  short-distance 
contribution.  A  more  recent  calculation  [28],  using  chiral  pertubation  theory,  has 
found  a  very  similar  result.  (For  additional  remarks,  see  [29]). 

3.2  Decay  KL  —*  ic^vv 

Finally,  an  example  of  a  short-distance  dominated  process,  which  is  at  the  same 
time  purely  CP-violating,  is  the  decay  K?  —  >•  TT°I/P  [30].  Its  branching  ratio  is  given 
by  [31] 
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While  experimentally  remote  (the  present  limit  is  B(Ki  —»•  VQVV)  <  5.8  x  10~5 
[32]),  this  reaction  is  an  interesting  example  of  a  process  that  directly  measures  the 
CP-violating  paramter  77,  with  essentially  no  hadronic  uncertainties. 

4.  Radiative  Decays 

The  radiative  decays  of  the  K  mesons,  such  as  KL,S  —  f  77,  KL  —  *  T°77,  KL  —  »• 
7r+7r~7,  /f+  —  t-  7r+7r°7,  #+  -*•  ^+77,  KL  -+  7r°7r°77  ,#£,,$  -»•  7r°7r°7  etc.  are  a 
source  of  abundant  grist  for  models,  such  as  chiral  pertubation  theory  [33],  that 
attempt  to  describe  the  low  energy  interactions  of  pions,  kaons  and  photons.  The 
interplay  of  chiral  symmetry,  CP-symmetry  and  gauge  invariance,  combined  with 
the  weak  nonleptonic  A/  =  1/2  rule,  produces  interesting  patterns  and  hierarchies 
amongst  the  various  channels.  We  will  limit  our  remarks  here  to  two  reactions, 

KL,S     -»     7r°7r°7, 

KL,s     -     37  (29) 

that  have  the  piquant  feature  of  being  quadrupole   transitions. 
4.1  Decay  KL,s  -"  7r°7r°7 

Gauge  invariance  implies  that  the  7r°7r°  system  in  KL,S  —  *  T07r°7  cannot  have  J  =  0 
(since  that  would  amount  to  a  0  —  »•  0  radiative  transition).  Bose  statistics  implies 
that  the  7r°7r°  pair  cannot  have  J  =  1  (since  such  a  state  would  not  be  symmetric 
under  exchange  of  the  two  TT°'S).  It  follows  that  the  7r°7r°  state  has  at  least  two 
units  of  angular  momentum,  and  the  associated  photon  corresponds  to  quadrupole 
radiation  [34]. 

<7P-invariance  implies  that  the  decays  KL  —  *•  7r°7r°7  and  KS  —  »•  7r°7r°7  are  El 
and  M2  transitions,  respectively,  with  matrix  elements 

A(KL  -*  irVK 


A(KS  -  *Q(p^Q(pMk}}    =  -^c.FpM.  (30) 


m 


K 


A  qualitative  estimate  of  the  decay  rates  may  be  obtained  by  making  a  comparison 
with  the  measured  Ml  transition  KL  —nr+ir~j  [16],  parametrized  as 

A(KL  -*  * 


m*; 


Assuming  that  the  dimensionless  couplings  gM\,  SMI  and  QEI  are  similar  in  mag- 
nitude, and  that  the  reaction  "radius"  I/A  is  of  order  l/mp,  we  obtain  [34] 

B(KL  -+  7r°T°7)     w     1.0  xl(T8, 

-*  »V7)     w     1.7x10-".  (32) 


pair   process   I\L    —  r  n    n    e    c,       c<jum  suuw    twu  wjiujjuucuts,   <_»uc  «»enj^ia.tcu    witu 

KL  —  *  K0ir°7  (£2;  0(p6)),  which  is  strongly  peaked  at  low  e+e~  masses,  and  one 
associated  with  KL  -*  7r°7r°7*  (O(p4))  which  shows  up  as  a  broad  continuum  [34]. 

4.2  Decay  K°  -+  Three  Photons 

Given  that  B(tf£  _+  27)  =  5.7  x  HT4,  B(KS  -*  2T)  =  2.4  x  1(T6  [36]  it  is 


interesting  to  ask  what  one  expects  for  the  decays  KL,S  —*  37  [37]. 

First  of  all,  it  should  be  noted  that  both  KL  —*•  37  and  AS  —»  37  are  possible 
without  violating  CP  or  any  other  general  symmetry  principle.  Gauge  invariance 
dictates  that  no  pair  of  photons  in  these  channels  can  have  J  =  0,  while  Bose 
statistics  forbids  any  pair  from  having  J  =  1  (Yang's  theorem).  It  follows  that 
every  pair  of  photons  in  these  decays  must  have  at  least  two  units  of  angular 
momentum.  A  simple  model  that  relates  the  decays  KL,S  —*  37  to  the  other 
quadrupole  transition  KL,S  ~  *  7i"°7r°7  discussed  above  yields  [37] 

-+3y)     =    3xl(T19, 

-+37)     =     5xl(T21.  (33) 


Thus  the  37  decay  mode  is  suppressed  relative  to  the  27  mode  by  15  orders  of 
magnitude  -  a  remarkable  reminder  of  the  power  of  Bose  statistics  in  this  year  of 
the  Bose  centenary! 
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Figure  1.  Diagrams  relevant  for  the  decays  (a)  K°  ->  K+e~ff 
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Figure  2.  (a)  Two-photon  contribution  to  KL  ->  /+/~  and 
describing  the  short-distance  interaction  sd  -*  /"*"/". 
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Testing  electroweak  vector  boson  self  interactions 

R.  SINHA 

Institute  of  Mathematical  Sciences,  CIT  Campus,Taramam,  Madras,  600  113. 

Abstract.  We  review  anomalous  triple  electroweak  vector  boson  (TEVB)  couplings. 
The  anomalous  vertex  is  written  in  a  model  independent  way  and  related  to  the  moments 
of  the  W  boson.  Loop-induced  bounds  on  these  couplings  are  derived,  including  those 
obtained  from  the  oblique  parameters  S,T,U.  We  show  that  these  bounds  from  oblique 
parameters  in  particular,  cannot  rule  out  anomalous  couplings  even  if  the  oblique  param- 
eters are  in  complete  agreement  with  SM.  The  importance  of  testing  the  TEVB  vertex 
beyond  LEPI  is  thereby  demonstrated.  We  examine  direct  bounds  obtainable  from  the 
W-radiative  and  Wf  production  at  CDF  and  DO  collaborations.  However,  all  the  bounds 
on  TEVB  to  date  are  too  weak  to  limit  most  models.  It  is  pointed  out  that  future  colliders 
will  be  needed  to  probe  loop  level  SM  corrections  to  the  TEVB  vertex,  as  well  as  examine 
extensions  beyond  the  SM.  A  detailed  study  of  the  process  e+e~  — *  l*C~vv'  is  carried 
out  with  numerical  results  pressented  for  the  "Next  Linear  Collider"  (NLC)  proposed  to 
run  at  ^/s  =  500GeV,  iTeV.  We  find  that  the  TEVB  couplings  can  be  studied  accurately 
enough  to  probe  regions  close  the  loop  level  corrections  within  the  SM,  and  also  allow  for 
a  possibility  of  separating  the  photon  coupling  from  that  of  the  Z  using  polarized  beams 
at  the  NLC. 


1.  Introduction 

The  SU(2)L  <8>  U(\}y  theory  of  electroweak  interactions,  which  is  a  part  of  the 
standard  model  (SM),  has  had  dramatic  confirmations  in  the  last  decade.  Exper- 
iments at  LEP  are  currently  able  to  measure  the  mass  of  the  Z  and  its  couplings 
to  fermions  at  a  less  than  1%  level  and  they  agree  with  the  theory  considered  up 
to  one  loop.  This  success  of  the  SM  has  generated  a  feeling  that  even  those  of  its 
parameters  that  have  not  been  tested  directly  are  likely  to  be  in  good  agreement 
with  observation.  While  present  precision  measurements  do  constrain  new  physics 
vis-a-vis  the  parameters  of  the  SM,  such  constraints  need  not  be  as  restrictive  in 
some  sectors  as  in  others.  In  fact,  the  gauge  boson  couplings  of  the  standard  model 
of  electroweak  interactions  are  only  just  beginning  to  be  directly  measured.  There 
has  now  been  observation  of  the  process  pp  -+  evjX,  presumably  representing  Wj 
production  and  radiative  W  decay,  at  the  Collider  Detector  at  Fermilab  (CDF), 
the  DO  collaboration  at  Fermilab  [1,  2,  3]  and  at  UA2  at  CERN.[4]  Recently  a  lot 
of  work  has  been  devoted  to  a  global  analysis  of  low  energy  data  and  LEP  data,  in 
order  to  extract  bounds  on  the  gauge  boson  couplings. [5].  Extensive  studies[6]  have 
also  been  undertaken  to  determine  the  possibilities  of  detecting  such  deviations  at 
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2.  The  general  triple  electroweak  vector  boson  vertex 

The  vector  boson  self  interactions  are  uniquely  fixed  by  the  gauge  structure  of  the 
SM.  However,  new  physics  as  well  as  higher  order  corrections  will  modify  these  self 
interactions.  The  couplings  of  W  bosons  to  the  photon  and  Z  can  be  described 
by  an  effective  Lagrangian,  the  most  general  form  of  which  may  be  written  down 
with  the  minimum  requirements-  that  of  Lorentz  invariance,  global  577(2)  and  local 
U(l)  symmetry  [7,  8]  as: 


'a  a  oa      (1) 


In  the  above  equation  V  represents  the  neutral  gauge  bosons  either  the  photon 
or  the  Z,  Vap  =  dQV0  -  dpVa,  Wap  =  daWp  -  dpWa  and  gv  is  the  WWV  cou- 
pling strength  in  the  SM  with  <77  =  e  and  gz  =  ec/s,  where  c2  =  1  —  s2  = 
MW/MZ-  The  above  Lagrangian  consists  of  five  dimension  4  terms  and  2  dimen- 
sion 6  terms  and  has  the  corresponding  seven  parameters  or  couplings  character- 
izing the  couplings  of  the  W  to  each  of  neutral  gauge  bosons.  These  seven  op- 
erators exhaust  all  possible  Lorentz  structures  if  it  is  assumed  that  d^W1*  =  0 
and  dpV1*  =  0.  These  conditions  are  automatic  if  the  bosons  are  coupled  to 
light  fermions.  Higher  dimension  operators  can  be  added  by  the  replacement 
Vp  —  >  nnV^.  However,  contributions  from  these  operators  provides  only  the  mo- 
mentum dependence  to  the  couplings.  The  SM  values  for  these  couplings  are 
gZ  -  Kj  =  KZ  =  1,  and  A7,  Az,«7,«2r,  A7,  *z,  g~[,  9%  ,g1,gj  are  all  zero.  Elec- 
tromagnetic gauge  invariance  fixes  gl  to  be  unity  and  g"[  and  gl  to  be  zero.  The 
other  couplings  gf  ,  g%  ,  gj  ,  «7,  KZ,  A7,  \z,  KJ,  %z,  ^7,  ^z  have  to  be  determined  ex- 
perimentally. Considering  that  a  large  number  of  parameters  have  to  be  determined 
experimentally  some  assumptions  have  often  been  made  to  reduce  this  set  by  mak- 
ing some  assumptions.  The  couplings  gj  and  gj  violate  custodial  SU(2)  and  hence 
can  be  assumed  to  be  small  or  zero.  The  number  of  extra  parameters  is  further 
reduced  by  taking  (jrf  ,  the  weak  neutral  charge  of  the  W,  equal  to  unity. 

The  parameters  /c7)z  and  A7)z  are  unique  as  they  can  be  related  to  the  multipole 
moments  of  the  W  -  which  can  be  defined  with  respect  to  both  neutral  bosons 
as  follows 


+  «  + 


Qw       =       -TF5-(«-A) 
Mw 


-(K  +  A) 
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Qw  =- 

(2) 

where  fj,w  is  the  magnetic  dipole,  Qw  is  the  electric  quadrupole  moment,  dw  is 
the  electric  dipole  and  Qw  is  the  magnetic  quadrupole  of  the  W  boson.  Tree  level 
unitarity  will  restrict  the  WWF  couplings  to  their  SM  values  at  asymptotically 
high  energies.  Any  deviation  from  the  standard  model  is  thus  described  by  a  form 
factor  and  depends  on  a  scale  for  new  physics  'A'.  It  is  customarily  assumed  that 
these  form  factors  are  constant  for  \/*  <C  A  and  start  decreasing  when  A  is  reached 
or  surpassed,  in  analogy  with  the  nucleon  form  factors.  These  form  factors  will 
therefore  act  as  regulators  for  loop  corrections. 

3.  Anomalous  couplings  in  various  models 

If  the  W  bosons  are  composite  objects,  then  deviation  of  the  triple  gauge  boson 
coupling  parameters  from  their  standard  model  values  could  be  very  large  indeed; 
as  an  example,  «  has  been  calculated  to  be  greater  than  three  in  one  model[9]. 
However,  within  the  standard  model,  upper  bounds  on  the  one  loop  corrections  to 
the  tree  level  values  of  /c7  and  A7  are  expected  to  be  [10]  A«7  =  1.7  x  10~2  and 
AA7  =  0.25  x  10~2  for  a  top  quark  of  about  170GeV.  In  extensions  of  the  standard 
model  such  as  those  containing  extra  Higgs  doublets,  extra  heavy  fermions  [11], 
the  deviations  from  the  tree  level  standard  model  values  tend  to  be  of  about  the 
same  order  of  magnitude  as  these  one  loop  corrections.  For  the  minimal  super- 
symmetric  SM,  due  to  cancellations,  the  values  expected  are  slightly  below  the  SM 
ones. Typically,  in  most  renormalizable  models,  Kyiz  are  expected  to  vary  by  only 
less  than  5%  of  the  SM  value.  Variations  in  \ytz  are  generally  (although  not  al- 
ways) further  suppressed  by  an  order  of  magnitude.  Hence,  variations  of,  or  limits 
on  K"ysz  alone  are  considered  usually  in  literature. 

4.  Loop-induced  constraints  on  anomalous  couplings 

Much  work  has  gone  into  constraining  the  anomalous  couplings  in  eqn(l),in  partic- 
ular by  loop  induced  processes  at  low  energies.  However,  questions  have  been  raised 
on  the  validity  of  these  bounds.  In  particular  the  validity  of  the  Lagrangian  itself 
has  been  questioned.  The  Lagrangian  is  not  explicitly  gauge  invariant,  and  hence  it 
has  been  claimed[12]  that  it  is  not  an  valid  extension  of  the  SM.  However,  Burgess 
and  London  [13]  have  argued  by  an  explicit  proof  that  -  any  Lagrangian  that  con- 
tains W's  and  Z's  which  satisfies  Lorentz  invariance  and  U(I}em  gauge  invariance  is 
automatically  S£/i,(2)  x  C/y(l)  gauge  invariant-the  gauge  invariance  being  realized 
nonlinearly.  Burgess  and  London,  however,  agree  that  most  bounds  on  these  cou- 
plings are  over  estimated.  The  real  culprit  they  point  out  is  not  gauge  invariance 
but  the  improper  use  of  cutoff  in  estimating  the  size  of  the  loop  diagrams.  They 
point  out  that  the  cutoff  dependence  in  low  energy  theory  does  not,  in  general,  give 
an  accurate  indication  of  the  true  dependence  on  the  heavy  physics,  although  it  can 
do  so  for  a  logarithmic  divergence.  The  constraints  on  the  anomalous  couplings  are 
therefore  considerably  weakened.  Some  of  the  valid  bounds  -depending  at  most  on 


the  log  of  the  cutoff  scale  A  -  are  considered  here.  Bounds  from  tree  level  unitarity 
are  based  on  the  assumption  of  a  dipole  form  factor  and  provide  the  limits[14] 

2  - 

and  |A,A|  <  4 


Understandably  these  limits  are  too  weak  to  bound  any  model.  The  muon  magnetic 
moment  is  used  set  the  bound  [15] 

<3.7 


'7        '      M2, 

However,  it  has  been  argued  [16]  that  contributions  from  direct  terms  such  as 
JiVfivpF*11'  which  are  needed  to  cancel  divergence  weaken  these  limits.  Analogously 
the  neutron  dipole  moment  is  also  used  to  limit  the  CP  violating  couplings  giving[17] 
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5.  Bounds  from  oblique  parameters  on  anomalous  couplings 

New  physics  beyond  the  SM  can  be  usefully  constrained  in  terms  of  the  "oblique" 
electroweak  correction  [18,  19]  parameters  5,T,  and  U  [21]  (  or  Afi,Af2,  or  Acs 
).  These  are  linearly  related  to  Rww(q2),  naj(g2)  (where  a,b  —  -y,Z),  which  are 
the  contributions  from  new  physics  to  the  electroweak  vector  boson  self-energies  as 
follows: 


<*M2S    = 


where  SE(q2)  =  II(g2)  —  11(0).  In  order  to  constrain  the  WWV  couplings,  we  relate 
them  to  these  oblique  parameters  and  use  the  known  constraints  on  the  latter. 
The  approach  presented  here  is  from  Choudhury,  Roy  and  Sinha([22]).  Analysis 
of  the  type  discussed  here  have  also  been  done  by  Hagiwara  ei  al.  ([23])  and 
Burgess  tt  al.  ([24]),  with  similar  conclusions.  Hagiwara  et  al,  deal  with  a 
linearly  realized  Lagrangian  whereas  we  and  Burgess  et  al.  deal  with  a  nonlinearly 
realised  Lagrangian.  The  major  difference  between  our  analysis  and  that  of  Burgess 
et  al.  is  that  they  include  vertex  corrections  ignored  by  us  and  include  more  than 
the  three  oblique  parameters.  However,  Burgess  ei  al.  ignore  «2,A2  and  K\  terms 
which  have  been  included  by  us. 

If  the_  anomalous  vector  boson  couplings  constitute  the  sole  source  of  new 
physics,  T  and  U  will  measure  the  weak  isospin  breaking  induced  by  them,  while  5 
arises  from  their  5J7(2)jr  <g>  t/(l)y  breaking  aspect  through  the  involvement  of  the 


longitudnal  vector  boson  modes.  One  can  work  out  the  divergent  contributions  to 
the  Il-functions  from  the  nonstandard  W  WV  vertex  by  using  (1)  and  dimensional 
regularization.  Defining  U(q2)  to  be  the  coefficient  of  the  usual  (2/c  -  TE)[13]  term 
in  II($2),  we  compute 


where  r  =  q2/M^  and  r)a>i,  s  1  —  K0ij.  For  compactness,  we  introduce  the  notation: 

d     =     36c2-4-c-2,  £2     ==     24c2-4, 

C3     =     2-c-2,  C4    = 


With  T,  5  and  ?7  defined  analogously  in  terms  of  the  II  's  and  denoting  (F)  = 
*2F7  +  <?FZ  for  any  P,  we  have 


+  (A»  -  A7)  {C2(3  +  fo»  +  (A}C4}](5) 
Since  IIaj(0)  =  0,  only  EWW  contributes  to  T  and  we  can  write 

lW(<?2)  =  aM2,   f  -  ^(r)   ,  (6) 


{3  -I-  ^}  +  2  (6  +  2r  -  r2)  (A2)  +  4  (3  +  3<T  2  +  r)  A2  . 
Consequently, 

V  =  -        ^(l)  +          3  [2Ci  {^}  +  C3(tf  +  2C2(A)  (3  +  fo»  +  C4{A}2]  .         (9) 


Since  T  depends  only  on  Iljyiv(O)  and  Hzz(ty  and  not  on  their  q2  variations  it  is 
unaffected  by  dimension  6  operators  and  hence  is  independent  of  A7  and  \z  •  Also, 
terms  in  S  are  proportional  to  either  rjz  —  »?7  or  A^  —  Ay  as  they  should,  since  S 
originates  from  the  mixing  between  weak  hypercharge  (Y)  and  the  third  component 
of  weak  isospin  -  the  WWY  vertex  being  linear  in  these  differences. 

It  should  be  noted  that,  unlike  in  the  SM  and  its  straightforward  extensions, 
the  oblique  parameters  are  not  finite  quantities  here.  This  is  a  consequence  of 
the  presence  of  nonrenormalizable  terms  in  the  Lagrangian  of  (1).  In  a  cut-off 
dependent  regularization  scheme,  this  fact  would  manifest  itself  through  a  non- 
trivial  functional  dependence  on  the  cut-off  scale  [23].  As  a  matching  condition 
between  two  effective  theories  [25],  we  identify  ft  =  A,  the  scale  at  which  new 
physics  becomes  manifest  (assumed  to  be  ~  ITeF).  Using  the  MS  scheme  of 
renormalization,  we  are  lead  to 

+  51n-  (10) 


translated  into  5,  T  and  U . 

We  use  [26]  5  =  -0.31  ±  0.49,  f  =  -0.12  ±  0.34  and  U  =  -0.11  ±  0.92,  though 
our  results  are  insensitive  to  the  central  values.  As  pointed  out,  T  depends  only 
on  fy  and  177,  being  expressed  as  a  sum  of  quadratic  functions  of  theirs.  Thus  the 
measurement  of  T  alone  constrains  these  parameters  to  lie  on  an  elliptic  band  in 
the  rj<y  —  rjz  plane,  the  width  of  the  band  being  given  by  the  errors  on  Mw/Mz  and 
T.  S  and  U  then  reduce  the  allowed  region  into  smaller  part  of  the  elliptic  band. 
Since  5  is  proportional  to  the  differences  of  7-  and  Z-couplings,  constraints  on  it 
generally  (though  not  always)  tend  to  make  those  converge.  Rather  unexpectedly, 
U  play^s  a  significant  role  in  constraining  these  anomalous  couplings.  In  conjunction 
with  S  and  T,  it  serves  to  exclude  a  large  part  of  the  parameter  space. 

We  obtain  the  (95%  C.L.)  bounds  -6.1^<4.1,  -6.0<A7<4.5,  -2.0<^<0.3 
and  -4.5  £  Xz  £  1.9,  earlier  UA2  analysis  [4]  had  yielded  -8.4  £  t^  £  12.1  (for 
arbitrary  A7)  and  —  8.5^$  A7  ^6.5  (for  arbitrary  rj-y).  The  allowed  regions  represent 
solutions  to  polynomial  equations,  which  are  curves  that  thicken  into  bands  on 
account  of  experimental  error  bars  on  the  coefficients.  As  the  errors  shrink  to  zero, 
the  allowed  parameter  space  collapses  into  the  curves  still  permitting  wide  ranges 
of  values  for  the  anomalous  couplings.  This  observation  holds  even  for  the  SM 
point  viz.  S  =  T  —  U  =  0,  being  a  consequence  of  cancellations  between  various 
contributions  to  the  H-functions. 

The  bounds  that  we  have  obtained  above  are  still  large  as  compared  to  the 
ratio  t>2/A2,  where  v  is  the  electroweak  VEV  [v  =  IMw /(e  «)].  Arguments  based 
on  Naive  Dimensional  Analysis  [27]  suggest,  for  instance,  that  KV  —  1  ~  u2/A2. 
However,  any  approximate  equality  KV  —  1  ~  v2/A2  is  only  a  matter  of  conjecture. 
This  makes  phenomenological  investigations  of  these  bounds,  as  performed  here, 
rather  important. 

6.  Direct  bounds  on  anomalous  couplings 

The  bounds  mentioned  above  are  to  some  extent  model  dependent,  in  the  sense 
that  they  are  obtained  by  loop  level  processes,  where  'other  new  physics'  can  also 
contribute.  However,  there  has  now  been  the  observation  of  pp  — >•  evyX,  presum- 
ably representing  Wj  production  and  radiative  W  decay,  at  the  Collider  Detec- 
tor Facility  (CDF)  [1]  at  the  Tevatron  (•/«  =1.8  TeV)  at  Fermilab  and  at  UA2 
(^/s  =  630  GeV)  at  CERN  [4].  Such  measurement  allows  for  the  possibility  of 
directly  testing  the  gauge  boson  couplings  by  tree  level  processes.  The  best  way  to 
detect  possible  nonstandard  gauge  boson  couplings  is  to  make  use  of  the  radiation 
amplitude  zero[28]  which  occurs  in  the  angular  distribution  of  du  — >  Wj  as  well 
as  in  W  — +  d«7.The  presence  of  the  zero  will  be  explicitly  borne  out  in  equations 
(12)  and  (14)  below  by  the  factors  Z  contained  therein.  The  dip  which  persists  in 
pp  —*•  WjX,  is  very  sensitive  to  deviations  from  the  standard  model  gauge  boson 
couplings. 

Preliminary  results  from  CDF[1]  quote  a  limited  number  of  events  (6  events  cor- 
responding to  an  integrated  luminosity  (  £)  of  4.3pb-1)  [29]  UA2  has  also  reported 
their  final  results  and  quote  10  events  with  an  integrated  luminosity  of 
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uin-.>^oo  \ju*,a,iu  tuc  mail  uirc^t  uuuuua|_ouj  un  n,,  u<_mi  tuc  uutcu  uuuiuer  oi  w  7  events, 
as  well  as  the  number  of  radiative  W  decays,  W+  — »•  e+i/7  and  VF~  — *  t~v^. 
The  formula  for  the  cross  section  of  Wj  production  is  given  by 

a(PP     _,     W-iX)  =  \  V;    /  f  dxAdxB(Pf(xA)P^(xB) 

i=ds  J    ^ 

where, 

*  v2  JPhaseSpace 

2 ^         ^->?^^JI_+        *?         r^  +  (p  +  62^^_j\       ^12^ 


and  where  S  =  (J?i  +  Jfe2)2,<  =  (P-fci)2,«  =  (P  -  Jfc2)2  with  «+£+«  =  Afjr,  and 
Vij  is  the  KM  matrix  element.  Z  =  (Q,-  +  TT^")  is  the  zero  factor,  Qt-  is  the  electric 
charge  of  the  quark  9,-,  and  77  =  «  —  1.  For  this  section  the  subscript  7  is  supressed, 
kappa  is  meant  to  imply  Ky. 

For  the  radiative  W  decay  case,  one  obtains  the  result 

a(PP  _>  e~pTx)  =  1  Y,    I  I  dxAdxB6(sxAxB  -  M*,)  \Ptp(xA)p£(xB) 
i=d,»  J   J 

*(g,-(*i)gT,(*2)  -.  ei/T)  +  P?(xA}Pf(xB}a(qi(k'1}^(k^  -+  e~^}\  .        (13) 
where  we  obtain  <r,  using  the  zero-width  approximation  as, 


ei>)  I  [Z'2A  -  Z'rjB  +  r/2C]  (14) 

»'P/iase5'pace 


where  -Z'  =  (^i  _  ^^)  is  the  zero  factor,  and 
4 


w 


fe.fi  ,oll     ,  4(P2.fe)(P2.fci)       4(Pi.fci)(fi.fe) 
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2  L 


(15) 

To  obtain  the  total  cross  sections  we  perform  the  integrations  over  the  parton 
fractions  and  over  the  phase  space,  using  Vegas  Monte  Carlo  routines  and  EHLQ[31] 
parton  distribution  functions.  The  following  cuts  were  also  imposed,  to  agree  with 
the  cuts  imposed  by  CDF:  (1)  the  transverse  photon  energy  ET-I  <  5  GeV;  (2) 
the  photon  pseudo-rapidity  |r/7|  <  3.0;  (3)  the  electron-photon  angular  separation 
0.3 


With  these  cuts,  the  Wj  production  (W+  -f  W~)  cross  section  is  computed  to 
be 

=  [32.8  -  0.600*7  +  l-42r72]p&.  (16) 


To  calculate  the  number  of  pp  —  *•  WfX  —  »  evjX  events  to  be  expected  at  CDF  for 
any  given  77,  we  use  the  the  electron  and  photon  acceptances  and  their  detection 
efficiencies  to  be  Ae  =  0.41,£e  =  Q.67,.4-,.  =  0.54  and  £y  =  0.50  respectively,  as 
given  by  CDF[1].  Also  including  the  Drell-Yan  correction  factor  KDY  —  1-3,  we 
get  n  =  <rW-,£B(W  -+  ev}KDYARSeA^.  Using  B(W  -*  ev)  -  0.11  we  get 
n  =  1.49  -  0.02737;  -f  0.0647r/2.  Of  the  total  of  six  events  that  are  reported  by 
CDF,three  events  have  a  large  T\$  ~  2.4  —  2.6  whereas  the  other  three  have  77^,  ~  0.8 
thus  they  can  be  clearly  classified  as  W-y  and  W  radiative  decays.  Assuming  3  W~f 
events  we  get  a  loose  bound 

-8.6  <  K  <  11.0(95%OL).  (17) 

The  cross  section  for  pp  —  >•  WX  —  >•  evyX  is 

-  (18.8  -  0.0758r/  -f  0.108r;2)p6.  (18) 


Using  the  same  cuts,  acceptances,  and  efficiencies  as  before  we  obtain  the  quadratic 
equation  in  77  for  the  number  n  of  pp  —  f  WX  —>•  evyX  events:  n  =  7.79  -f  0.0315*7  + 
0.0447rj2.  For  3  events  this  gives  a  bound 

-2.4  <  K  <  3.7(96%CL).  (19) 

If  the  events  are  not  separated  into  decay  and  radiative  ones,  the  total  number 
of  events  is  n  =  9.28  -f  0.0042r7  +  O.lOOr?2,  which  gives  -3.9  <  K  <  4.8  (95%  CL). 
UA2  have  also  set  a  bound  on  K  without  separating  the  events  to  be  —5.4  <  K  <  7.9. 

The  very  loose  bounds  that  one  gets  in  PP  —*•  WX  —  >•  evyX  case  compared 
to  the  PP  —  +•  W~fX  —  »•  evjX  considered  earlier  can  be  traced  directly  to  the 
differences  in  the  production  mechanisms  for  the  W  in  the  two  cases,  and  to  the 
effect  of  the  cuts  imposed  on  the  SM  term  relative  to  the  non-SM  term  in  the  two 
cases.  Note  that  in  Eq.(8),  there  is  a  S  function  involving  parton  fractions  x^,  XB 
which  is  not  there  in  Eq.(l),  whose  effect  is  to  reduce  the  cross  section  obtained 
from  Eq.(8)  relative  to  that  obtained  from  Eq.(l).  The  effect  of  the  cut  imposed  on 
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when  it  comes  to  limiting  most  models.  Direct  studies  of  the  TEVB  vertices  will 
be  possible  at  Fermilab,  and  upcoming  LEP  II  or  proposed  future  collider  like  77 
collider  or  the  NLC.  New  machines  will  be  necessary  to  probe  regions  much  closer 
to  the  SM  prediction. 

7.  Testing  TEVB  at  future  colliders 

With  the  Next  Linear  Collider  (NLC)  in  mind  we  have  investigated[33]  a  set  of  pro- 
cesses of  four  lepton  production  in  e+e~  collisions  with  respect  to  their  sensitivity 
to  gauge  boson  coupling  parameters.  The  processes  are  all  of  the  general  form 


e+e~  -+  rt'-vv'.  (20) 

we  include  all  possible  charged  lepton  combinations,  specifically  these  are  pr,  pe 
(re),  \i\i  (?"7")>  and  ee.  The  channels  given  in  brackets  have  the  same  set  of  Feyn- 
man  diagram  contributions  as  their  corresponding  unbracketed  channel  and  we  will 
henceforth  drop  reference  to  them  as  distinct  processes. 

For  all  our  processes,  we  do  include  the  full  gauge  invariant  set  of  diagrams, 
the  n+r~  state  can  be  produced  via  9  different  diagrams;  the  /i+e"~  state  receives 
contributions  from  18  diagrams.  The  \L\JL  process  has  a  total  of  28  contributing 
diagrams  with  most  of  the  extras  being  7  or  Z  'bremsstrahlung'  from  the  initial  or 
final  state  leptons.  The  ee  process  goes  via  56  diagrams..  For  the  /*//  and  ee  final 
states,  in  some  of  the  diagrams,  all  v  species  can  appear.  These  diagrams  are  added 
incoherently  in  the  calculation.  However,  for  the  purpose  of  counting  the  number 
of  diagrams,  we  regard  all  the  v  final  states  as  contributing  to  a  single  diagram. 

It  is  important  to  realize  that  one  needs  the  full  calculation  that  we  discuss  here 
as  opposed  to  the  calculations  of  W  pair  or  single  W  production.  At  LEP  II  energies 
it  is  appropriate  to  calculate  W  pair  production,  the  processes  being  dominated 
by  the  nearly  on-shell  W  propagators.  This  is  not  the  case,  however,  at  higher 
energies  unless  one  is  dealing  with  a  final  state  which  allows  for  the  experimental 
reconstruction  of  Ws.  We  are  considering  here  the  purely  leptonic  final  states.  For 
this  case,  the  Ws  can  be  reconstructed  (up  to  a  two-fold  ambiguity)  only  under 
the  assumption  that  they  are  on-shell.  [8]  Neither  W  pair  nor  single  W  production 
can  be  isolated  by  experimental  cuts  for  the  purely  leptonic  case.  Consequently, 
we  must  fully  calculate  the  i+£'~i/i>f  production,  as  opposed  to  W  pair  production 
only  and,  in  doing  so,  we  aim  to  unearth  a  more  realistic  picture  of  the  sensitivity 
to  the  couplings  in  question. 

In  order  to  deal  easily  with  the  large  number  of  Feynman  diagrams  and  to 
readily  retain  helicity  information,  we  have  written  the  amplitude  for  each  process 
in  the  CALKUL  helicity  formulation  [34].  We  assume  massless  spinors  describe  the 
fermions  although  we  do  retain  fermion  masses  in  the  propagators;  this  amounts  to 
neglecting  terms  proportional  to  m/,  a  good  approximation.  The  matrix  element 
squared  for  each  process  is  embedded  in  a  Monte  Carlo  algorithm  for  integration 
over  the  final  state  four  body  phase  space  to  yield  the  cross  sections  and  various 
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distributions.  We  sum  and  average  over  initial  spins  and  sum  over  final  spins. 
We  use  Mz  =  91. 196  GeV,  Tz  =  2.534  GeV,  Mw  -  80.6  GeV,  Tw  =  2.25  GeV, 
me  -  0.51  IMeV,  m^  -  0.1057  GeV,  mr  =  1.7841  GeV,  and  sin20w  =0.23. 

The  experimental  signature  for  the  processes  under  consideration  is  a  clean  one, 
an  oppositely  charged  lepton  pair  and  missing  transverse  momentum  and  energy 
due  to  the  neutrinos.  We  have  made  some  fairly  simple  cuts  as  described  below  to 
account  for  detector  acceptance  and  potential  backgrounds.  For  all  the  processes, 
we  require  a  cut  on  the  angle  of  each  of  the  charged  leptons  relative  to  the  beam 
such  that  —0.95  <  cos  QI±  <  0.95  .  This  is  the  only  cut  we  impose  for  the  pr  and  pe 
final  states.  This  angular  cut  is  experimentally  motivated  however  it  also  serves  to 
regulate  the  t-channel  photon  poles,  allowing  us  to  neglect  the  terms  proportional 
to  my.  These  diagrams  contribute  to  the  /*+e~  and  e+e~  final  states,  as  described 
above. 

One  potential  background  is  r  pair  production  with  each  of  the  r's  decaying 
leptonically.  At  -<Js  of  200  GeV,  each  of  the  four  lepton  processes  and  the  r  pair 
production,  multiplied  by  the  branching  ratios  of  r  into  e  or  fj,  of  17.8%  each 
[35],  yield  about  the  same  rate.  At  higher  energies,  the  r  pair  production  cross 
section  is  falling  like  1/s  while  the  cross  section  for  our  processes  remains  large. 
In  addition,  the  r  pair  process  should  have  substantially  greater  missing  energy 
with  four  neutrinos  in  the  final  state.  It  seems  that  this  source  of  background  is 
manageable. 

The  four  lepton  processes  with  one  or  more  r's  in  the  final  state  (fir  and  rr) 
could  feed  down  as  a  background  to  the  processes  without  any  r  if  the  r(s)  decays 
leptonically.  However,  factoring  in  the  r  decay  branching  ratio  and  accounting  for 
the  higher  missing  energy  keeps  this  background  under  control.  Another  potential 
background  comes  from  two  photon  processes  with  the  e+  and  e~  undetected  near 
the  beam.  This  is  relevant  to  the  ^/i  and  ee  processes  and  we  make  a  cut  on  missing 
transverse  momentum  to  eliminate  two  photon  events  as  a  background  source;  we 
require  total  visible  pr  >  10  GeV .  We  also  require  for  these  two  processes  that  each 
charged  lepton  carry  a  minimum  energy,  EI  >  10  GeV.  Finally,  again  for  the  pp. 
and  ee  processes  we  make  a  cut  on  the  invariant  mass  of  the  charged  lepton  pair; 
we  require  m<-f^_  >  25  GeV  in  order  to  eliminate  the  low  invariant  mass  dileptons 
corresponding  to  the  photon  pole  in  these  processes. 

We  evaluate  the  cross  sections  as  a  function  of  ^/s  for  the  processes  e+e~  — >• 
t+V'vv'  for  i+l'~  equal  to  n+r~ ,  ^+e~,  ^+/i~,  e+e~,  respectively,  with  the  cuts 
as  described  above  imposed.  The  sensitivity  to  KV  increases  with  increasing  center 
of  mass  energy.  The  pr  process  exhibits  the  most  sensitivity  to  KV,  as  might 
be  expected  since  it  has  the  least  number  of  extraneous  contributing  diagrams; 
however,  it  also  has  the  smallest  cross  section.  Thus,  it  is  useful  to  consider  all  the 
processes. 

We  make  our  study  of  KV  dependence  at  two  center  of  mass  energies,  500  GeV 
and  1  TeV,  motivated  by  the  possibility  of  future  high  energy  e+e~  colliders.  For 
each  of  the  four  types  of  four  lepton  processes,  at  each  of  the  two  energies,  we  vary 
K-y  alone  from  0.9  to  1.1,  KZ  alone  over  the  same  range,  and  /c7  constrained  to  equal 
KZ  over  the  same  range. 

At  >/s  of  500  GeV,  each  process  is  more  sensitive  to  deviations  of  KV  below 
the  standard  model  value  of  1  than  above  it;  however  at  the  higher  center  of  mass 
energy  of  ITeV,  the  sensitivity  to  KV  is  considerably  more  symmetric  about  1. 


a  parabola  for  the  cross  section  as  a  function  of  K  and  solved  for  the  cross  section 
as  a  function  of  the  two  parameters  «7  and  KZ  as 


\M  |2  =  a  -f  bKy  +  CKZ  +  dK^KZ  +  en*  +  JK\.  (21) 


We  turn  these  results  into  limits  on  the  detection  of  deviations  of  KV  from  1 
by  assuming  an  integrated  luminosity  of  50/6"  1  for  a  proposed  collider  [36].  ^From 
the  total  cross  section  of  the  individual  processes,  we  find  the  following  2<r  lim- 
its on  measurements  of  «:7  and  KZ-  At  ^/s  of  500  GeV,  «7  could  be  measured 
within  -2.5%  (pr)  to  +9.5%(ee)  and  KZ  within  the  range  -6%  (/JT,  ^e,  fj.  mu)  to 
-1-8%  (pe).  At  1  TeV,  the  corresponding  limits  on  /c7  are  —1%  (pr)  to  +3.5%  (nr,  fie) 
and  on  KZ  we  find  limits  of  -1.5%  (p.r]  to  4-2.5%  (/*r).  The  channels  given  in  brack- 
ets with  each  limit  indicate  which  of  the  processes  supplies  the  best  bound.  These 
particular  limits  simply  represent  the  outer  bound  of  the  2<r  contour  for  the  various 
processes.  If  one  makes  some  assumptions  about  the  relationship  of  «7  and  KZ, 
such  as  that  «7  =  KZ  or  that  A«7  =  C08a2g°08_8^a  6  —  A/c^  [23],  better  bounds  are 
obtained.  In  addition,  combining  the  statistics  from  all  the  processes  considered 
here  would  improve  the  bounds.  In  fact,  one  could  also  combine  these  four  lepton 
processes  with  the  similar  jet  channels  such  as  e+e~  —*•  qiftv.  Combined  bounds 
would  necessitate  inclusion  of  detector  acceptances  and  efficiencies  for  the  various 
particle  types.  We  emphasize  that  even  the  bounds  quoted  above  from  the  cross 
sections  of  individual  processes  are,  indeed,  approaching  the  very  interesting  realm 
of  probing  KV  to  within  a  few  per  cent  of  the  standard  model  value.  We  note  that 
it  is  particularly  important  to  go  to  the  higher  energy  in  order  to  probe  values  of 
KV  larger  than  1. 

We  have  also  generated  a  number  of  distributions;  these  include  the  differential 
cross  sections  with  respect  to  the  angle  of  each  charged  lepton  relative  to  the  beam, 
the  angle  between  the  charged  leptons,  the  energy  and  transverse  momentum  of  each 
charged  lepton,  the  total  visible  energy  and  transverse  momentum  and  the  invariant 
mass  of  the  charged  lepton  pair.  The  charged  lepton  angular  distributions  tend  to  be 
strongly  peaked  along  the  beam  line  due  to  the  t-  channel  neutrino  exchange;  they 
are  generally  enhanced  somewhat  away  from  the  beam  direction  for  nonstandard 
KV  values.  The  energy  and  transverse  momentum  distributions  of  the  individual 
particles  tend  to  be  enhanced  over  most  of  their  range.  The  total  visible  transverse 
momentum  is  preferentially  enhanced  where  the  differential  cross  section  is  largest. 

It  now  remains  to  determine  whether  we  can  pinpoint  the  values  of  K7  and  KZ 
individually.  There  have  been  a  number  of  approaches  proposed  for  discriminating 
between  deviations  of  K7  and  KZ-  One  suggestion  is  to  study  processes  which 
only  involve  one  or  the  other  of  the  fWW  and  ZWW  vertices.  The  associated 
production  of  a  W  with  either  a  7  or  a  Z  boson,  radiative  W  decay  [28],  and  ej 
processes  such  as  ej  ->•  Wv  [37]  fall  into  this  category.  Another  suggestion  is  to 
make  cuts  which  isolate  one  of  the  vertices.  For  instance,  [38]  has  studied  the  n+p' 
production  process  which  we  also  consider  here  and  have  focussed  on  the  ZWW 
vertex  by  requiring  that  the  invariant  mass  of  the  ji+/i~  pair  fall  within  5  GeV  of 
Mz.  Here,  we  emphasize  instead  the  potential  usefulness  of  the  helicity  structure 
in  providing  a  determination  of  «7  and  KZ- 
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We  observe  that  the  (  —  \-  -\  —  )  [39]  amplitude  is  suppressed  at  -y/*  ^  Afz  for 
«7  =  KZ  as  a  direct  result  of  the  general  form  of  this  amplitude,  which  is  given 
below. 
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Here  A  and  B  denote  the  KV  dependent  and  independent  factors,  respectively,  of  the 
amplitude.  For  large  center  of  mass  energies,  the  cancellation  of  the  K^  and  KZ  terms 
results  in  a  (  —  h  H  —  )  helicity  contribution  of  less  than  about  one  per  cent  of  the 
total  cross  section  for  the  standard  model  and  for  «7  =  KZ  in  general.  On  the  other 
hand,  for  nonequal  values  of  «7  and  KZ,  this  contribution  can  be  as  much  as  30% 
of  the  total.  Thus,  polarized  beams  accessing  the  individual  helicity  contributions 
could  differentiate  between  the  Ky  =  KZ  case  and  the  nonequal  case.  Apart  from  the 
general  observation  described  above  regarding  the  case  of  K7  and  KZ  equal,  experi- 
mental results  on  the  cross  sections  for  the  four  types  of  processes  we  consider  with 
polarized  and  unpolarized  beams  could  provide  a  characteristic  'fingerprint'  for  a 
(«7,  KZ)  pair.  As  an  example  of  how  this  might  work,for  the  pr  process  at  1  TeV;  we 
note  that,  for  instance,  («7,/cz)  =  (0.945,0.945),  (1.07,  1.07),  (1,1.095),  (1,0.92), 
and  (0.92,  1)  all  have  approximately  the  same  total  cross  section.  The  percentage 
of  the  cross  section  supplied  by  the  (-  +  H  —  )  helicity  is  less  than  1%  for  the  two 
cases  quoted  with  «7  =  KZ;  it  is  3.6%  for  (1,1.095),  18%  for  (1,0.92),  and  27% 
for  (0.92,  1).  Since  the  total  cross  section  for  unpolarized  beams  corresponds  to 
about  1000  events,  these  cases  can  possibly  be  discriminated  providing  reasonable 
polarization  can  be  achieved.  Similar  results  from  the  four  types  of  processes  can 
be  combined  to  narrow  in  on  the  actual  values  of  «7  and  KZ,  individually.  The 
detailed  study  of  distributions  will  also  be  useful  as  mentioned.  We  note  that  the 
processes  considered  here  offer  a  very  clean  experimental  signature  for  excellent 
sensitivity  to  «7  and  KZ  at  a  high  energy  e+e~  collider. 

8.  Conclusions 


The  anomalous  electro  weak  vector  boson  couplings  have  just  started  getting  tested. 
The  vector  boson  vertex  can  be  written  in  a  model  independent  way  and  related  to 
the  moments  of  the  W  boson.  We  have  examined  some  direct  and  indirect  (loop- 
induced)  bounds  on  the  TEVB  couplings.  However,  these  bounds  are  too  weak  to 
limit  most  models  and  probe  loop  level  corrections  to  the  standard  model.  There 
will  be  better  bounds  coming  from  CDF  and  DO  collaborations  as  well  as  from  LEPII 
when  it  becomes  operational.  Bounds  obtained  from  oblique  parameters,  obtained 
from  LEPI  data  cannot  rule  out  anomalous  couplings  even  if  the  oblique  parameters 
are  in  complete  agreement  with  the  SM.  Future  colliders  will  be  needed  to  probe 
modifications  to  the  electroweak  vector  bosons  vertex  from  loop  level  corrections 
within  the  SM,  as  well  as  from  most  extensions  of  the  SM.  In  this  connection  it 
is  found  that  the  Next  Linear  Collider  running  at  SOOGeV  and  ITeV  can  bound 
these  couplings  accurately  to  probe  at  this  level  (less  than  5%)  and  also  allow  for  the 
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P.  N.  Pandita,  R.  J.  N.  Phillips,  R.  Ramchandran,  G.  Rajasekaran, 

V.  Ravindran,  S.  D.  Rindani,  R.  G.  Roberts,  D.  P.  Roy,  P.Roy,  J.  P.  Singh, 
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The  activities  of  this  group  centered  round  two  main  topics  of  current  interest. 
The  advent  of  HERA  the  e-p  collider  at  DESY  led  to  the  measurement  of  proton 
structure  functions  with  unprecedented  accuracy,  in  particular,  at  low  values  of 
the  Bjorken  variable  x.  Theoretical  understanding  of  these  results  often  demands 
departures  from  perturbative  QCD.  A  part  of  the  group  activities  was  therefore  de- 
voted to  structure  functions  and  related  topics.  The  following  topics  were  discussed 
in  working  group  sessions(  the  names  of  the  contributors  are  given  in  parenthesis): 

1)  Determination  ofcr(x,Q2)  at  HERA  (  M. Drees  and  R.M.Godbole) 

2)  Polarised  structure  functions:    possibility  of  constructing  single  spin  asym- 
metries (  V.Gupta,  D. Indumathi,  H.S.Mani,  P.Mathews,  M.V.N. Murthy, 
R.Ramchandran,  G. Rajasekaran,  L.Sehgal,  J.P.  Singh  and  K. Sridhar) 

3)  Rise  of  FI(X)  at  small  ar,fixed  poles  and  Schwinger  terms  (  K.Bora,  V.Gupta, 
D.Indumathi,    H.S.Mani,    P.Mathews,    M.V.N. Murthy,    R.Ramchandran, 
G. Rajasekaran,  V. Ravindran  and  K. Sridhar) 

Searches  for  new  particles  and  interactions  indicating  physics  beyond  the 
standard  model  are  at  the  focus  of  attention  of  all  the  on  going  experiments  at 
hadron  and  e+  -  e~  colliders  like  LEP  100  at  CERN  and  the  TEVATRON  at 
Fermilab.  The  issues  unresolved  after  the  present  round  of  experiments  ,  if  any,  will 
continue  to  dominate  the  experiments  at  the  future  colliders  like  LEP  200  and  LHC 
at  CERN.  In  view  of  this  a  considerable  part  of  the  group  activity  was  devoted  to 
the  phenomenology  of  new  particle  searches  at  hadron  and  e+  —  e~  colliders.  The 
following  topics  were  covered  in  some  details(  the  names  of  the  contributors  are 
given  in  parenthesis): 


1)  Virtual  LSPs  in  N  =  1  SUGRA  models  (A.Datta,  M.Drees  and  M.Guchait) 

2)  Like  sign  dileptons  and  SUSY  searches  in  models  with  conserved  and  broken 
R-Parity  (M.Guchait,  N.K.Mondal  and  D.P.Roy) 

3)  Loop  Induced  gluino  decays(M. Drees,  M.Guchait,  B.Mukhopadhyaya  and 
D.P.Roy) 

4)  Limit  on  the  lightest  neutralino  mass  in  R-parity  violating  theories  (S.Banerjee, 
R.M.Godbole,  K.Mazumdar,  D.P.Roy  and  P.Roy) 

5)  Tau  number  violating  signals  for  R-parity  violating  SUSY  at  LEP  200 
(S.Banerjee,  R.M.Godbole,  K.Mazumdar  and  P.Roy) 

B)  Higgs  Physics 

1)  Production  of  Charged  Higgs  in  pp  collisions  via  7  — 7  interactions  (  M.Drees, 
R.M.Godbole,  M.Nowakowski  and  S.D.Rindani) 

2)  Implications    of    a    tree    level    HWZ    coupling    at    e+  —  e~    colliders 
(  R.M.Godbole,  B.Mukhopadhyaya  and  M.Nowakowski) 

C)  Gauge  E^psoh  Physics 

1)  Faliure     of    equivalent     gauge     boson     (EVBA)     approximation? 
(  G.Rajasekaran,  S.D.Rindani  and  R.Sinha) 

2)  Single  Z  production  in  e+  —  e~  collision,  anomalous  triple  gauge  boson  coti- 
pling  and  EVBA  (  S.D.Rindani  and  J.P.Singh) 

3)  CP  violating  tripple  vector  boson  couplings  at  NLC  (  P.Kalyaniak,  P.Madsen, 
N.Sinha  and  R.sinha) 

4)  Determination  of  tripple  gauge  boson  vertices  at  LEP  200  ( T.Aziz,  S.Banerjee, 
K.Mazumdar,  NJSinha  and  R.Sinha  ) 

Summaries  of  some  of  the  above  works  were  submitted  by  the  authors. 
They  are  presented  following  this. 
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The  lightest  supersymmetric  particle  (LSP)playa  a  crucial  role  in  the  search  for 
supersymmetry  (SUSY)  since  it  carries  missing  transverse  energy  (fir)  which  is 
traditionally  regarded  as  the  most  distinctive  signature  of  SUSY  particles.  In  the 
currently  favoured  models,  the  lightest  neutralino  (Zi)  is  assumed  to  be  the  LSP 

[1]- 

Recently  it  has  been  emphasised  [2,  3]  that  there  may  exist  SUSY  particles 
which,  though  unstable,  decay  dominantly  into  invisible  channels.  This  occurs 
if  the  sneutrinos  (i7)  (the  super-partners  of  the  neutrinos)  ,  though  heavier  than  the 
LSP,  are  lighter  than  the  lighter  chargino  (Wi)  or  the  second  lightest  neutralino  (Z^) 
and  are  much  lighter  than  all  other  SJJSY  particles.  As  a  consequence,  the  invisible 
two-body  decay  mode  v  — *  vZ\  opens  up  and  completely  dominates  over 
others,  being  the  only  kinematically  allowed  two-body  decay  ofttie  sneutrinos.  The 
other  necessary  condition  for  this  scheme  to  work  is  that  the  Z\  has  a  substantial 
Zino  component.  This,  however,  is  almost  always  the  case  as  long  as  the  gluino 
(the  super-partner  of  the  gluon)  has  a  mass  above  the  lower  bound  obtained  by 
the  SUSY  searches  at  the  Tevatron  [4].  In  such  cases  the  £2,  which  also  has 
a  substantial  Zino  component,  decays  primarily  through  the  process  Zi  — »•  vv. 
These  particles  decaying  primarily  into  invisible  channels,  hereafter  called  virtual 
LSP's  (VLSP's),  may  act  as  additional  sources  of  $T  and  can  significantly  affect 
the  strategy  for  SUSY  searches  [2] . 

Another  important  consequence  is  the  decay  W\  —^+  \v,  \  —  e,  \JLOT  r.  Since  this 
is  the  only  kinematically  allowed  two  body  decay  of  W\  it  occurs  with  a  branching 
ratio(  BR  )  w  1. 

In  Ref  2.  sparticle  masses  were  treated  as  free  phenomenological  parameters, 
although  it  was  commented  briefly  that  it  is  not  unlikely  that  the  VLSP  scenario 
can  be  accomodated  in  the  N=l  SUGRA  models  with  common  squark  and  gaugino 
masses  at  the  GUT  scale  [5].  In  this  work  we  show  in  detail  that  this  indeed  is  the 
case  for  a  reasonably  large  region  of  the  SUSY  parameter  space. 

We  assume  a  minimal  N=l  supergravity  model  [5]  with  a  common  scalar  mass 
mo  and  a  common  gaugino  mass  mi/2  at  the  GUT  scale  MX,  We  also  assume 
minimal  particle  content.  We  follow  the  standard  convention  for  the  neutralino 
and  chargino  mass  matrices  [1]. 

In  the  following,  MI  and  M2  are  SUSY  breaking  U(l)  and  577(2)  gaugino 
masses,  j*  is  the  supersymmetric  Higgs(ino)  mass  and  tan/?  =  {/^/(^i)  's  tne 
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atio  of  the  vacuum  expectation  values  (vevs).  Ine  assumed  unification  ol  gaugino 
nasses  leads  to  the  following  relation  at  the  weak  scale: 

Mi  =  |  tan20w  M2  ~  0.5M2,  (1) 

o 

vhere 
Ui(Mz]  -  0.82m1/2  (2) 

jives  the  connection  to  the  GUT  scale  gaugino  mass.  The  above  parameters  also 
letermine  the  chargino  masses. 

The  relevant  slepton  masses  [5]  at  the  weak  scale  are  determined  by  Af2,  mo 
md  tan/?: 

mfR  =  mo  +  0.223M22  -f-  sin2  0w&z, 
m?L  =  m2Q  -f  0.773M|  -f  (0.5  -  sin2  0w)DZt 
m?  =  mj  +  0.773M|  -  0.5DZ> 
vhere 


"or  tan/?  >  1.  Notice  that  m,-    >  rnj-    always. 

Our  free  parameters  are  thus  m0,  M2  (which  we  traded  for  mi/2),  /J  and  tan/?. 
Fhere  are  two  sets  of  constraints  on  the  allowed  parameter  space:  direct  experimen- 
tal constraints  (primarily  from  LEP-I  [6])  ,  and  requirements  for  having  VLSP's. 
We  took  the  following  experimental  constraints  into  account  [6]: 

mjH  >  45  GeV,    m*  >  40  GeV,    m^  >  46  GeV, 

T(Z-+ZiZi)<  12MeV,    £r(Z  ->  2^-)  <  0.25  MeV,  (5) 


vhere  the  sum  does  not  include  (i,  j)  —  (1,  1). 

The  defining  property  of  the  VLSP  scenario  is  that  both  the  sneutrino  and  Zi 
lecay  invisibly.  This  implies  the  following  constraints: 

m«?  <  mza  <  miL  ,  miR 

mp  <  m,^  <  m,-t  .  (6) 

Notice  that  eq.  6  implies  me  <  m^;  eqs.(3)  then  give  an  upper  bound  on  M2: 
|M2|  <  1.15>/Dj.  (7) 


Fhis  in  turn  requires  a  gluino  light  enough  to  be  within  the  striking  range  of  the 
Fevatron.  For  example  with  tan/?  =  2  (10)  the  upperbound  on  the  gluino  mass  is 
243.45  GeV  (  284.29  GeV).  For  each  value  of  the  gluino  mass  there  is  an  upper 
bound  on  the  lighter  chargino  mass  (see,  for  example,  Datta,  Guchait  and  Ray- 
:haudhuri  in  ref  [6]).  The  above  bounds  on  the  gluino  mass  implies  m^  <  95  GeV 
(approximately).  Then  eq.  6  implies  that  mo  cannot-be  too  large,  either,  hence 
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sleptons  will  also  be  light.  The  lightest  charged  sparticle  mass  is  then  expected  to 
be  around  MZ-  The  masses  are,  therefore,  in  the  region  of  interest  for  LEP  200. 

Figure  1  shows  an  example  of  a  allowed  region.  Here  we  fixed  tan/?=  10,  and 
plotted  the  allowed  region  in  the  (/*,  A/2)  plane  for  various  values  of  mo.  The  dotted 
curve  delineates  the  region  excluded  by  sparticle  searches  at  LEP-I.  The  allowed 
region  for  fixed  mo  is  not  very  large,  although  the  fraction  of  the  plane  with  a  VLSP 
scenario  for  some  value  of  mo  is  substantial,  given  only  that  MI  satisfies  eq  7. 

Almost  any  value  of  p  can  accomodate  VLSPs  if  A/2  and  m0  are  chosen  properly. 
In  fig.2  we  have  plotted  the  allowed  region  in  the  (m0,  MI)  plane  for  tan/?=  2  (solid 
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Figure  2. 

curve)  and  10  (dashed  curve),  after  scanning  over  all  /i;  in  other  words,  the  curves 
enclose  the  region  where  at,  least  one  value  of  p.  can  be  found  so  that  a  valid  VLSP 
scenario  emerges. 

We  now  briefly  discuss  W\  pair  production  at  LEP  200  energies.  Our  signal  is 
a  pair  of  opposite  sign  stable  dileptons  (  e  or  //  )  +  missing  energy,  arising  from  the 
decay  of  W\  s  into  leptons  and  VLSPs  (sneutrinos). 

Since  the  dilepton  final  State  in  this  case  arises  from  the  decay  of  spin  1/2 
charginos,  the  distributions  of  the  final  state  leptons  depend  on  the  polarisations 


<*ji   tii<^   (-110,1  g,uivjD.      i  iicoc   puiaiiodbiuus  cau    uc  ^vju veuieutiy    ua.ii.cu   lutu   cvi^uuui    uy 

calculating  the  dilepton  cross-section  and  the  related  distributions  using  the  helicity 
projection  technique  [7]. The  calculations  are  tedious  but  straight  forward  and  the 
results  will  be  presented  elsewhere  [8]. 

In  this  scenario  with  a  tightly  constrained  mass  spectrum  the  lighter  chargino 
is  the  lightest  charged  sparticle  in  certain  region  of  the  allowed  parameter  space 
discussed  earlier.  These  regions  can  be  most  conveniently  probed  by  restricting 
the  beam  energy  below  the  threshold  for  the  production  of  other  sparticles  .  The 
above  dilepton  signal  consisting  of  any  combination  of  e  and  ft,  after  removing  the 
7  ~  Ti  T  —  T  and  WW  backgrounds  by  appropriate  kinematical  cuts  ,  then  provides 
an  unambiguous  signal  of  chargino  pair  production.  For  beam  energies  above  the 
slepton  threshold,  only  e  -p  pairs  provide  a  clean  signature  of  chargino  production. 
Such  pairs  of  course  cannot  arise  from  slepton  pairs  whose  decays  conserve  flavour. 
In  the  following  we  shall  present  some  illustrative  examples  of  chargino  signals  in 
the  VLSP  model. 

For  m0  =67.5  GeV  ,M2=  53  GeV  ,tan(3  =  2.0  and  p  =  -150  GeV,  the  sparticle 
spectrum  of  our  interest  is  given  by:  ra~  =  65.35  GeV,  m~  — •  79.58  GeV,  rn~  == 

"  '  R  *  Li 

89.98  GeV  and  m-~-  =  72.74  GeV.  In  this  scenario  the  dilepton  cross-section  arising 
out  of  direct  chargino  decays  at  the  c.m.  energy  \fs  —  160  GeV  is  O.lSpfc"1  where 
the  kinematical  cuts  used  are  as  above  (  the  cut  against  the  W-W  background  of 
course  is  irrelevant  in  this  case).  Thus  for  an  integrated  luminosity  of  500p6~1  we 
expect  about  75  events.  Here  since  2  m~  i  A/S,  the  dilepton  final  state  consisting 
of  any  combination  of  e  and  p  provides  the  signal. 

Since  the  charginos  can  also  decay  into  r's,  stable  leptons  from  the  decays 
T  — »•  e(  or  n  }v  v  can  enhance  the  signal.  The  cross-section  for  this  process 
, however,  turns  out  to  be  rather  small  due  to  the  small  leptonic  branching  ratio  of 
the  r  [8]. 

The  end  points  of  the  lepton  energy  spectrum  from  direct  chargino  decays  are 
particularly  interesting  since  they  determine  in  principle  both  m~  and  m~ .  It 
turns  out  that  the  number  of  stable  leptons  arising  from  r  decays  is  not  large 
enough  to  obscure  completely  the  characteristics  of  the  above  energy  spectrum. 

We  note  in  passing  that  in  the  above  example  the  mass  difference  between  the 
chargino  and  the  sneutrino  is  rather  small.  Yet  we  get  a  viable  signal.  This  is  to  be 
contrasted  with  chargino  search  in  this  channel  at  TEVATRON  where  the  signal 
can  be  completely  washed  out  if  the  above  mass  difference  is  small  [9]. 

The  other  case  we  have  considered  in  some  detail  consists  of  the  following  pa- 
rameters: m0  =40  GeV  ,M2=  70  GeV  ,tan(3  =  2.0  and  /*  =  -75  GeV.  This  yields 
the  sparticle  spectrum  :  m~  =  53.8  GeV,  m,~  =  61.9  GeV,  m~  =  82.0  GeV  and 
m—  =  79.9  GeV.  This  case  is  interesting  for  two  reasons. Firstly,  the  beam  energy 
has  to  be  above  the  WW  threshold  and  the  cuts  against  WW  background  are  called 
for.  Secondly  ,  since  mp-  <  m~  only  e  -  ^  final  states  should  be  considered.  A 
calculation  using  the  standard  cuts  yields  a  dilepton  cross-section  of  O.l3pb~l  from 
direct  decays  at  c.m.  energy  ^/s  =  180  GeV  .Thus  for  an  integrated  luminosity  of 
500p6-1  we  expect  about  65  events. 

We  have  also  checked  the  cross  -  section  for  several  representative  points  of  the 
parameter  space  for  tan/?  =  10.  The  number  of  events  for  -^/s  =  150  —  180  GeV  is 
between  20  -  150  for  an  integrated  luminosity  of  500  pb~l. 
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The  detection  of  a  fundamental  charged  scalar  particle  would  certainly  lead 
beyond  the  realm  of  the  Standard  Model  (SM).  These  particles  can  arise  either 
in  the  context  of  supersymmetric  models,  as  superpartners  of  quarks  and  leptons 
[2],  or  in  extended  Higgs  sectors  (with  or  without  supersymmetry).  In  general, 
the  different  charged  scalars  will  have  different  interactions  at  tree  level.  Hence  a 
model  independent  production  mechanism  is  welcome.  Such  a  model  independent 
interaction  is  clearly  given  by  the  scalar  QED  part  of  the  underlying  theory.  For 
example  the  77  fusion  processes: 


77   -*#+#-,    7+r,    ...  (1) 

are  uniquely  calculable  for  given  mass  of  the  produced  particles.  At  pp  colliders  we 
also  have,  however,  the  possibility  of  the  qq  annihilation  Drell-Yan  processes 

qq->  H+H~,    l+r    ...  (2) 

There  has  been  a  claim  in  the  literature  that  the  77  fusion  exceeds  the  Drell- 
Yan  (DY)  cross  sections  at  pp  by  orders  of  magnitude  [3].  This  would  be  an 
interesting  possibility  of  producing  charged  heavy  scalars  at  hadronic  colliders  or 
for  that  matter  any  charged  particle  which  does  not  have  strong  interactions. 

Apart  from  the  charged  scalars  mentioned  above  there  exist  various  candidates 
for  charged  fermions.  These  fermions  can  be  either  fourth  generation  leptons, 
charginos  or  exotic  leptons  in  extended  gauge  theories  like  E&  [4].  Current  lim- 
its on  the  masses  of  all  exotic  charged  particles  which  couple  to  the  Z  with  full 
strength  are  ~  Mz/2.  In  the  case  of  H^  there  exist  additional  constraints  (clearly 
model  dependent)  from  the  experimental  studies  of  the  b  —  *•  57  decay.  The  calcu- 
lation for  77  —  »•  L+L~  at  pp  colliders  has  been  done  recently  [5].  The  result  in  [5] 
is  that  the  77  cross  section  is  comparable  to  the  corresponding  Drell-Yan  process 
at  high  energies,  e.g.  at  ^/s  =  40  TeV  for  m^  ~  100  GeV.  At  LHC  energies  the  77 
cross  section  in  the  same  mass  range  was  found  to  be  [5]  one  order  of  magnitude 
smaller  than  the  DY  cross  section. 
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bution  [6].  In  what  follows  we  outline  briefly  the  basic  tools  and  approximations  in 
the  calculation. 

In  order  to  calculate  the  pp  cross  section  we  have  used  the  Weizsacker-  Williams 
approximation  [7]  for  the  inelastic  case  (ipX  vertex)  and  a  modified  version  of  this 
approximation  [8,  9]  for  the  elastic  case  (ypp  vertex).  In  the  latter  case  the  proton 
remains  intact.  The  inelastic  (inel.)  total  pp  cross  section  for  H+H~  as  well  as 
production  reads 

**i  I'          <tea  /'  **i  ^  dz>  cjej, 

4ma/s 


.    -/« 

(3) 


ivhere  m  is  the  mass  of  either  H^  or  L*,  eu  =  2/3,  64  —  —1/3  and  a^  is  the  pro- 
duction subprocess  cross  section  with  the  center  of  mass  energy  \/l  = 


Fhe  structure  functions  have  the  usual  meaning:  fq/p  is  the  quark  density  inside 
;he  proton,  /7/g  is  the  photon  spectrum  inside  a  quark.  We  use  the  the  MRSD_ 
Darameterization  for  the  partonic  densities  inside  the  proton  [10].  The  scale  Q2  has 
Deen  chosen  throughout  the  paper  to  be  s/4.  We  use 

ln(Q?/Q5)  (4) 


Ihere  is  a  certain  ambiguity  about  the  choice  of  the  scales  Q?  in  the  argument 
}f  the  log  in  eq.  (4).  We  choose  Q\  to  be  the  maximum  value  of  the  momentum 
transfer  given  by  s/4  -  m2  and  the  choice  of  Q\  -  1  GeV2  is  made  such  that  the 
ahotons  are  sufficiently  off-shell  for  the  Quark-Parton-Model  to  be  applicable. 
The  semi-elastic  (semi-el.)  cross  section  for  pp  -*  H+H~(L+L~)pX  is  given  by 

<""'-"•(*)  =  2  /         dx,  I  dz,  I  dz,- 


=  2  /         dx,  I  dz,  I 

./4maa  J^m^»xl  Jsm^ 


'pp  W  —  ^  /  U£l  I  uzl   I  uzf 

Um*/, 


The  subprocess  energy  now  is  given  by  \/£  =  *Jsx  \Z\Z-L.  The  elastic  photon  spec- 
trum f*ljp(z]  has  been  obtained  in  the  form  of  an  integral  in  [8].  However,  we  use 
an  approximate  analytic  expression  given  in  [9]  which  is  known  to  reproduce  exact 
results  to  about  10%.  The  form  we  use  is  given  by 


(6) 

*HA  L  U  -it          4  SI-          O^1"J 

where 

.      ,      0.71(GeV)2 
A  =  l  +  —QT. (?) 

**min 

with 

Q2          __       o*«2  _L         \f0  _i_  «,2^/'o        ^»  j    ^,2\ 
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At  high  energies  Qj^m  ^s  given  to  a  very  good  approximation  by  mpZ2/(l  —  z).  Since 
the  relevant  values  of  the  scaled  photon  energy  Z{  can  in  general  take  smaller  values 
in  the  elastic  case  as  compared  to  the  inelastic  case,  eqs.  (8), (7)  and  (6)  imply  that 
even  in  the  elastic  case  there  is  a  logarithmic  enhancement  of  the  photon  densities. 
Finally  the  pure  elastic  contribution  wherein  both  the  photons  remain  intact 
and  hence  can  in  principle  give  rise  to  clean  events,  can  be  written  as 

(9) 

Defining  fa,  H  —  (1  —  4m|  u/s)1^  the  77  subcross  sections  take  the  simple 
form 

o»^,2  i  \xi  \     r     ^     i  —  #4     i  +  0i 

z  —  p H  —       "„ —  in         „ 


(10) 
and  for  lepton  production 

!     (Ml  \  .      Ps  _  fc         1  _L  Rr 

-(2 -/?1)|  (ii) 


Note  that  we  have  used  orem  =  1/137  in  (4)  and  (6)  and  <xem(M^}  =  1/128  in  the 
subcross  sections  (10)  and  (11). 

For  completeness  we  also  give  here  the  Drell-Yan  qq  annihilation  cross  section  to 
H+H~  including  Z°  exchange,  for  the  case  that  H^  resides  in  an  SU(2)  doublet: 


In  the  above  py,  ,  g^q  are  the  standard  vector  and  axial  vector  coupling  for  the 
quark.  The  results  of  our  calculations  are  presented  in  Fig.  1  for  H+H~  production 
and  in  Fig.  2  for  the  lepton  case.  As  far  as  the  HJfH~  production  in  77  fusion  is 
concerned  we  differ  from  the  results  given  in  [3]  by  roughly  three  orders  of  mag- 
nitude: our  77  cross  section  is  far  below  their  results  and  also  approximately  two 
orders  of  magnitude  smaller  than  the  DY  cross  section.  The  logarithmic  enhance- 
ment of  the  photon  densities  is  simply  not  enough  to  overcome  completely  the  extra 
factor  a^m  in  the  77  process.  Even  if  the  Higgs  is  doubly  charged  (such  a  Higgs 
appears  in  triplet  models  [11])  the  ratio  of  DY  to  77  cross  section  changes  only  by 
a  factor  1/4  as  compared  to  the  singly  charged  Higgs  production.  We  also  find  that 
contributions  from  elastic,  semi-elastic  and  inelastic  processes  to  the  total  77  cross 
section  are  of  the  same  order  of  magnitude.  The  elastic  process  contributes  ~  20% 
of  the  total  77  cross-section  at  smaller  values  of  mjj  going  up  to  30%  at  the  high 
end.  This  can  be  traced  to  the  logarithmic  enhancement  of  the  photon  density  even 
in  the  elastic  case  mentioned  earlier.  Assuming  the  //,,  IR  to  be  degenerate  in  mass 


to4 


10' 


£     10' 


-   10° 

it 

e 

0   10-' 


10" 


10" 


n  -  nV 

Va  -  14  TeV 


x  \\  \      Drell-Yan 

:v> 

•vV 

/     V    *    N\ 

Tlli.UC  NV^  \     77  Total 

/  ^^ 
Inelutic         x;:^^ 

Semi-ela.itic  "•"«  ^ 


Jy. 


100       200       300       400       500       600       700 
Mass  of  Htggs  (G«V) 


Figure  1.  Cross  section  in  fb  for  DY  and  77  production  of  the  charged  Higgs  at  LB 
energies,  as  a  function  of  the  Higgs  mass.  The  dashed,  dash-dotted  and  long-dashed  lin 
show  the  el.,  inel.  and  semi-el,  contributions  (as  defined  in  the  text)  to  the  77  crc 
sections.  The  total  77  cross  section  and  the  DY  contributions  are  shown  by  the  label 
solid  lines. 
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Figure  2.  Cross  section  in  fb  for  DY  and  77  production  of  the  charged  Leptons  at  LHC 
energies,  as  a  function  of  the  Lepton  mass.  The  convention  is  the  same  as  in  Fig.  1 

masses  in  the  range  of  50  —  100  GeV[6j.  In  general  the  L+L~  cross-sections  are 
higher  than  the  corresponding  H+H~  cross-sections  (both  for  77  and  DY)  by 
about  a  factor  of  5-7.  This  can  be  traced  to  the  different  spin  factors  and  the 
different  J3  dependence  of  the  subprocess  cross-section  for  the  fermions  and  scalars. 
The  cross-section  for  the  77  production  of  charginos  will  again  be  the  same  as  that 
of  the  charged  leptons. 

Another  process  that  contributes  to  the  pair  production  of  Higgs  bosons  and 
charged  leptons  is  one  loop  gluon  fusion: 


(13) 

These  contributions  will  only  be  competitive  with  ordinary  DY  production  if  some 
couplings  of  the  produced  particles  grow  with  their  mass.  Accordingly  the  first 
process  will  be  large  [12]  if  mt  >  mj/+  (in  which  case  H+  production  from  t  decays 
will  have  even  larger  rates)  but  is  expected  to  decrease  for  m//-+  >  m<.  Since  the 
coupling  of  chiral  leptons  to  Higgs  bosons  and  longitudinal  Z  bosons  grows  with  the 
lepton  mass,  graphs  containing  the  (1-loop)  ggH°*  and  ggZ°*  couplings  dominate 
the  production  of  both  charged  [13]  and  neutral  [14]  chiral  leptons  of  sufficiently 
large  mass. 

In  summary,  we  have  shown  that  the  cross  section  for  the  pair  production  of 
heavy  charged  scalars  or  fermions  via  77  fusion  amounts  to  at  best  a  few  %  of  the 
corresponding  Drell-Yan  cross  section;  in  many  cases  there  are  additional  produc- 
tion mechanisms  with  even  larger  cross  sections.  Moreover,  at  the  LHC  overlapping 
events  prevent  one  from  isolating  77  events  experimentally  unless  the  machine  is 


larger  than  at  LEP200.  We  do  therefore  not  expect  77  fusion  processes  at  the 
LHC  to  be  competitive  with  more  traditional  mechanisms  for  the  production  of 
new  particles. 

While  writing  this  note  we  have  received  a  preprint  [15]  which  treats  the  same 
subject  of  77  processes  in  pp  colliders  and  gets  similar  results.  However  we  differ 
somewhat  in  the  details  which  is  most  probably  due  to  the  different  treatment  of 
the  photon  luminosity  functions. 

This  report  is  based  on  the  paper  Physical  Review  D  50  (1994) 
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Whereas  all  the  charged  fermions  must  get  their  masses  via  Yukawa  couplings 
with  Higgs  doublets[l]  ,  the  vacuum  expectation  values  (vev)  of  triplets  can  give 
Majorana  masses  to  left-handed  neutrinos.  It  is  thus  natural  to  consider  other 
phenomenological  implications  of  triplet  scalars[2].  However,  the  vev  of  the  neu- 
tral member  of  the  triplet  gives  additional  contributions  to  the  parameter  p  — 
rnw/(rn'z  cos20w)  (where  0w  is  the  Weinberg  angle)  at  the  tree-level,  tending 
to  deviate  its  value  from  unity.  Since  the  present  experimental  value  of  p  is 
(1.0004  ±0.0022  ±0.002)  [3],  any  scenario  with  scalar  triplets  must  be  constrained 
accordingly. 

One  way  to  avoid  this  problem  is  to  postulate  that  the  vev  of  the  neutral  member 
of  the  triplet  is  small  enough  so  that  its  contribution  to  p  is  within  the  experimental 
limits.  The  other  option  is  to  assume  [4],  [5],  that  there  are  in  fact  more  than 
one  triplets,  arranged  in  such  a  manner  that  their  contributions  to  p  cancel  each 
other.  For  this  the  minimun  requirement  is  one  complex  (Y=2)  and  one  real  (Y=0) 
triplet,  in  addition  to  the  Y=l  complex  doublet  of  the  minimal  standard  model. 
Furthermore  the  vev's  of  these  two  triplet  fields  must  be  equal  to  guarantee  p  =  1. 

We  have  examined  some  testable  consequences  of  triplets  in  both  these  situa- 
tions, using  the  fact  that  with  triplets,  there  exists  a  tree-level  coupling  involving  a 
charged  Higgs,  W  and  a  Z.  This  interaction  vertex  was  studied  earlier  in  the  context 
of  LEP-I  [6].  We  find  that  in  LEP-II  and  higher  energy  e+e~  colliders  also,  it  leads 
to  new  types  of  signals. 

In  the  first  of  the  two  cases  the  complex  triplet  A  and  a  doublet  <j>  are  of  the 
form 


A=        A+        ,    </,=      *  (1) 


Taking  the  vacuum  expectation  values  to  be  <  <j>°  >=  v  and  <  A°  >=  w  the 
p  parameter  in  this  model  is  p  =  (1  +  ^r)/(l  +  ^75-)-  The  current  experimental 
constraint  translates  (at  99%  confidence  level)  to 

-  <  0.066  (2) 


to  DC  (Sffi  =  sin  OH'  etc.; 
V5w 

ca>  = 


It  is  worth  noting  here  that  in  this  case  the  H  +  couples  to  fermions  through  the 
doublet  component. 

The  above  scenario  causes  hierarchy  problems  because  of  the  large  splitting  of 
the  two  vacuum  expectation  values.    A  solution  is  to  add  a  real  triplet  field  x 


X=        X°  (4) 

\X~  J 

and  impose  on  the  Higgs  potential  a  SU($)L  <8>  SU(1}R  symmetry  which  at  tree 
level  forces  the  two  vacuum  expectation  values,  <  A°  >  and  <  x°  >  *°  be  equal. 
As  a  result  the  p  parameter  is  one  at  tree  level. 

Upon  diagonalisation  of  the  mass  matrix  of  the  scalar  sector,  in  general  one 
finds,  after  duly  absorbing  the  Goldstone  bosons,  a  5-plet,  //5++'+1°'~'  ,  a  3- 
plet,  #3+>0>~  and  two  singlets,  HI°  and  //i'°,  as  the  physical  states.  The  different 
multiplets  charaterise  their  respective  transformation  properties  under  the  custodial 
SU(2).  The  members  of  the  //5-plet  in  this  case  do  not  have  any  overlap  with  the 
doublet  (/),  and  as  such  they  cannot  interact  with  fermion-antifermion  pairs. 

Another  constraint  on  the  triplet  scenario  arises  from  the  lepton-lepton  cou- 
plings of  the  complex  Y=2  triplet.  The  mass  thus  acquired  by  the  electron  neutrino 
will  be  given  by 


with 


,       *H  /KN 

=  hee—  —  —  (5) 

2 


.  . 

*H  =     ,  (6) 

Vt>2  4- 


where  haj,  is  the  AL  =  2  coupling  strenth  of  the  first  generation.  The  experimental 
constraints  from  neutrinoless  double  beta-decay  imply  that  MVe  <  leV.  This 
means  that  either  the  doublet-triplet  mixing  angle  or  the  AL  =  2  Yukawa  coupling 
is  restricted  to  very  small  values.  Since  in  this  model  p  =  1  at  tree  level  a  large 
mixing  angle  8  a  is  apriori  not  excluded.  Though  it  still  requires  fine-tuning  to 
maintain  the  equality  of  the  triplet  vev's  at  higher  orders,  the  degree  of  such  fine- 
tuning  is  not  higher  than  that  involved  in  connection  with  the  naturalness  problem 
within  the  standard  model  itself  [7],  [8]. 

As  has  been  mentioned  before,  in  both  the  models  discussed  above  the  H+W~  Z 
coupling  exists  at  the  tree  level.  The  lagrangians  are 
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lower  energies.  Here  we  concentrate  upon  the  final  states  consisting  of  the  leptomc 
decay  products  of  the  W,  i.e.  on  e+e~  — »•  H±lvi.  In  such  cases,  the  tve  final  state 
also  receives  contributions  from  a  t-channel  diagram  with  the  H^  emitted  from 
the  propagator.  For  large  values  of  the  centre-of-mass  energy,  this  latter  diagram 
is  dominant.  Fig.  1  shows  the  cross-sections  for  H*  /^-production  plotted  against 
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Figure  1.  cr(e+e 


+~ 


(pb)  as  a  function  of  MH+  (for  SH(SH'CH')  —  1)  f°r  different 


values  of  e+e~  center  of  mass  energies:  the  thin  solid,  dotted,  dashed  and  thick  solid  line 
correspond  to  v/a  =  200  GeV,500  GeV,  1  TeV  and  2  TeV  respectively.  Cross-sections  for 
both  charges  of  the  Higgs  and  mnonic  as  well  as  electronic  channels  in  the  final  state  are 
added. 

Mff±  for  different  values  of^/s.  The  contributions  due  to  the  electronic  and  muonic 
final  states  are  added.  The  curves  correspond  to  SH  =  I  and  CH>SH>  =  1  for  the  two 
cases  of  H^  ,  Hf±  respectively.  The  cross-sections  for  various  values  of  the  mixing 
angles  can  be  read  off  by  multiplying  by  the  appropriate  factor.  It  is  clear  from  the 
graphs  that  in  the  LEP-II  case  (thin  solid  line),  assuming  an  integrated  luminosity 
of  1039cm~2  per  year  there  will  be  a  few  hundreds  of  events  for  an  =  1  upto  at 
least  MH&  —  110  GeV.  If  now  one  has  only  the  Y  —  2  triplet,  the  restriction  from 
the  p-parameter  allows  a  maximum  s#  of  0.009.  This  leaves  us  with  about  2  events 
per  year.  In  such  a  case,  the  only  reasonable  chance  of  observing  this  process  exists 
in  a  higher-energy  e+e~  machine.  As  can  be  seen  from  the  plot  (dotted  line),  such 
a  machine  with  a  luminosity  of  50/6"1  /year  [9]  can  lead  to  few  tens  of  events  upto 
MH>  ^  300  GeV  even  with  a  value  of  SH  •  well  within  the  limits  imposed  by  />. 
On  the  other  hand,  since  this  restriction  gets  relaxed  if  one  presupposes  a  complex 
and  a  real  triplet,  at  LEP-II  itself  one  can  investigate  a  large  area  of  the  p  —  SH 
parameter  space  in  the  latter  scenario. 

The  signals  of  the  triplet  scalars  thus  produced  also  depend  on  their  decay 
channels.    For  the  (complex-l-real)  case,  as  has  been  mentioned,  H$   do  not  have 
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;ree-level  interactions  with  fermions.  Possibilities  of  observing  them  through  loop- 
nduced  decays  into  fermion  pairs  have  already  been  studied  [7].  However,  there  are 
ilso  the  tree-level  decays  into  four-fermions  mediated  by  (real  or  virtual)  W  and  Z 
coupling  with  #5.  We  have  explicitly  calculated  such  decay  widths  using  methods 
iescribed  in  ref.  [10]  modified  appropriately.  The  decay  widths  into  this  channel 
'or  leptonic  final  states  (with  SH  —  1)  are  as  a  function  of  M//5  by  the  dashed  line  in 
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Figure  2.  The  different  possible  two  body  decay  widths  (//+  — »•  cs)at  loop  level  (dotted 
ine),  tree  level  (solid  line)  and  the  tree  level  leptonic  four  body  decay  width  T(H^  — *• 
W* Z*  — *•  4  leptons)  (dashed  line)  for  the  charged  Higgs.  Again  the  mixing  angles  are  put 
jqual  to  1  as  in  fig.  1.  The  Y(Hf  -+  cs)  (loop)  has  been  taken  from  ref.  [7].  The  four 
body  decay  width  is  summed  over  electrons  and  p's  in  the  final  state.  For  four  quark  final 
state  the  numbers  are  obtained  by  multiplying  the  dashed  figure  by  ~  35 

ig.  2.  Here  we  have  summed  over  the  muons  and  the  electrons  in  the  final  state.  For 
Diirposes  of  comparison  we  also  show  by  the  dotted  line  one  of  the  representative 
•esults  for  the  loop  induced  partial  width  T(H}  -*  cs)  taken  from  ref.  [7]  (also 
Jrawn  there  for  SH  —  1).  The  figure  clearly  shows  that  the  partial  decay  width 
?or  our  four-fermion  modes  is  of  similar  order  for  MHS  ~  50GeV,  and  completely 
lominate  for  higher  values  of  M/f5 .  Consequently,  the  branching  ratio  for  decays 
into  a  pair  of  weak  gauge-bosons  (WZ)  remains  close  to  100  per  cent  over  most  of 
Lhe  parameter  space.  The  4-lepton  (/i///'+/'~)  channel  then  has  a  healthy  branching 
ratio  of  ~  1 .4%  The  corresponding  signal  is  practically  free  from  standard  model 
backgrounds,  and  thus  it  should  be  considered  as  the  principal  technique  in  looking 
for  triplet  Higgs  bosons  of  this  type,  produced  in  e+e~  collider  experiments.  In 
addition,  with  appropriate  invariant  mass  cuts,  the  signal  with  four  jets  in  the  final 
states  can  be  used  increasing  the  useful  branching  ratio  to  ~  50%.  This  would 
increase  the  discovery  range  of  H^  even  further  since  the  width  into  four  quarks 
final  state  will  be  a  factor  ~  35  higher  than  the  four  lepton  channel.  Hence,  the 
discovery  range  at  LEP-200  (NLC)  gets  extended  to  125  (425)  GeV,  for  SH  =  1. 
For  the  case  of  model  1  (i.e.  H  +)  the  two-fermion  decay-modes  can  occur  at 


tree  level.  As  can  be  seen  from  T(H  +  — +  cs)  given  by  the  solid  line  in  fig.  2  (and 
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Figure  3.  The  different  possible  tree  level  decay  widths  (H  +  — - >  cs)  (dotted  line), 
W* Z*  — >•  4  quarks)  (dashed  line),  F(#  "*"  — *•  W* Z)  (thin  solid  line)  and 
tb)  (thick  solid  line)  for  the  charged  Higgs.  Again  the  mixing  angles  are  put 
equal  to  1  as  in  fig.  1.  m&  has  been  neglected. 

dotted  line  in  fig.  3)  this  decay  mode  will  dominate  over  the  four-lepton  decay 
mode  upto  MH<+  =  150  GeV.  However,  the  width  for  the  4-quark  decay  mode, 
shown  by  dashed  line  in  fig.  3,  will  be  comparable  to  the  two-quark  decay  mode  for 
MH>+  >  120  GeV. 

Another  way  of  detecting  a  triplet  scenario  in  e~*~e~  machines  is  to  look  for 
production  of  of  a  doubly  charged  scalar  via  processes  such  as  e+e~  — >•  l~  l~  H++ . 
As  mentioned  before,  these  involve  the  AL  =  2  interactions  which  are  not  very 
tightly  constrained  when  the  doublet-triplet  mixing  angle  is  small.  A  detailed 
investigation  is  currently  under  way. 
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Measurements  ofF^  in  7*  7  scattering  at  the  e+e~  colliders  PEP,  PETR  A,  TRIS- 
TAN and  LEP  [1]  have  by  now  yielded  a  lot  of  information  on  the  parton  content 
of  the  photon  over  a  wide  range  of  a;  and  Q2.  However,  these  measurements  give 
direct  information  only  about  the  quark  content  of  the  photon.  The  gluon  density 
9~*(x,Q'1)  is  poorly  determined  as  it  affects  Fj  only  through  the  QCD  evolution 
equations.  At  the  current  values  of  Q2  the  charm  quark  contribution  to  F^  is 
approximated  by  the  quark-parton-model  (QPM)  matrix  elements  for  the  process 
TT*  -+  cc  and  j*g^  -*  cc.  Through  the  latter  process,  the  effective  charm  content 
of  the  photon  becomes  sensitive  to  07(z,Q2).  At  larger  values  of  Q2,  cT(x,Q2) 
computed  using  the  massive  Altarelli-Parisi  (AP)  evoultion  equations,  is  consider- 
ably different  from  the  pure  QPM  predictions  [2].  A  study  of  the  charm  content 
of  the  photon  might  also  help  shed  some  light  on  the  correct  treatment  of  a  heavy 
parton  inside  a  target.  The  various  different  available  parametrisations  of  q"1(x,  Q2) 
and  gt(x,  Q2)  [3]  treat  the  charm  density  c7(r,(p2)  with  varying  amount  of  rigour 
and  care.  It  is  therefore  interesting  to  take  a  phenomenological  approach  and  think 
of  measurements  which  will  probe  ^(x^Q2)  directly  and  hence  perhaps  also  yield 
information  about  <77(£,Q2). 

One  possibility  is  to  study  production  of  single  charm  in  ep  collisions  via  the 
excitation  processes  (the  subprocesses  being  c7  +  qP  _+  c  -f-  q  an^  cT  +  9  ~  *  c  +  9> 
here  we  neglect  the  contribution  coming  from  charm  in  the  proton)  shown  in  fig.  1. 
This  will  give  rise  to  a  single  high-pT  charm  particle  whose  transverse  momentum 
is  balanced  by  a  light  q/g  jet.  Of  course  the  use  of  structure  functions  to  compute 
this  process  is  meaningful  only  for  large  values  of  the  pr  °f  tne  cnarm  quark. 
Admittedly  for  lower  values  of  pr  the  more  reliable  computation  will  be  that  of  the 
2  —  »•  3  subprocesses  (some  of  which  are  shown  in  fig.  2),  but  at  larger  values  of 
pr  the  structure  function  language  sums  up  the  large  log(/?7-/m?)  terms  and  hence 
is  more  accurate.  Another  contribution  to  the  inclusive  charm  signal  comes  from 
cc  pair  production,  via  the  'direct'  jg  fusion  subprocess  as  well  as  the  'resolved' 
processes,  where  the  balancing  high-pj.  jet  is  the  c  (c)  quark  jet. 

The  excitation  contribution  of  diagrams  in  fig.  1  is  given,  in  the  Weizsacker- 
Williams  (WW)  approximation,  by 


Figure  1.  Excitation  processes  for  single  charm  production  in  ep  collisions 


Figure  2.  Some  of  the  2  — »•  3  diagrams  giving  rise  to  single  charm  production 
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Figure  3.  CT(PT  >  J>r,mm)  (pb)  as  a  function  of  pT,m\n  for  the  excitation  as  well  as  cc 
production  for  different  combinations  of  proton  and  photon  structure  functions. 

the  result  is  easily  understood.  In  principle,  the  other  parametrisations  of  c7(x,  Q2) 
do  also  look  quite  different,  both  in  the  small  and  large  £7  region,  but  the  effective 
c— quark  content  of  the  electron,  which  involves  the  convolution  of  this  with  the 
WW  function  is  very  similar  in  the  end;  this  is  reflected  in  the  similarity  of  the 
predictions  using  the  LAC  and  WHIT  parametrizations  for  c*(x,Q2}.  This  can  be 
looked  upon  as  a  positive  point  in  that  the  size  of  the  excitation  contribution  to 
the  inclusive  charm  signal,  at  large  pr,  can  be  estimated  quite  reliably.  However, 
it  is  also  clear  that  one  needs  to  devise  kinematic  cuts  to  separate  the  excitation 
contribution  to  the  inclusive  charm  signal  from  that  due  to  cc  pair  production. 
(This  is  also  necessary  if  direct  cc  pair  production  is  to  be  used  to  study  the  gluon 
density  in  the  proton.) 

To  this  end,  we  next  study  the  kinematic  distributions  of  the  decay  muon  which 
is  used  to  tag  the  charm  in  the  final  state  and  also  that  of  the  balancing  ('away-side') 
jet.  In  this  calculation  we  include  fragmentation  of  the  c— quark  into  a  charmed 
meson  a  la  Peterson  fragmentation  function  [9]  with  the  parameter  e  as  given  in 
ref.[10],  and  use  the  value  0.1  for  the  semileptonic  branching  ratio  of  the  charmed 
hadrons.  Since,  for  our  pr  cut,  cc  production  is  dominated  by  the  'direct'  process, 
the  real  kinematical  difference  between  the  excitation  and  cc  contributions  to  the 
charm  signal  comes  from  the  fact  that  in  the  excitation  process  only  a  fraction  of 
the  7  energy  is  available  for  the  subprocess,  whereas  in  the  'direct'  process  all  of 
it  goes  into  the  subprocess.  As  a  result,  the  direct  process  on  the  whole  receives 
contributions  from  smaller  xp  values  than  the  excitation  process  does.  Hence  the 
c  jet  in  the  cc  case  will  have  much  more  negative  rapidity  than  the  'light'  (q/g) 
jet  in  the  excitation  process  (the  proton  direction  is  taken  as  positive  z  axis);  this 
is  very  similar  to  the  corresponding  situation  with  the  photoproduction  of  jets  [3]. 
On  the  other  hand,  the  rapidity  distributions  of  the  charm  quarks  produced  in 
both  the  excitation  and  pair  production  processes  are  very  similar  and  hence  those 
of  the  decay  muons  also.  The  kinematic  distribution  in  p%,  and  y^  therefore  are 
very  similar  for  both  contributions.  Fig.  4  shows  the  rapidity  distribution  of  the 
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Figure  4.  Rapidity  distribution  for  the  'away-side'  jet  for  the  excitation  and  pair  pro- 
duction mechanisms. 

jet  balancing  the  large  pr  charm,  with  the  WHITS  parametrisations  of  q*(x,Q2) 
and  </7(z,Q2)  and  MRSD-  for  the  proton  structure  function.  As  we  can  see  very 
clearly  from  the  figure,  a  cut  on  yjet  <  0.5  can  neatly  separate  the  excitation  and 
the  cc  contribution  from  each  other.  The  rates  presented  in  the  figure  include  the 
semileptonic  branching  ratio  of  the  charm  meson.  It  should  also  be  mentioned  that 
these  distributions  do  have  some  sensitivity  to  c7,  but  only  for  negative  values  of 
yjet  where  the  signal  is  dominated  by  the  cc  contribution.  The  figure  also  tells  us 
that  the  signal  is  healthy  even  after  these  cuts  and  hence  is  measurable.  For  a  clear 
signal  one  will  have  to  make  an  additional  cut  on  PJ.  as  well  but  that  will  affect 
both  the  excitation  and  the  pair  production  contribution  similarly. 

Recently  both  HI  and  ZEUS  have  reported  measurements  of  inclusive  charm 
production  at  HERA  [11].  ZEUS  reports  D*  production  with  pr(D*}  >  1.7  GeV 
and  |r/(D*)|  <  1.5  where  77  is  the  pseudorapidity  with  P2  <  4  GeV2  and  0.15  <  z  < 
0.86,  whereas  HI  reports  observation  of  events  with  a  hard  muon  with  PJ.  >  1.5  GeV 
and  30°  <  0(/i)  <  130°.  The  ZEUS  analysis  then  uses  this  measured  cross-section 
to  estimate  the  'total'  cc  cross-section  by  extrapolating  it  outside  the  measured 
region  and  then  compare  the  value  so  obtained  with  the  QCD  NLO  calculations. 
We  attempted  instead  to  reproduce  the  cross-sections  measured  by  ZEUS  and  HI 
by  using  our  LO  QCD  calculations.  Since  the  pr  cut  and  mc  are  comparable 
it  is  not  clear  whether  factorisation  of  the  production  and  fragmentation  of  the 
charm  quark  is  such  a  good  approximation.  On  the  other  hand  if  we  include  the 
fragmentation  of  the  c— quark  in  the  final  state  then  we  must  include  the  excitation 
contribution  which  corresponds  to  the  fragmentation  of  the  initial  state  photon  into 
charm.  We  therefore  run  our  Monte  Carlo  with  two  different  options:  In  one  case 
(A)  we  fragment  the  charm  using  the  Peterson  fragmentation  function  and  include 
the  excitation  contributions  whereas  in  the  other  case  (B)  we  do  not  include  the 
fragmentation  of  the  final  state  c  quark  and  drop  the  excitation  contributions  as 


comparing  with  ZEUS  results  we  include  a  factor  of  0.26  which  is  the  probability 
of  a  charm  quark  to  fragment  into  a  charged  D*  meson. 


Table  1.  The  D*  and  \i  cross-sections  measured  at  HERA  by  ZEUS  and  Hi,  compared 
with  the  LO  predictions  discussed  in  the  text. 


ZEUS  (£*) 

HI  GO 

Data:  32  ±  7+?  nb 

Data:  2.03  ±  0.43  ±  0.7  nb 

A:  Frag  and  c1  '.icluded 

A:  Frag  and  c1  included 

p  str.fn. 

7  str.  fn. 

<r(nb) 

p  str.fn. 

7  str.  fn. 

<r(nb) 

MRSD- 
MRSD- 
MRSDO 
MRSD- 

LAC1 
WHIT5 
WHITS 
DG 

27.4 
26.0 
21.9 
64 

MRSD- 
MRSD- 
MRSDO 
MRSD- 

LAC1 
WHITS 
WHITS 
DG 

1.5 
1.5 

1.48 
2.9 

B:  No  frag  and  c1  not  included 

B:  No  frag  and  c7  not  included 

p  str.fn. 

7  str.  fn. 

(r  (nb) 

p  str.fn. 

7  str.  fn. 

<r(nb) 

MRSD- 
MRSDO 
MRSD- 
MRSD- 

LAC1 
WHITS 
WHITS 
DG 

26.8 
15.4 
37.0 
33.4 

MRSD- 
MRSDO 
MRSD- 
MRSD- 

LAC1 
WHITS 
WHITS 
DG 

1.8 
1.5 
2.4 
2.3 

Table  1  gives  a  summary  of  our  calculations  along  with  the  results  reported 
by  the  two  experimental  groups.  We  find  that  for  the  ZEUS  data,  in  case  A  the 
results  become  less  sensitive  to  the  low- a;  behaviour  of  the  proton  structure  function 
and  the  excitation  contribution  actually  dominates.  In  this  case  both  the  MRSD- 
and  MRSDO  for  the  partons  in  the  proton  and  WHITS  or  LAC1  partons  in  the 
photon  reproduce  the  data  whereas  DG  predict  too  big  a  cross-section.  On  the 
other  hand,  in  case  B,  the  results  are  more  sensitive  to  the  low-x  behaviour  of 
the  parton  densities  and  we  find  that  we  can  reproduce  the  cross-section  only 
for  a  steeply  rising  gluon  density.  In  this  case  all  the  photonic  parton  densities 
combined  with  MRSD-  for  the  proton  are  acceptable  whereas  the  MRSDO  gives 
answers  smaller  by  a  factor  2  for  all  reasonable  choices  of  the  momentum  scale  as 
well  as  the  photonic  parton  densities.  For  the  HI  sample,  the  py  cut  means  that 
the  produced  charm  quark  has  much  higher  pr  than  for  ZEUS.  Again,  inclusion 
of  fragmentation  reduces  the  sensitivity  to  the  low-ar  behaviour  of  the  gluon  in  the 
proton  and  results  for  various  combinations  of  partons  in  the  photon  and  proton  are 
almost  the  same.  As  before,  the  DG  parametrisation  predicts  a  large  cross-section, 
2.9  nb,  but  it  is  not  inconsistent  with  the  data. 

In  conclusion  we  have  studied  the  contribution  to  the  inclusive  charm  signal 
from  the  excitation  of  charm  in  the  photon.  We  find  the  rates  to  be  comparable 
to  the  contribution  coming  from  cc  pair  production.  Due  to  the  convolution  with 
/y|e  the  sensitivity  of  crexe  of  eq.  1  to  the  region  of  large  z-y,  where  the  various 
parametrisations  for  c7(x,Q2)  differ  most,  is  reduced.  This  means  that  this  're- 
solved' background  to  the  large  pr,  inclusive  charm  signal  coming  from  the  'direct' 
process  can  be  predicted  quite  reliably.  Making  a  cut  on  the  associated  jet-rapidity 
allows  a  separation  of  these  two  contributions.  Note  that  charm  excitation  events 
should  contain  a  second  'spectator5  charm  (anti-)quark  in  the  photon  remnant, 


wmcn  nas  a  large,  negative  rapidity  ^ciose  to  me  electron  oeam  direction;,  in 
contrast,  cc  pair  production  events  should  contain  a  second  high— pr  charm  (anti- 
)quark.  In  both  cases  the  presence  of  a  second  charmed  particle  should  be  visible 
in  some  fraction  of  the  events;  this  could  be  used  as  a  cross-check  of  the  relative 
sizes  of  the  two  contributions  to  the  inclusive  charm  signal.  Even  after  all  the  cuts 
and  folding  of  the  cross-sections  with  various  branching  fractions,  the  rates  for 
inclusive  charm  production  are  large  and  easily  measurable.  Further,  a  LO  compu- 
tation including  the  excitation  contributions  gives  results  comparable  to  the  recent 
measurements  of  the  inclusive  charm  signal  by  ZEUS  and  HI.  On  the  theoretical 
side,  it  would  be  interesting  to  compare  our  results  with  explicit  2  — *  3  subprocess 
calulations  and  see  at  what  values  of  pr  do  they  match. 
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Abstract.  Working  group  II  at  WHEPP3  concentrated  on  issues  related  to  the  super- 
symmetric  standard  model  as  well  as  SUSY  GUTS  and  neutrino  properties.  The  projects 
identified  by  various  working  groups  as  well  as  progress  made  in  them  since  WHEPP3  are 
briefly  reviewed. 


Participants: 

K.S.  Babu,  B.  Brahmachari,  C.  Burgess,  G.  Datta,  S.  Goswami,  A.  Joshipura, 
A.  Kundu,  Mohan  Narayan,  S.  Nayak,  M.  V.  N.  Murthy,  M.  K.  Parida, 
G.  Rajasekaran,  S.  Rindani,  Probir  Roy  and  J.W.F.  Valle. 

Working  group  II  (WGII)  identified  definite  topics  each  of  which  was  intensively 
discussed  within  the  corresponding  subgroup  during  the  workshop.  Significant 
progress  was  made  in  some  of  them  and  some  of  the  projects  have  been  completed 

in  the  meantime.  The  following  is  the  list  of  the  projects  addressed  by  WGII: 

« 

•  Evolution  of  R  parity  violating  couplings  (B.  Brahmachari  and  P.  Roy) 
«  Beyond  S,  T  and  U  (A.  Kundu  and  P.  Roy) 

•  Neutrino  masses  and  proton  lifetime  in  SUSY  5O(10)  (K.S.  Babu,   U.K. 
Parida  and  G.  Rajasekaran) 

•  Degenerate  neutrinos(./if.5.  Babu,   C.  Burgess,  A.S.  Joshipura,  S.  Rindani, 
J.W.F.  Valie) 

•  Solar  and  atmospheric  neutrino  problems  with  three  generations  (G.  Datta, 
S.  Goswami,  A.  Joshipura,  M.  V.N.  Murthy,  Mohan  Narayan,  G.  Rajasekaran 
and  S.  Rindani  ) 

•  Magnetic  moments  for  heavy  neutrinos  (K.S.  Babu,  S.  N.  Nayak  and  P.  Roy 


• 


Extraction  of  neutrino  magnetic  moment  from  experiments  (M.  V.N.  Murthy, 
G.  Rajasekaran  and  S.  Rindani) 

Evolution  of  couplings  in  SUSY  LR  model  (B.  Brahmacharf) 

393 


Brahmachan  and  Roy  [Ij  studied  the  evolution  ot  the  baryon  number  ana  /t-paruy 
violating  Yukawa  couplings  in  the  supersymmetric  standard  model  and  derived 
bounds  on  them  from  the  requirement  of  perturbative  unitarity.  They  added  the  fol- 
lowing terms  to  the  superpotential  of  the  minimal  supersymmetric  standard  model 
(MSSM): 

£  =  %k(DfDjUi),  (1) 

where  UC,DC  denote  the  anti  quark  superfields  and  i,j,  k  are  generation  indices. 
These  terms  violate  both  R  parity  and  the  baryon  number.  Unlike  the  analogous 
lepton  number  violating  terms,  the  presence  of  the  above  terms  by  themselves  is 
not  significantly  constrained  from  low  energy  considerations.  Interesting  bounds 
on  these  couplings  can  nevertheless  be  obtained  by  requiring  that  all  the  Yukawa 
couplings  Y  remain  less  than  unity  till  the  grand  unification  scale  MU  ~  2  x  1016 
GeV  is  reached.  Assuming  only  A133  and  A2ss  to  be  large,  they  set  up  the  RG 
equations  for  the  relevant  couplings.  The  requirement  of  perturbative  unitarity 
was  shown  to  lead  to  an  upper  bound  in  the  range  0.5-0.6  on  the  baryon  number 
violating  Yukawa  couplings,  the  exact  value  being  dependent  on  the  top  quark  mass 
as  well  as  on  the  ratio  tan/?  of  the  Higgs  vevs.  It  was  also  shown,  that  the  fixed 
point  value  of  the  top  Yukawa  coupling  was  somewhat  reduced  compared  to  that 
in  the  MSSM  because  of  the  presence  of  the  additional  baryon  number  violating 
Yukawa  couplings. 

2.  Beyond  5,  T  and  U 

A.  Kundu  and  P.  Roy  examined  the  g2-expansion  approximation  of  Peskin  and 
Takeuchi  in  the  context  of  1-loop  oblique  electroweak  radiative  corrections.  They 
were  able  to  give  definitions  of  the  oblique  parameters  which  did  not  depend  on 
this  approximation  but  kept  their  symmetry  contents  intact.  In  this  respect  the 
disagreement  with  Burgess,  Makysmik  and  London  were  highlighted.  The  organiz- 
ing principle  behind  the  ?2-expansion  approximation  was  found  —  namely  that  it 
was  needed  in  calculating  the  Z-  and  W-wavefunction  renormalization  constants. 
Measurable  ratios,  where  these  were  eliminated,  could  be  expressed  [2]  in  terms  of 
5,  T  and  U  without  reference  to  this  approximate  pfo^dure.  For  other  obs^rvabtes, 
which  involved  these  renormalization  constants,  the  ^'-expansion  was  generalised 
upto  quadratic  terms.  The  new  oblique  parameters  Vt  W,  X,  Y  could  be  bounded 
[2]  experimentally. 

3.  Neutrino  masses  in  SUSY  50(10) 

K.  S.  Babu,  M.  K.  Parida  and  G.  Rajasekaran  looked  at  the  issue  of  obtaining 
neutrino  masses  in  the  experimentally  interesting  range  in  the  context  of  super- 
symmetric  5O(10)  models.  The  neutrino  masses  needed  for  solving  the  solar  neu- 
trino problem  arise  naturally  in  and  50(10)  if  the  Majorana  masses  of  the  right 
handed  neutrinos  are  in  the  intermediate  range  ~  1010  GeV  [3].  The  generation 
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of  such  masses  through  the  vacuum  expectation  values  of  the  chargeless  scalars  in 
the  126  +  126  representation  in  SUSY  50(10)  models  requires  [4]  some  assump- 
tions of  the  extended  survival  hypothesis.  The  aim  of  the  project  was  to  provide 
an  alternative  mechanism  for  generating  the  right  handed  neutrino  masses  in  the 
intermediate  energy  range. 

The  following  breaking  chain  was  considered: 

50(10)— +  G/— >£SM,  (2) 

where  G/  =  5C7(3)C  x  SU(2)L  x  SE7(2)*  x  U(l)B-L  or  577 (4)e  x  577(2)L  x  517(2)* 
and  was  assumed  to  break  at  a  scale  M/  ~  1014GeV.  Although  the  representation 
126  + 126  was  present  it  did  not  acquire  a  vev.  The  right  handed  neutrino  masses 
were  induced  by  the  presence  of  the  16+16  representation  to  be 

*r1 

(3) 

This  could  be  significantly  lower  than  the  value  MNR  ~  MU  ~  1016  GeV  permitted 
in  a  single  step  breaking. 

4.  Degenerate  neutrinos 

It  has  recently  been  realized  [5,  6]  that  simultaneous  solutions  of  the  solar  and 
atmospheric  neutrino  deficits  as  well  as  of  the  dark  matter  problem  with  a  hot 
component  of  about  30%  require  almost  degenerate  masses  for  the  three  neutrinos. 
Such  a  spectrum  was  shown  to  arise  in  a  natural  manner  in  left  right  symmetric 
models  augmented  with  a  suitable  generation  symmetry  [5,  6].  The  aim  of  the 
working  group  was  to  discuss  issues  related  to  the  construction  of  realistic  grand 
unified  models  following  the  scenario  proposed  in  refs.  [5,  6].  In  particular  one 
should  obtain  (a)  the  common  degenerate  mass  in  the  eV  range  (b)  mass  splittings 
appropriate  for  the  solar  and  the  atmospheric  neutrino  problems  and  (c)  the  right 
mixing  pattern.  The  required  mass  splitting  arise  naturally  [6]  if  the  Dirac  masses 
for  the  neutrinos  coincide  with  the  up  quark  masses  as  in  the  simplest  5O(10).  In 
this  case,  one  obtains 

|A2i|  „ 


This  nicely  reproduces  the  hierarchy  required  to  simultaneously  solve  the  solar 
and  atmospheric  neutrino  problems.  The  problem  to  be  addressed  was  to  obtain 
this  prediction  in  a  complete  model  based  on  S0(10)  preserving  other  successful 
features. 

While  a  complete  model  is  still  lacking,  significant  progress  was  made  by  the 
members  of  the  working  group  [8,  7,  10]  as  well  as  others  [9]  in  the  construction  of 
realistic  models.  In  particular,  Valle  and  loannissyan  constructed  a  model  based 
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proposed  by  Caldwell  and  Mohapatra  [9].  Bamert  and  Burgess  worked  out  a  sce- 
nario which  contained  a  singlet  fermion  in  addition  to  the  three  left  and  right  handed 
neutrinos.  A  horizontal  SU(2)  symmetry  was  introduced  to  obtain  the  degenerate 
spectrum.  The  couplings  involving  the  singlet  fermion  break  the  horizontal  sym- 
metry and  lead  to  a  departure  from  the  degeneracy  in  neutrino  masses.  The  singlet 
fermion  was  moreover  used  in  the  context  of  the  left  right  symmetric  theory  [10]  in 
order  to  understand  the  difference  between  the  quark  and  leptonic  mixing  angles  in 
scenarios  with  almost  degenerate  neutrinos.  The  singlet  also  played  a  crucial  role 
in  generating  the  required  mass  pattern  among  neutrinos  in  this  scenario. 

5.  Solar  and  atmospheric  neutrino  problems  with  three  generations 

The  understanding  of  the  solar  and  atmospheric  neutrino  deficits  in  terms  of 
neutrino  oscillations  seems  to  require  two  vastly  different  values  for  the  (mass)2 
difference  among  neutrinos.  Thus  at  least  two  neutrinos  need  to  be  massive  and 
analysis  of  the  solar  and  atmospheric  neutrino  data  in  terms  of  three  generations 
becomes  interesting.  Such  an  analysis  was  carried  out  earlier  [11,  12]  assuming 
the  MSW  mechanism  to  be  responsible  for  the  solar  neutrino  conversion.  This 
working  group  looked  at  a  complimentary  scenario  in  which  two  of  the  neutrinos 
were  assumed  to  be  almost  degenerate  with  very  small  (mass)2  difference  ~  10~10 
(eV)2  while  the  other  (mass)2  difference  was  assumed  to  be  in  the  range  ~  10" 2  — 
10~3  (eV)2.  Thus  the  vacuum  oscillations  are  responsible  for  both  the  solar  and 
the  atmospheric  neutrino  deficit.  Since  two  of  the  relevant  (mass)2  differences 
show  hierarchy,  the  oscillation  probabilities  involve  only  one  more  mixing  angle 
compared  to  the  case  of  two  generations  [11,  12].  Fixing  this  mixing  angle  (0^)  to 
be  in  the  range  appropriate  for  the  atmospheric  neutrino  problem,  restrictions  on 
other  mixing  angle  (namely  Oefi  )  and  the  (mass)2  difference  AeA,  were  determined 
from  the  data  on  solar  neutrino  deficit. 

6.  Neutrino  magnetic  moment 

Two  different  problems  were  analyzed  in  connection  with  the  neutrino  magnetic 
moment.  One  was  the  issue  of  a  large  magnetic  moment  of  a  very  heavy  neutrino. 
Since  the  magnetic  moment  of  fermion  turns  out  to  be  proportional  to  its  mass  in 
a  number  of  situations,  it  is  interesting  to  ask  if  the  magnetic  moments  of  heavy 
singlet  neutrinos  can  be  large  enough  to  dominate  over  their  point  couplings  to  W 
and  Z  induced  by  mixing  with  the  light  neutrinos.  The  typical  magnetic  moment  of 
a  very  heavy  right  handed  neutrino  N  was  estimated  from  the  one-loop  graph  and 
the  mass-dependence  was  seen  to  come  through  the  factor  mjr,Mjv(M^  + 
where  the  W  couples  to  t  and  N,  so  that  there  was  no  enhancement  for  MN 
Thus  it  was  found  that,  contrary  to  naive  expectation,  the  point  couplings  always 
dominated  over  the  magnetic  moment  couplings. 

The  conventional  procedure  of  extracting  information  on  the  neutrino  magnetic 
moment  coupling  from  the  data  on  v  e  scattering  was  questioned.  In  order  to  ex- 
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tering.  An  analogous  treatment  of  the  e  —  p  scattering  has  been  shown  to  lead  to 
a  drastic  overestimation  of  the  QED  radiative  corrections  [13].  By  the  same  token, 
the  inclusion  of  the  neutrino  magnetic  moment  term  through  the  Pauli  term  must 
lead  to  wrong  results  at  some  energy  scale.  The  main  issue  was  to  determine  the 
relevant  scale  where  the  Pauli  approximation  breaks  down.  The  suggestion  was  to 
do  a  detailed  calculation  of  i/e  scattering  in  specific  model  which  leads  to  large  mag- 
netic moment  and  compare  it  with  the  phenomenological  result  obtained  assuming 
the  Pauli  term  as  is  conventionally  done. 

7.  Evolution  of  couplings  in  SUSY  LR  model 

B.  Brahmachari  studied  the  1-loop  evolution  of  Yukawa  copulings  in  the  minimal 
supersymmetric  left-right  model.  He  found  [14]  a  fixed  point  behaviour  in  the  top 
Yukawa  coupling  that  was  rather  analogous  to  the  one  one  in  the  MSSM.  He  was 
able  to  explicitly  exhibit  the  dependence  of  the  fixed  point  solution  of  Yt(mt)  on 
the  right-symmetry  breaking  scale.  The  predicted  top  mass  value  in  this  scheme 
was  between  168  and  174  GeV.  Brahmachari  was  also  able  to  fix  the  value  of  the 
Majorana  Yukawa  coupling  which  is  otherwise  a  free  parameter. 
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SOURENDU  GUPTA 

Theory  Group,  T.  I.  F.  R.,  Bombay  400  005. 

Participants: 

Rahul  Basu,  Avijit  Ganguly,  Rajiv  Gavai,  Rohini  Godbole,  Sourendu  Gupta, 
N.D.Haridass,  Prakash  Mathews,  Hiranmay  Mishra,  J.  Pasupathy,  J.  C.  Parikh, 
Hiranmay  Mishra,  J.  Pasupathy,  J.  C.  Parikh,  V.  Ravindran,  Helmut  Satz  and 

Vikram  Soni. 

This  working  group  discussed  several  aspects  of  heavy  ion  physics.  Several 
projects  were  found  to  be  interesting  enough  to  pursue  further.  The  specific  projects 
chosen  for  further  work  are  listed  below. 

1.  Braaten-Pisarski  Resummation:  Hiranmay  Mishra  and  Avijit  Ganguly  will 
try  to  use  Braaten-Pisarski  resummation  to  compute  the  pressure  in  pure 
QCD  at  three  loop  order.  There  is  some  interest  in  computing  the  pressure 
to  as  high  an  order  in  perturbation  theory  as  possible,  so  that  the  result  can 
be  compared  with  lattice  gauge  theory  measurements. 

2.  J/V>  Transverse  Momentum:  R.  M.  Godbole,  Prakash  Mathews,  V.  Ravin- 
dran, S.  Gupta  and  R.  V.  Gavai  have  started  a  project  to  use  resummed 
perturbative  QCD  in  order  to  predict  the  transverse  momentum  distribution 
of  J/t/}  and  T  in  the  heavy-ion  collider  mode  of  LHC.  Deviations  from  such 
predictions  are  among  the  signals  of  the  plasma. 

3.  Disoriented  Chiral  Condensates:   N.  D.  Haridass  outlined  a  computation  of 
the  critical  exponents  of  the  O(4)  sigma  model  in  three  dimension.  R.  Basu 
and  S.  Gupta  plan  to  compute  the  dynamics  of  disoriented  condensates  after 
quenching.    This  has  been  suggested  recently  as  a  mechnism  for  Centauro 
events. 
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S.  UMA  SANKAR 

Institute  of  Mathematical  Sciences,  Madras  600  113. 

Participants: 

Tariq  Aziz,  Amitava  Datta,  N.  G.  Deshpande,  P.  H.  Frampton,  U-G.  Meissner, 
J.  Pasupathy,  Sreerup  Raychaudhury,  L.  M.  Sehgal,  Gurpreet  Kaur  Sidana, 
S.  Uma  Sankar 

This  working  group  discussed  several  topics  of  heavy  flavour  physics.  Some  of 
them  ace  listed  below. 

1.  Tariq  Aziz  gave  a  talk  on  the  decay  time  measurements  of  the  Bd  and  B9 
mesons  at  LEP.  These  measurements  can  be  used  to  determine  AmBd  and 
Amp. .  The  ratio  of  these  mass  differences  can  provide  a  sensitive  test  of  the 
three  generational  unitarity  of  CKM  matrix. 

2.  Ulf-G.  Meissner  gave  a  talk  on  the  application  of  the  techniques  of  chiral 
perturbation  theory  to  the  non-leptonic  decays  of  kaons. 

3.  L.  M.  Sehgal  gave  a  talk  on  the  issue  of  final  state  interactions  in  the  non- 
leptonic  B  meson  decays.   In  the  two  pion  decay  of  neutral  kaons,  the  final 
state  contains  only  two  isospin  states  which  can  interact  with  each  other  via 
strong  interactions.   Therefore  these  decays  can  simply  be  parametrized  by 
two  amplitudes  and  two  phases.    But  in  B  meson  decays,  a  large  number 
of  intermediate  non-leptonic  states,  which  can  interact  with  one  another  via 
strong  interactions,  are  kinematically  accessible.  It  is  impossible  to  take  into 
account  all  of  them.  It  is  desirable  to  develop  ways  of  parametrizing  how  the 
final  state  interactions  (FSIs)  affect  the  two  meson  decays  of  the  B  mesons. 
This  will  be  extremely  useful  in  CP  violation  measurements  in  B  mesons. 

4.  J.  Pasupathy  gave  a  talk  on  the  relation  between  QCD  sum  rules  and  the 
Isgur-Wise  Heavy  Quark  symmetry. 

5.  Sreerup  Raychaudhury  gave  a  talk  on  a  four  generational  model  in  which 
one  can  make  the  fermions  of  the  first  three  generations  light  without  much 
fine-tuning.  The  fourth  generation,  however,  will  be  very  massive. 
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